Jue Wang
juewang
AI & ML interests
None yet
Organizations
juewang's activity
Context length?
10
#2 opened 10 months ago
by
turboderp
Missing files?
#1 opened about 1 year ago
by
juewang
Correct the output dtype of rmsnorm_func
2
#13 opened over 1 year ago
by
ag0
how to fine tune peft qlora and SFTTrainer?
12
#2 opened over 1 year ago
by
NickyNicky
Poor performance?
4
#6 opened over 1 year ago
by
Fionn
Can you help me fine-tune this with LoRA? (Having an error)
1
#12 opened over 1 year ago
by
AayushShah
What kind of machine would be suitable for this model (in amazon sagemaker)?
5
#7 opened over 1 year ago
by
juusohugs
Will it be possible to run this on PC with 8 GeForce RTX 3060 with 8 Gb VRAM each?
2
#11 opened over 1 year ago
by
ai2p
Any way to set the "stop, split by" when running the model locally?
4
#26 opened over 1 year ago
by
johnnyracer
Issue with loading model to GPU when using pipeline
2
#5 opened over 1 year ago
by
AlpYu-HubX
Is it a wrong prompt?
4
#8 opened over 1 year ago
by
tatyanavidrevich
Feature requests and suggestions for V2
9
#4 opened almost 2 years ago
by
zhangce
use accelerate to load model
1
#4 opened over 1 year ago
by
adolf669
This model requires A LOT of resources... But how much? Trying to build a chatbot
9
#3 opened over 1 year ago
by
joanfmendo
Generated Text have issues
10
#22 opened almost 2 years ago
by
asifahmed
Is UL2 used?
1
#2 opened over 1 year ago
by
JunnanLi
Question-Answering over documents
3
#19 opened almost 2 years ago
by
tmishinev
Confused about bidirectional attention when implementing custom sampling loop
2
#25 opened over 1 year ago
by
ericanthonymitchell
Model behavior during adaptation phase
2
#24 opened almost 2 years ago
by
jlli
Fine Tuning // Download Full Weights
2
#23 opened almost 2 years ago
by
idop11