PanoEvJ
AI & ML interests
None yet
Organizations
PanoEvJ/T5_summarization_RLAIF
PanoEvJ/summarization_finetuned_t5_base_4bit
PanoEvJ/T5_base_SFT_summarization
PanoEvJ/summarization_t5_base_4bit
Updated
PanoEvJ/output
Updated
PanoEvJ/instruct-tuned-llama-7b-hf-alpaca_gpt4_5_000_samples
Text Generation
•
Updated
•
3
PanoEvJ/gpt2-detoxified-RLAIF
Text Generation
•
Updated
•
11
PanoEvJ/gpt2-severe-detox-RLAIF
Text Generation
•
Updated
•
8
PanoEvJ/gpt2-severe-detox-RLAIF-with-rewards
Updated
PanoEvJ/repo
Text Generation
•
Updated
•
8
PanoEvJ/gpt2-detox-temp
Text Generation
•
Updated
•
8
PanoEvJ/lunarlander-ppo-custom
Reinforcement Learning
•
Updated
PanoEvJ/BLOOMZ-3b-marketmail-ai-finetuned
Updated
PanoEvJ/Bert-Classifier-News-Articles
Text Classification
•
Updated
•
2
PanoEvJ/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
•
2
PanoEvJ/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
3
PanoEvJ/GenAI-CoverLetter
Text Generation
•
Updated
•
2
•
1
PanoEvJ/PyramidsRND
Reinforcement Learning
•
Updated
•
9
PanoEvJ/ppo-SnowballTarget-mlagents
Reinforcement Learning
•
Updated
•
9
PanoEvJ/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
PanoEvJ/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
PanoEvJ/SpaceInvadersNoFrameskip
Reinforcement Learning
•
Updated
•
4
PanoEvJ/q-Taxi-v3
Reinforcement Learning
•
Updated
PanoEvJ/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
PanoEvJ/ppo-LunarLander-OG
Reinforcement Learning
•
Updated
•
2
PanoEvJ/ppo-LunarLander-v2-reloaded
Reinforcement Learning
•
Updated
•
2