Tuesday, December 09, 2025

Train the robot to cross the road.

 Train the robot in the following scenario to cross the road.

scenario setting artifact




Robot Walk RL

 Robot Walk RL Gemini 3 Pro

Robot Walk RL, Opus 4.5 (share)



Simulating BLUE LED

  Blue LED



Simulating Laser

 Laser (2D simulation of energy pumping)




Laser (3D simulation) using a 3D quantum field

share (He-Ne)



Verification of AI

  verify geo modeling

HW#12 Reinforcement Learning (RL) 2

 This homework is optional. If you do it, choose any one of the problems below.


1. Urban traffic simulation: how to control the traffic signals to increase throughput (RL)

 (credit: co-created with project students)






2. Use RL to train a submarine to evade unmanned torpedoes that try to attach to and attack it


hi accomp share (artifact)


3. Use RL to optimize a manufacturing process so that it overcomes a failure, artifact (share) (inspired by Edward Chang)






4. Use RL to train inverted pendulum



5. Train the robot in the following scenario to cross the road.

scenario setting artifact









6. Use RL to train submarine maneuvers

Job Shop RL

Visualization of Manufacturing Process,  simulation of pipelines (inspired by Edward Chang)





 

  Job Shop











RL (Reinforcement Learning)


Protein Folding artifact  html
highly accomplish it artifact improved on student work
2 highly accomplish it artifact improved on student work

Inverted Pendulum (RL)

 



Saturday, December 06, 2025

RL for Submarine Tactical Operation

 




RL for Submarine Maneuvers

 Use RL to train a submarine to evade unmanned torpedoes that try to attach to and attack it


hi accomp share (artifact)

RL for ChungLi Traffic 400 tiers

 Urban traffic simulation: how to control the traffic signals to increase throughput (RL)

 (credit: co-created with project students)



🚦 RL Traffic Control — Train, Save, Compare

Agents persist across sessions • Continue training anytime

🎮 Training (1747 episodes)

Episodes: 1747
Exploration: 5%
Q-Table: 1358079
Avg R (10): -303.5
Learning Curve
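
The panel labels (Q-Table, Exploration, Avg R (10)) suggest a tabular Q-learning agent with epsilon-greedy exploration, although the page does not show the implementation. The following is a minimal sketch under that assumption. The action set, the learning rate `alpha`, the discount `gamma`, and the names `QAgent` and `average_recent` are all hypothetical; only the 5% exploration rate and the 10-episode averaging window come from the panel.

```python
import random
from collections import defaultdict

# Hypothetical action set: 0 = keep the current signal phase, 1 = switch phase.
ACTIONS = (0, 1)


class QAgent:
    """Tabular Q-learning agent with epsilon-greedy exploration (illustrative)."""

    def __init__(self, epsilon=0.05, alpha=0.1, gamma=0.95, seed=0):
        self.q = defaultdict(float)  # Q-table keyed by (state, action), default 0.0
        self.epsilon = epsilon       # exploration rate (5% as shown in the panel)
        self.alpha = alpha           # learning rate (assumed value)
        self.gamma = gamma           # discount factor (assumed value)
        self.rng = random.Random(seed)

    def act(self, state):
        # Explore with probability epsilon, otherwise take the greedy action.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state):
        # One-step Q-learning: move Q(s, a) toward r + gamma * max_a' Q(s', a').
        best_next = max(self.q[(next_state, a)] for a in ACTIONS)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


def average_recent(rewards, window=10):
    """Mean of the last `window` episode rewards, like the 'Avg R (10)' readout."""
    recent = list(rewards)[-window:]
    return sum(recent) / len(recent) if recent else 0.0
```

In a training loop, each simulation step would call `act`, apply the chosen action to the traffic simulator, and then call `update` with the observed reward. The state encoding (for example, queue lengths per approach) is left open here because the page does not specify it.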


📊 Comparison Complete

Metric          ❌ Untrained    ✅ Trained (1688 eps)
Total Reward    -359.8          -158.8 (+56%)
Throughput      17              21
Avg Wait        44.3            29.2
✓ Learning Verified! Trained agent outperformed random.
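
The reported +56% can be checked from the two total-reward figures, assuming it is the change relative to the magnitude of the untrained value. The short sketch below applies the same formula to throughput and average wait. The helper name `relative_change` is hypothetical, and the page itself reports a percentage only for total reward.

```python
def relative_change(before, after):
    """Change from `before` to `after` as a fraction of |before|."""
    return (after - before) / abs(before)


# Figures taken from the comparison panel (untrained vs. trained agent).
reward_gain = relative_change(-359.8, -158.8)
throughput_gain = relative_change(17, 21)
wait_change = relative_change(44.3, 29.2)

print(f"Total reward: {reward_gain:+.0%}")     # +56%
print(f"Throughput:   {throughput_gain:+.0%}")  # +24%
print(f"Avg wait:     {wait_change:+.0%}")      # -34%
```

Under this reading, the trained agent moves about 24% more vehicles and cuts the average wait by about 34%, which is consistent with the "Learning Verified" message.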