Edward Hu edwhu

Hi, I'm Edward. Currently, I'm interested in developing data-driven methods that interact with, explore, and learn from the world. My research investigates deep reinforcement learning, perception, and robotics.

🤖 Research Artifacts

Here are the codebases of my research projects so far.


bst		The Belief State Transformer (ICLR'25)
scaffolder		Privileged Sensing Scaffolds RL (ICLR'24 Spotlight)
planning goals for exploration		Planning Goals for Exploration (ICLR'23 Spotlight)
interactive reward functions		Training Robots to Evaluate Robots (CoRL'22 Best Paper Award)
robot aware control		Know Thyself: Transferable Visual Control Policies Through Robot-Awareness (ICLR'22)
ikea furniture simulation		IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks (ICRA'21)

Nerd stuff

I like studying codebases that are elegantly written and do cool things. Some topics I have found interesting lately: data loading at scale, a JAX renderer, and a JAX Monte Carlo Tree Search library.
Some of my side projects:


gpt2 slackbot		Chatting with GPT-2 in Slack
optical illusion		A cool optical illusion

Something to think about as an ML researcher

I think researchers start out very pure-hearted, but can easily end up misled and lost. The incentives of the modern research community, particularly in ML, are misaligned with doing good science. To employ an analogy, ML is currently like a hackathon. You are incentivized to put together an MVP that works just well enough to pass the appraisal of the judges. You feel obligated to use the shiny new "X" because it will garner public attention. Companies with free t-shirts and kickbacks swarm around you.

Yes, some of these things are unavoidable. But if you blindly follow the noise, you may end up in the eye of the storm: at a standstill, with no exit in sight.

Don't be distracted by the noise!
