That's a nice piece of motor engineering. It's well known that high ratio gearboxes for robots are a headache. Back driveability doesn't work, and tiny teeth are fragile. Comments on this go all the way back to Feynman writing about his time spent engineering automatic gunnery aiming systems in WWII.
This new discovery is that gearbox problems mess up a machine learning system. It's trying to track gearbox noise and is using up all its learning capacity on that.
This discovery means that robotics people can tap machine learning funding for motor and gearbox development. Robotics labs used to be really low-budget operations. No longer.
What you really want is a direct drive motor, but those have to be large-diameter. They can be flat; that's a pancake motor. That's too large for fingers. So their compromise moves partly in that direction; the rotor is flatter, torques are higher, speeds are slower, and gearbox ratios are lower. As they point out, reflected inertia is the square of the gear ratio, because the gear ratio gets you both going out and coming back. So this is a bigger than linear win.
Good back-drivabiilty means much less risk of gear breakage on overload. Some of the academic designs, such as harmonic drives and series elastic actuators, have huge gear ratios in a small space. That's OK for prototypes but not production. As I've mentioned before, "you cannot strip the teeth of a magnetic field", a line from a GE electric locomotive salesman around 1900. If an overload forces a motor backwards, nothing breaks.
Would have been nice to hear more about the motor design. That's the real achievement here. There are CAD tools which understand electromagnetic fields now, so strange motor geometries are not as much of a trial and error and experience process as it once was. It's also respectable for an EE to work on rotating machinery again. That field matured around the 1960s, and until computers took over motor control, didn't change much.
AceJohnny2 1 days ago [-]
(I am not a mechanical engineer) Are capstan drives a solution to some of these gearing problems?
Animats 1 days ago [-]
They don't back-drive well. The whole point of this hand design is to back-drive the contact forces into the motor, where there's force control. They're somewhat bulky, too.
Key concept: force-based motor control works quite well. Preserve that property through the gear train and force-based hand control works.
LoganDark 1 days ago [-]
> They don't back-drive well
What? An ideal capstan drive can be backdriven perfectly fine. You only run into problems once it stops being ideal (e.g. built out of heavy parts, high gear ratio, etc.)
Animats 1 days ago [-]
It's the high reduction ratio that's the problem. If you build a 200:1 capstan, it's not going to back drive well. And it won't be anywhere near ideal.
LoganDark 16 hours ago [-]
That falls under both of the examples I just listed, so yes.
rcxdude 20 hours ago [-]
capstan drives help with backlash (though they are in generally less stiff than gears, so it's not without tradeoffs), but they don't really help with any of the other issues.
soco 24 hours ago [-]
Why is a pancake motor too large for fingers? Human motors aren't placed on the fingers either...
Animats 23 hours ago [-]
This is what a pancake motor looks like.[1] Thin and flat. Wrong form factor for fitting into a wrist.
Strings ("tendons") have their own problems. Wear at bearing points, mostly. You need an opposed pair, too. So there's a motor, a winding drums, and something like a spring to maintain tension. Or two motors and two winding drums, which gives better control.
That's why snake robots have never caught on. They're good painting robots, because they can get into tight spots. But repetitive cycling wears the tendons at the same place on each cycle.
There was a good application - Tesla's snake charging robot.[2] All the complicated stuff is in the pedestal, and you can surround that with concrete to prevent damage if it's hit by a car. The snake part has no motors, just discs and strings, plus the connector and probably a cell phone camera at the end. It can be made easily replaceable, like a gas station hose.
That loses some of the pictures. Especially the one of their custom motor, which is the new development.
dang 24 hours ago [-]
Thanks, we've added that to the toptext also.
ortusdux 1 days ago [-]
Website down? "This deployment is temporarily paused"
eyeino 1 days ago [-]
Nothing scarier than a Vercel bill.
belviewreview 20 hours ago [-]
The basic problem is that human hands are made of flesh, while robot hands are made of artificial materials like steel and carbon fiber composites. Flesh is a million times more complex, starting at the molecular level, and so it can do all sorts of things that artificial materials cannot.
Compare for instance a jetliner and a bird. The jetliner can fly fast and carry a lot of cargo or human beings. The bird is a lot smaller and slower, but it can do all sorts of things a jetliner cannot. This includes growing itself from a single fertilized egg, self-repair, defense against attacks of various types, finding and getting food, building its own shelter, and reproduction. All because it is made of biological cells. And note that for a jetliner, the similar tasks are carried out by human beings also made of biological cells.
I think a similar analysis explains why artificial general intelligence is impossible.
amelius 1 days ago [-]
What is the power draw of these joints?
ACCount37 1 days ago [-]
You can redesign every single actuator, sure. Or you can design a wrist camera and use it to close the loop. Or just run enough imitation learning + "RL IRL" and fold the imperfect real world actuators into the policy implicitly.
Robotics doesn't have a single silver bullet - the design space is vast and underexplored.
Surgical robots, and robot pianos both exist. Neither employ humanoid hands. This all just illustrates how humanoid robots are, in multiple dimensions, going down technology rat holes. In some cases better solutions already exist without looking humanoid. In other cases, the humanoid form factor fails to address problems like a high center of gravity in a device that needs to not fall on grandma while helping her around the house.
I continue to be amazed that the wrong form factor keeps being pursued. Though I suppose I shouldn't be too surprised given the parade of failed "AI devices."
program_whiz 1 days ago [-]
I think one major draw to human-like for factors is the reuse of existing ecosystems and tools. If you have human-like grasping, you can reuse tools and utensils for human hands, otherwise, you need custom attachments. If you have human-like legs you can navigate stairs, wear pants for customization, and possibly operate a car or bike.
Its a bit like choosing JS / python -- of course performance is inferior to a compiled language with highly tailored code, but they are flexible and have an ecosystem that might do 99% of the lifting for you.
But in isolation, I agree with your idea that specialized robots with form fitted specifically to task will likely outperform a more generalized solution in a specific domain of behavior, the more generalized will likely outperform in flexibility and reusability (e.g. capable of reusing the human ecosystem).
Zigurd 9 hours ago [-]
> and possibly operate a car or bike.
This is exactly the example I used to show the ridiculousness of humanoid robots. You're not going to have a humanoid form climb into the driver seat of your car and act as your chauffeur. Your car will be the robot.
Just how ridiculous is a humanoid robot chauffeur? Should know how to drive a stick shift? Should it use its superior ability to swivel its head around to check for cars in the blind spot?
Someday business schools will have a chapter in their product management course that covers all the ways people slide ass backwards into thinking a humanoid robot makes sense. Humanoid robots will always be crushed between the rock of low value use cases below their price points the hard place of high value use cases that deserve tailored solutions.
nkrisc 1 days ago [-]
I think it’s less about tools and more about the spaces that humans operate in.
You don’t need a human-like hand to hold a tool made for humans. As an extreme example, you can make a robot operate a power drill with strap to hold it and a servo with a small bit of wood to operate the trigger mechanism.
But for a robot operating in a space made for humans there certainly are some physical requirements which are based on the human form: maximum volume and clearances, stairs, fragile fixtures that can’t be operated with too much force, etc.
Ever walk through some over-crowded antique shop where you need to twist and lean your body to avoid knocking into thing?
amluto 1 days ago [-]
There are a whole lot of tools intended for human use that I would use much more effectively if I could rotate my wrist repeatedly in the same direction.
mikepurvis 1 days ago [-]
But that's a superset of human functionality, aka even worse.
numpad0 1 days ago [-]
Reuse tools to do what? Touch type on a mechanical keyboard?
What makes human hands especially suitable for e.g. assembling a phone or installing a door handle onto a car?
tryauuum 1 days ago [-]
> Touch type on a mechanical keyboard
yes. do you think it's safe to just plug usb into some hole and type? the safest option for a robot is typing with fingers
beau_g 1 days ago [-]
Many overactuated, purpose built robots (like surgical robots and pianos) exist, and have existed since the Unimate, and work great in certain situations. The problem with all of them is they are very expensive, often extremely large, and single purpose or very narrow purpose (and even if they are narrowly multipurpose, require tons of setup to get to work for each job they are intended to do).
I personally am not bullish on 1:1 human hands either, but IMO the question shouldn't be $100k 2 ton Kuka arm vs biped with hands, it's overactuated robotics (build it from the floor with hard coded operations) vs underactuated (build it from the contact point of the work backwards with ML and sensors). We shall see which form factors prevail, but the type of robotics development posted here seems like the way forwards regardless, an ecosystem of small, power dense, reliable, accurate QDD actuators will lead to many general purpose robot applications. I recognize I am not using underactuated vs overactuated in their strict definition here but if you are familiar with robots I think you'll understand where I am coming from as far as a robot design ethos.
I will say though in designing robots of this type without necessarily being bound by trying to make a robot look like a human, I have often found myself accidentally recreating human arm DOF in a round trip way, it does just end up being well packaged beyond the "world designed for humans" talking point. Maybe hands will end up being a similar situation.
leonflexo 1 days ago [-]
Similar to how we are seeing LLMs shoved into spaces where existing ML was already doing well and better suited.
Not to dismiss the value of LLMs in those cases as an interface/interpretation layer.
If grandma goes into the windowless surgery factory, I just want the best bots working on her. There is value in having Dr. Bot the replicant give me the face-to-face status updates. We are not breaking out those layers as much, anymore, as the focus becomes minimizing FOMO.
haarts 1 days ago [-]
You are right. If the hand is doing a specific task, better morphologies are likely. But that's not always desirable. The canonical example is of course the household. I don't want X robots, I want 1. And I don't want to change anything. Robot hand!
fatbird 1 days ago [-]
Not to mention that the world is very widely designed to be manipulated by hands: doorknobs, handles, container sizes. A unique door opening appendage isn't going to do much good around your house.
amluto 1 days ago [-]
A humanoid human will fall over too if pushed into a sufficiently awkward corner. It’s a fundamental problem with things that aren’t statically stable and need active stabilization.
makeramen 1 days ago [-]
I see it as trying to apply the bitter lesson to robotics. Specialized robots will always have their place, but humanoid ones can take advantage of all the design interfaces that already exist in the world for humans.
Similar to how claude code gained so much traction in terminal by just leveraging the command line interface that already exists for humans, no need to invent a domain specific MCP to just run shell commands.
I agree with you that it's far from the most efficient approach for specific tasks. But the analogy would be that you also generally don't want to use LLMs to do something you can "just" write a script for... that doesn't make LLMs useless though.
analognoise 1 days ago [-]
It’s not X. It’s Y.
Multiple times, over and over.
We need to stop with the AI stuff.
furyofantares 1 days ago [-]
Tip for those who want to skip shit like this, excessive headings glued together by bullet points is quicker to spot, especially since the headings almost always start with "The".
I now scroll any AI-adjacent article I see and just read headings and if I see this I know what I'm getting into:
The Dexterity Deadlock
The Problem
The Geometric Curse
The Sim-to-Real Gap
The Structural Gap f(⋅)
Seeing It in Motion
The N^2 Impedance Mismatch
The Chaos Term ϵchaos
The Information Wall
The Weakest Link
Why Manipulation Needs Better
What We Built
From 288 to 15
Does It Work?
Hardware Validation
Robot Hand Landscape
The Take-Home
mort96 1 days ago [-]
Oh my god, it's absolutely chock full of AI-isms. Almost every sentence is a list of 3 items, often nested lists of 3 items.
modeless 1 days ago [-]
I think they did a search-and-replace to turn em dashes into semicolons in an attempt to hide the AI. Weird usage of semicolons.
roenxi 1 days ago [-]
Deciding whether A is an X or a Y is a really basic part of why we're all communicating. Suspicion of em dashes is one thing, but once you start getting nervous on seeing "It’s not X. It’s Y." then you're just going to get paranoid.
The fundamentals of an LLM is to statistically match their output with the corpus. The tics they have are really common in natural human usage too.
card_zero 23 hours ago [-]
This line of argument would fall down if it turns out that a human with statistically normal output is a bizarre-sounding human.
knollimar 1 days ago [-]
Did you read the article? It's all AI tells. The tone may as well be a fax machine
roenxi 23 hours ago [-]
I didn't reply to the comments talking about the AI tells. I replied to the comment that is making a bad argument. It doesn't matter to me whether the article is or isn't LLM assisted.
knollimar 22 hours ago [-]
Is the not X it's Y in large frequency not an AI tell? I think they're pointing out it has a tell. Humans use those but not that often.
roenxi 20 hours ago [-]
> Is the not X it's Y in large frequency not an AI tell?
I doubt it. The AIs are statistical models, if they've picked up a habit of saying "not X it's Y" then that is probably the most likely thing for humans to say when they are explaining something. The whole training process is about making what the AI says statistically indistinguishable from what humans do; the only way to pick up that it is an AI is either because the model is badly fitted (which, in fairness, many are, they're still working out the ideal weights) or because it isn't grounded in reality. It isn't reasonable to say "oh this looks like AI" based on small phrases like that, AIs use the same phrases humans tend to. It is where they should be doing the best job of fitting in with us.
"This uses really common phrasing ergo it is AI" is a bad case to be trying to make.
roywiggins 1 days ago [-]
Even the title follows a common LLM pattern (The X Problem/Issue/etc).
sho_hn 1 days ago [-]
The authors presumably don't have English as their first language.
airstrike 1 days ago [-]
Plus a bunch of other elements that are dead giveaways.
In this day and age, I wish people would ask any model OTHER than ChatGPT to rewrite their shit. At least we'd get a different flavor of slop.
k1ko 1 days ago [-]
Once I became aware of this AI slop pattern, I can't stop seeing it everywhere.
kristjansson 1 days ago [-]
While it’s got some clear LLM patterns, the content seems novel enough to be worth the squeeze. That or I’m far enough outside of my Gell-Mann amnesia bubble that I can’t see the slop
1 days ago [-]
kstrauser 1 days ago [-]
"This deployment is temporarily paused" is the crappiest "this account has gone over its budget" error page ever. Does that mean the site's down, or is that some meta-joke like "you've reached the end of the Internet"? Quick, explain to my non-technical friends what a "deployment" is. They're not trying to go on a deployment; they're trying to look at a website.
827a 1 days ago [-]
In this case, I'm not sure it matters what it says or how your non-technical friends interpret it. The site is down. Why it is down doesn't change the next thing casual viewers will do (close the tab).
mort96 1 days ago [-]
But it does matter to their opinion of the site: is it down because the author took it down, is it down due to a technical problem, or is it is it down because the hosting provider took it down?
"This deployment is temporarily paused", if anything, sounds like the people who put the site up took it down again. That sends the wrong message.
Personally, if my hosting provider took my post down, I'd want them to make that obvious to my visitors. Or at the very least make it look like a technical issue. Not make it look like I took it down.
kstrauser 1 days ago [-]
> The site is down.
Is it? The title is "The Robotic Dexterity Deadlock". For all I know, it's a joke about what deadlock looks like for robots, showing what could be interpreted as a deadlock in a webserver. At a glance, I can't tell if the site is down, or if it's up and correctly showing its very short message.
So, yeah, in reality, I'm 99% sure it really is an error message. That's only because I've seen similar error messages in the past and can infer how to interpret it.
Rendered at 23:03:00 GMT+0000 (Coordinated Universal Time) with Vercel.
This new discovery is that gearbox problems mess up a machine learning system. It's trying to track gearbox noise and is using up all its learning capacity on that. This discovery means that robotics people can tap machine learning funding for motor and gearbox development. Robotics labs used to be really low-budget operations. No longer.
What you really want is a direct drive motor, but those have to be large-diameter. They can be flat; that's a pancake motor. That's too large for fingers. So their compromise moves partly in that direction; the rotor is flatter, torques are higher, speeds are slower, and gearbox ratios are lower. As they point out, reflected inertia is the square of the gear ratio, because the gear ratio gets you both going out and coming back. So this is a bigger than linear win.
Good back-drivabiilty means much less risk of gear breakage on overload. Some of the academic designs, such as harmonic drives and series elastic actuators, have huge gear ratios in a small space. That's OK for prototypes but not production. As I've mentioned before, "you cannot strip the teeth of a magnetic field", a line from a GE electric locomotive salesman around 1900. If an overload forces a motor backwards, nothing breaks.
Would have been nice to hear more about the motor design. That's the real achievement here. There are CAD tools which understand electromagnetic fields now, so strange motor geometries are not as much of a trial and error and experience process as it once was. It's also respectable for an EE to work on rotating machinery again. That field matured around the 1960s, and until computers took over motor control, didn't change much.
Key concept: force-based motor control works quite well. Preserve that property through the gear train and force-based hand control works.
What? An ideal capstan drive can be backdriven perfectly fine. You only run into problems once it stops being ideal (e.g. built out of heavy parts, high gear ratio, etc.)
Strings ("tendons") have their own problems. Wear at bearing points, mostly. You need an opposed pair, too. So there's a motor, a winding drums, and something like a spring to maintain tension. Or two motors and two winding drums, which gives better control.
That's why snake robots have never caught on. They're good painting robots, because they can get into tight spots. But repetitive cycling wears the tendons at the same place on each cycle.
There was a good application - Tesla's snake charging robot.[2] All the complicated stuff is in the pedestal, and you can surround that with concrete to prevent damage if it's hit by a car. The snake part has no motors, just discs and strings, plus the connector and probably a cell phone camera at the end. It can be made easily replaceable, like a gas station hose.
But some people thought it was too creepy.
[1] https://www.maccon.com/pancake-motors.html
[2] https://www.youtube.com/watch?v=uMM0lRfX6YI
Compare for instance a jetliner and a bird. The jetliner can fly fast and carry a lot of cargo or human beings. The bird is a lot smaller and slower, but it can do all sorts of things a jetliner cannot. This includes growing itself from a single fertilized egg, self-repair, defense against attacks of various types, finding and getting food, building its own shelter, and reproduction. All because it is made of biological cells. And note that for a jetliner, the similar tasks are carried out by human beings also made of biological cells.
I think a similar analysis explains why artificial general intelligence is impossible.
Robotics doesn't have a single silver bullet - the design space is vast and underexplored.
I continue to be amazed that the wrong form factor keeps being pursued. Though I suppose I shouldn't be too surprised given the parade of failed "AI devices."
Its a bit like choosing JS / python -- of course performance is inferior to a compiled language with highly tailored code, but they are flexible and have an ecosystem that might do 99% of the lifting for you.
But in isolation, I agree with your idea that specialized robots with form fitted specifically to task will likely outperform a more generalized solution in a specific domain of behavior, the more generalized will likely outperform in flexibility and reusability (e.g. capable of reusing the human ecosystem).
This is exactly the example I used to show the ridiculousness of humanoid robots. You're not going to have a humanoid form climb into the driver seat of your car and act as your chauffeur. Your car will be the robot.
Just how ridiculous is a humanoid robot chauffeur? Should know how to drive a stick shift? Should it use its superior ability to swivel its head around to check for cars in the blind spot?
Someday business schools will have a chapter in their product management course that covers all the ways people slide ass backwards into thinking a humanoid robot makes sense. Humanoid robots will always be crushed between the rock of low value use cases below their price points the hard place of high value use cases that deserve tailored solutions.
You don’t need a human-like hand to hold a tool made for humans. As an extreme example, you can make a robot operate a power drill with strap to hold it and a servo with a small bit of wood to operate the trigger mechanism.
But for a robot operating in a space made for humans there certainly are some physical requirements which are based on the human form: maximum volume and clearances, stairs, fragile fixtures that can’t be operated with too much force, etc.
Ever walk through some over-crowded antique shop where you need to twist and lean your body to avoid knocking into thing?
What makes human hands especially suitable for e.g. assembling a phone or installing a door handle onto a car?
yes. do you think it's safe to just plug usb into some hole and type? the safest option for a robot is typing with fingers
I personally am not bullish on 1:1 human hands either, but IMO the question shouldn't be $100k 2 ton Kuka arm vs biped with hands, it's overactuated robotics (build it from the floor with hard coded operations) vs underactuated (build it from the contact point of the work backwards with ML and sensors). We shall see which form factors prevail, but the type of robotics development posted here seems like the way forwards regardless, an ecosystem of small, power dense, reliable, accurate QDD actuators will lead to many general purpose robot applications. I recognize I am not using underactuated vs overactuated in their strict definition here but if you are familiar with robots I think you'll understand where I am coming from as far as a robot design ethos.
I will say though in designing robots of this type without necessarily being bound by trying to make a robot look like a human, I have often found myself accidentally recreating human arm DOF in a round trip way, it does just end up being well packaged beyond the "world designed for humans" talking point. Maybe hands will end up being a similar situation.
Not to dismiss the value of LLMs in those cases as an interface/interpretation layer.
If grandma goes into the windowless surgery factory, I just want the best bots working on her. There is value in having Dr. Bot the replicant give me the face-to-face status updates. We are not breaking out those layers as much, anymore, as the focus becomes minimizing FOMO.
Similar to how claude code gained so much traction in terminal by just leveraging the command line interface that already exists for humans, no need to invent a domain specific MCP to just run shell commands.
I agree with you that it's far from the most efficient approach for specific tasks. But the analogy would be that you also generally don't want to use LLMs to do something you can "just" write a script for... that doesn't make LLMs useless though.
Multiple times, over and over.
We need to stop with the AI stuff.
I now scroll any AI-adjacent article I see and just read headings and if I see this I know what I'm getting into:
The Dexterity Deadlock
The Problem
The Geometric Curse
The Sim-to-Real Gap
The Structural Gap f(⋅)
Seeing It in Motion
The N^2 Impedance Mismatch
The Chaos Term ϵchaos
The Information Wall
The Weakest Link
Why Manipulation Needs Better
What We Built
From 288 to 15
Does It Work?
Hardware Validation
Robot Hand Landscape
The Take-Home
The fundamentals of an LLM is to statistically match their output with the corpus. The tics they have are really common in natural human usage too.
I doubt it. The AIs are statistical models, if they've picked up a habit of saying "not X it's Y" then that is probably the most likely thing for humans to say when they are explaining something. The whole training process is about making what the AI says statistically indistinguishable from what humans do; the only way to pick up that it is an AI is either because the model is badly fitted (which, in fairness, many are, they're still working out the ideal weights) or because it isn't grounded in reality. It isn't reasonable to say "oh this looks like AI" based on small phrases like that, AIs use the same phrases humans tend to. It is where they should be doing the best job of fitting in with us.
"This uses really common phrasing ergo it is AI" is a bad case to be trying to make.
In this day and age, I wish people would ask any model OTHER than ChatGPT to rewrite their shit. At least we'd get a different flavor of slop.
"This deployment is temporarily paused", if anything, sounds like the people who put the site up took it down again. That sends the wrong message.
Personally, if my hosting provider took my post down, I'd want them to make that obvious to my visitors. Or at the very least make it look like a technical issue. Not make it look like I took it down.
Is it? The title is "The Robotic Dexterity Deadlock". For all I know, it's a joke about what deadlock looks like for robots, showing what could be interpreted as a deadlock in a webserver. At a glance, I can't tell if the site is down, or if it's up and correctly showing its very short message.
So, yeah, in reality, I'm 99% sure it really is an error message. That's only because I've seen similar error messages in the past and can infer how to interpret it.