No details are provided by Apple on actual hardware specifications of Vision Pro so far. Some information is gathered through WWDC sessions, interviews, responses.
Apple announced 12 cameras, 5 sensors on the headset. LiDAR, and 3D imaging cameras are installed. Apple mentioned that only some of these cameras and sensors will be available for developers, while others will be used for face scanning and gesture recognition. Four cameras with LED illumnatio will be tracking eye movements.
Headset features Digital Crown (which also acts as a button) and separate button which could trigger photo capture or video recording. No controllers are required to work with Vision Pro. There are multiple "standard" hand gestures which are detected by the system and custom gestures could be implemented by application developers.
Notably, Vision Pro support direct touch manipulation and "indirect" subtle finger gestures like pinches. Eye glance is used to highlight element which will be responding to gesture.
Internal displays are made using micro-OLED technology and have total of 23 million pixels. It is reported that Vision Pro runs on 90Hz refresh rate with boosts to 96 Hz to display 24 FPS video content properly.
Vision Pro is powered by M2 and R1 chips. M2 acts as CPU/GPU engine for the headset while R1 is focused on signal processing to accumulate and analyze information coming from cameras and sensors.
Headset could be used plugged in or via external battery. Vision Pro should have 2h battery life.