← 返回
AI智能 中文

Hot3d

HOT3D (Hand-Object 3D Dataset) by Meta Facebook - multi-view egocentric hand and object 3D tracking for Aria/Quest smart glasses. State-of-the-art multi-view...
Meta发布的HOT3D(手-物3D数据集):专为Aria/Quest智能眼镜设计的多视角第一人称手部与物体3D追踪数据集,具备顶尖的多视角技术。
fujunhy
AI智能 clawhub v0.1.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 458
下载
💾 17
安装
1
版本
#latest

概述

HOT3D - Multi-View 3D Hand & Object Tracking

Overview

State-of-the-art multi-view 3D tracking system for egocentric hand-object interactions from Meta Facebook Research. Designed for Aria smart glasses and Quest VR headsets, HOT3D provides precise 3D world coordinates for hand joints, manipulated objects, and their interactions. The system includes visualization tools for rendering 3D overlays on video frames with joint projections, hand meshes, and object models.

Project page: https://facebookresearch.github.io/hot3d

Best for: Research-grade 3D tracking with multi-camera setups, high-precision applications, and XR device integration.

When to Use This Skill

Use when you need:

  • Multi-view 3D tracking with world coordinates
  • High-precision hand pose in 3D space (millimeter accuracy)
  • Object tracking during manipulation
  • Aria/Quest integration for wearable devices
  • Research-grade tracking benchmarks
  • Hand-object interaction analysis in 3D

vs alternatives:

  • More advanced than single-view methods (hands-3d-pose)
  • Higher precision than bounding box detection (handtracking)
  • Full 3D world coordinates vs 2D projections

Core Capabilities

1. Multi-View 3D Hand Tracking

21-keypoint 3D hand pose from multiple synchronized cameras:

  • 3D world coordinates (x, y, z) for each joint
  • Joint confidence scores
  • Left/right hand identification
  • Temporal consistency across frames
  • Hand mesh reconstruction

2. Object Pose Estimation

6DOF object pose tracking:

  • 3D position and orientation (quaternion/rotation matrix)
  • Object mesh alignment
  • Tracking during manipulation
  • Multiple object support

3. Hand-Object Interaction

Interaction analysis:

  • Contact point detection
  • Grasp type classification
  • Manipulation phase detection
  • Force estimation (with sensor data)

4. Visualization Tools

Rich visualization options:

  • 3D skeleton projected to each camera view
  • Hand mesh rendering
  • Object model overlay
  • Trajectory visualization
  • Multi-view synchronized display

Quick Start

# Clone repository
git clone https://github.com/facebookresearch/hot3d.git
cd hot3d

# Install dependencies
pip install -r requirements.txt
# Key: PyTorch3D, Open3D, vispy

# Download dataset (requires registration)
# https://facebookresearch.github.io/hot3d/dataset.html

# Run demo
python demo/visualize_tracking.py \
    --sequence demo_sequence \
    --output_dir ./visualizations

Usage Example

from hot3d import HOT3DTracker
import numpy as np

# Initialize tracker
tracker = HOT3DTracker()
tracker.load_sequence('path/to/sequence')

# Get frame data
frame_data = tracker.get_frame(frame_id=100)

# Access 3D hand pose
hand_pose_3d = frame_data['left_hand']  # 21x3 array
print(f"Wrist position: {hand_pose_3d[0]}")  # [x, y, z]

# Access object pose
object_pose = frame_data['object_001']
position = object_pose['position']  # [x, y, z]
rotation = object_pose['rotation']  # 3x3 matrix

# Visualize
tracker.visualize_frame(
    frame_id=100,
    show_hands=True,
    show_objects=True,
    show_meshes=True,
    save_path='output.png'
)

Model Specs

  • Input: Multi-view RGB-D video streams (typically 3-5 cameras)
  • Output: 3D coordinates in world frame (millimeters)
  • Accuracy: ~5-10mm hand joint error
  • Frame rate: 30-60 Hz (depends on hardware)
  • Latency: <100ms for real-time applications

Requirements

  • Hardware: Multi-camera setup or Aria/Quest device
  • Computation: GPU recommended (NVIDIA RTX 3080 or better)
  • Storage: Large dataset (several TB for full HOT3D)
  • Software: PyTorch, PyTorch3D, Open3D

Dataset

HOT3D dataset includes:

  • 100+ sequences of daily activities
  • Multi-view RGB-D video
  • 3D hand and object annotations
  • Aria/Quest recordings
  • Smart glasses data

Access: https://facebookresearch.github.io/hot3d

Integration

Works with:

  • hand-tracking-toolkit: Evaluation and metrics
  • Aria SDK: Device integration
  • PyTorch3D: 3D processing
  • OpenXR: XR platform integration

Limitations

  • Requires multi-view setup or specialized hardware
  • Computational intensive
  • Dataset access requires registration
  • Complex setup compared to single-view methods

Best For

  • XR applications with smart glasses
  • Research in 3D hand tracking
  • High-precision manipulation analysis
  • Benchmarking new algorithms

References

  • Project: https://facebookresearch.github.io/hot3d
  • GitHub: https://github.com/facebookresearch/hot3d
  • Paper: HOT3D dataset publication
  • Citation: See project page

License

CC-BY-NC 4.0 (non-commercial only)

版本历史

共 1 个版本

  • v0.1.0 当前
    2026-03-30 06:42 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,358 📥 318,511
ai-intelligence

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 418 📥 115,248
ai-intelligence

ontology

oswalpalash
类型化知识图谱,用于结构化智能体记忆与可组合技能。支持创建/查询实体(人员、项目、任务、事件、文档)及关联...
★ 712 📥 243,903