Cislunar Space Beginner's GuideCislunar Space Beginner's Guide
Satellite Orbit Simulation
Cislunar Glossary
Resources & Tools
Blue Team Research
Space News
AI Q&A
Forum
Home
Gitee
GitHub
  • 简体中文
  • English
Satellite Orbit Simulation
Cislunar Glossary
Resources & Tools
Blue Team Research
Space News
AI Q&A
Forum
Home
Gitee
GitHub
  • 简体中文
  • English
  • Site map

    • Home (overview)
    • Intro · what is cislunar space
    • Orbits · spacecraft trajectories
    • Frontiers · directions & labs
    • Glossary · terms & definitions
    • Tools · data & code
    • News · space industry archive
    • Topic · blue-team research
  • Cislunar glossary (terms & definitions)

    • Cislunar Space Glossary
    • Dynamics models

      • Circular Restricted Three-Body Problem (CR3BP)
      • CR3BP with Low-Thrust (CR3BP-LT)
      • A2PPO (Attention-Augmented Proximal Policy Optimization)
      • Curriculum Learning
      • Low-Thrust Transfer MDP Formulation
      • Generalized Advantage Estimation (GAE)
      • Direct Collocation
      • Birkhoff-Gustavson Normal Form
      • Central Manifold
      • Action-Angle Variables
      • Poincaré Section
      • Clohessy-Wiltshire (CW) Equation
      • Patched Method (拼接法)
      • Continuation (延拓)
      • Differential Correction (微分修正)
      • Poincaré Map (庞加莱图)
      • Impulsive Maneuver (脉冲机动)
      • Zero-Velocity Surface
      • Hill Three-Body Problem
      • Bicircular Four-Body Problem
      • Quasi-Bicircular Four-Body Problem
      • Strobe Map
      • Stability Set
      • Backward Stability Set
      • Capture Set
      • /en/glossary/dynamics/batch-deployment.html
      • /en/glossary/dynamics/state-dependent-tsp.html
      • /en/glossary/dynamics/q-law.html
      • /en/glossary/dynamics/mass-discontinuity.html
      • /en/glossary/dynamics/equinoctial-elements.html
      • /en/glossary/dynamics/dynamic-programming.html
      • /en/glossary/dynamics/coasting-arc.html
    • Mission orbits

      • Distant Retrograde Orbit (DRO)
      • Near-Rectilinear Halo Orbit (NRHO)
      • Earth-Moon L1/L2 Halo Orbit (EML1/EML2 Halo)
      • DRO Constellation
      • Orbit Identification
      • Transfer Orbit (转移轨道)
      • Perilune (近月点)
      • Apolune (远月点)
      • Retrograde (逆行)
      • Prograde (顺行)
      • Parking Orbit (停泊轨道)
      • Free-Return Trajectory (自由返回轨道)
      • Halo Orbit (Halo 轨道)
      • Lissajous Orbit (Lissajous 轨道)
      • Lyapunov Orbit (Lyapunov 轨道)
      • Cycler Trajectory
      • Multi-Revolution Halo Orbit
      • Ballistic Capture Orbit
      • Low-Energy Transfer Orbit
      • Full Lunar Surface Coverage Orbit
      • /en/glossary/orbits/hub-and-spoke.html
    • Navigation

      • X-ray Pulsar Navigation
      • LiAISON Navigation
    • Lunar minerals

      • Changeite-Mg (Magnesium Changeite)
      • Changeite-Ce (Cerium Changeite)
    • Other

      • Starshade
      • Noncooperative Target
      • Spacecraft Intention Recognition
      • Chain-of-Thought (CoT) Prompting
      • Low-Rank Adaptation (LoRA)
      • Prompt Tuning (P-tuning)
      • Cislunar Space (地月空间)
      • Low Earth Orbit / LEO (低地球轨道)
      • Lunar Gravity Assist / LGA (月球借力)
      • Powered Lunar Flyby / PLF (有动力月球借力)
      • Weak Stability Boundary / WSB (弱稳定边界)
      • /en/glossary/other/libration-point.html
      • Orbit Insertion (入轨)
      • /en/glossary/other/orbital-residence-platform.html
    • Organizations

      • Anduril Industries
      • Booz Allen Hamilton
      • General Dynamics Mission Systems
      • GITAI USA
      • Lockheed Martin
      • Northrop Grumman
      • Quindar
      • Raytheon Missiles & Defense
      • Sci-Tec
      • SpaceX
      • True Anomaly
      • Turion Space
    • Military space doctrine

      • Space Superiority
      • Competitive Endurance
      • DOTMLPF-P Framework
      • Mission Command
      • Force Design
      • Force Development
      • Force Generation
      • Force Employment
      • Space Force Generation Process (SPAFORGEN)
      • Mission Delta (MD)
      • System Delta (SYD)
      • Space Mission Task Force (SMTF)
      • Commander, Space Forces (COMSPACEFOR)
      • Component Field Commands
      • Space Domain Awareness (SDA)
      • Counterspace Operations
      • Resilient/Disaggregated Architecture
      • Operational Test and Training Infrastructure (OTTI)
      • Golden Dome
    • Observation techniques

      • Image Stacking
      • Shift-and-Add (SAA)
      • Synthetic Tracking
      • Sidereal Tracking
      • Signal-to-Noise Ratio (SNR)
      • Astrometry
      • Source Extraction
      • Ephemeris Correlation
      • Cislunar Moving Objects
      • Lunar Glare Zone
      • Image Registration
      • Background Star Elimination
      • Segmentation Map
      • Hot Pixel
    • Satellite Communication & TT&C

      • BeiDou Satellite System
      • Inter-Satellite Link (ISL)
      • All-Time Seamless Communication
      • Constellation Networking
      • Microwave Link
      • Laser-Microwave Communication

Direct Collocation

Definition

Direct Collocation is a class of methods that directly discretize Optimal Control Problems (OCP) into Nonlinear Programming Problems (NLP) for numerical solution. Unlike indirect methods that require analytical derivation of costate first-order optimality conditions, direct methods simultaneously discretize state and control variables through collocation, transforming the infinite-dimensional continuous OCP into a finite-dimensional NLP. It is currently one of the most widely used numerical methods in spacecraft trajectory optimization.

Basic Principles

Discretization Strategy

In direct collocation, the transfer interval [0,tf][0, t_f][0,tf​] is divided into NNN sub-intervals, with simultaneous satisfaction at collocation points in each sub-interval:

  1. State dynamics constraint: Enforce x˙=f(x,u)\dot{\mathbf{x}} = f(\mathbf{x}, \mathbf{u})x˙=f(x,u) at collocation points
  2. Boundary conditions: Initial state x(0)=x0\mathbf{x}(0) = \mathbf{x}_0x(0)=x0​ and terminal constraint x(tf)∈T\mathbf{x}(t_f) \in \mathcal{T}x(tf​)∈T
  3. Path constraints: Control constraints ∥u∥≤1\|\mathbf{u}\| \leq 1∥u∥≤1, obstacle avoidance constraints, etc.

Hermite-Simpson Collocation

The direct collocation implementation used in the A2PPO research employs the Hermite-Simpson collocation scheme:

  • On each sub-interval [ti,tt+1][t_i, t_{t+1}][ti​,tt+1​], state is interpolated using cubic Hermite polynomials
  • Dynamics defect constraints are enforced at the interval midpoint ti+1/2t_{i+1/2}ti+1/2​
  • State accuracy at collocation points is third-order, defect constraint accuracy is third-order

Comparison with A2PPO

Ul Haq et al. (2026) used trajectories generated by the A2PPO policy as initial guesses for direct collocation, verifying consistency between the two across four scenarios:

ScenarioA2PPO ToF (days)Direct Collocation ToF (days)A2PPO Fuel (kg)Direct Collocation Fuel (kg)
S14.954.992.081.28
S28.387.265.005.29
S37.607.635.105.11
S433.633.120.970.97

Direct collocation typically achieves better fuel efficiency due to exploitation of complete continuous optimality conditions, but:

  • Requires good initial guesses (A2PPO provides high-quality initial solutions)
  • High computational cost, requiring re-solution for each transfer
  • Cannot be computed online in real-time

A2PPO, after training, can perform real-time inference, providing near-instantaneous trajectory solutions.

Direct vs. Indirect Methods

PropertyDirect CollocationIndirect Methods
Derivation difficultyLower (no analytical costate equations)Higher (requires PMP derivation)
Initial guessMore robustSensitive (to costate initial values)
Solution accuracyHigherExtremely high (satisfies first-order optimality)
ConvergenceBetterDepends on initial guess quality
Computational efficiencyModerate (NLP solvers like Ipopt)Higher (but narrow convergence basin)

NLP Solvers

NLPs resulting from direct collocation discretization are typically solved using Sequential Quadratic Programming (SQP) or interior point method solvers:

  • Ipopt: Large-scale nonlinear programming solver based on interior point methods
  • SNOPT: Sequential Quadratic Programming solver
  • CasADi: Symbolic computation framework for constructing NLPs and calling the above solvers

References

  • Betts J T. Survey of numerical methods for trajectory optimization[J]. Journal of Guidance, Control, and Dynamics, 1998.
  • Ul Haq I U, Dai H, Du C. Autonomous low-thrust trajectory optimization in cislunar space via attention-augmented reinforcement learning[J]. Aerospace Science and Technology, 2026.
  • Hargraves C R, Paris S W. Direct trajectory optimization using nonlinear programming and collocation[J]. Journal of Guidance, Control, and Dynamics, 1987.
Improve this page
Last Updated: 4/29/26, 11:30 AM
Contributors: Hermes Agent, Cron Job
Prev
Generalized Advantage Estimation (GAE)
Next
Birkhoff-Gustavson Normal Form
地月空间入门指南
Cislunar Space Beginner's GuideYour guide to cislunar space
View on GitHub

Navigate

  • Home
  • About
  • Space News
  • Glossary

Content

  • Cislunar Orbits
  • Research
  • Resources
  • Blue Team

English

  • Home
  • About
  • Space News
  • Glossary

Follow Us

© 2026 Cislunar Space Beginner's Guide  |  湘ICP备2026006405号-1
Related:智慧学习助手 UStudy航天任务工具箱 ATK
支持我
鼓励和赞赏我感谢您的支持