Cislunar Space Beginner's GuideCislunar Space Beginner's Guide
Satellite Orbit Simulation
Cislunar Glossary
Resources & Tools
Blue Team Research
Space News
AI Q&A
Forum
Home
Gitee
GitHub
  • 简体中文
  • English
Satellite Orbit Simulation
Cislunar Glossary
Resources & Tools
Blue Team Research
Space News
AI Q&A
Forum
Home
Gitee
GitHub
  • 简体中文
  • English
  • Site map

    • Home (overview)
    • Intro · what is cislunar space
    • Orbits · spacecraft trajectories
    • Frontiers · directions & labs
    • Glossary · terms & definitions
    • Tools · data & code
    • News · space industry archive
    • Topic · blue-team research
  • Cislunar glossary (terms & definitions)

    • Cislunar Space Glossary
    • Dynamics models

      • Circular Restricted Three-Body Problem (CR3BP)
      • CR3BP with Low-Thrust (CR3BP-LT)
      • A2PPO (Attention-Augmented Proximal Policy Optimization)
      • Curriculum Learning
      • Low-Thrust Transfer MDP Formulation
      • Generalized Advantage Estimation (GAE)
      • Direct Collocation
      • Birkhoff-Gustavson Normal Form
      • Central Manifold
      • Action-Angle Variables
      • Poincaré Section
    • Mission orbits

      • Earth-Moon L1/L2 Halo Orbit (EML1/EML2 Halo)
      • Orbit Identification
    • Navigation

      • X-ray Pulsar Navigation
    • Lunar minerals

      • Changeite-Mg (Magnesium Changeite)
      • Changeite-Ce (Cerium Changeite)
    • Other

      • Starshade
    • Organizations

      • Anduril Industries
      • Booz Allen Hamilton
      • General Dynamics Mission Systems
      • GITAI USA
      • Lockheed Martin
      • Northrop Grumman
      • Quindar
      • Raytheon Missiles & Defense
      • Sci-Tec
      • SpaceX
      • True Anomaly
      • Turion Space

Direct Collocation

Definition

Direct Collocation is a class of methods that directly discretize Optimal Control Problems (OCP) into Nonlinear Programming Problems (NLP) for numerical solution[1]. Unlike indirect methods that require analytical derivation of costate first-order optimality conditions, direct methods simultaneously discretize state and control variables through collocation, transforming the infinite-dimensional continuous OCP into a finite-dimensional NLP. It is currently one of the most widely used numerical methods in spacecraft trajectory optimization.

Basic Principles

Discretization Strategy

In direct collocation, the transfer interval [0,tf][0, t_f][0,tf​] is divided into NNN sub-intervals, with simultaneous satisfaction at collocation points in each sub-interval:

  1. State dynamics constraint: Enforce x˙=f(x,u)\dot{\mathbf{x}} = f(\mathbf{x}, \mathbf{u})x˙=f(x,u) at collocation points
  2. Boundary conditions: Initial state x(0)=x0\mathbf{x}(0) = \mathbf{x}_0x(0)=x0​ and terminal constraint x(tf)∈T\mathbf{x}(t_f) \in \mathcal{T}x(tf​)∈T
  3. Path constraints: Control constraints ∥u∥≤1\|\mathbf{u}\| \leq 1∥u∥≤1, obstacle avoidance constraints, etc.

Hermite-Simpson Collocation

The direct collocation implementation used in the A2PPO research employs the Hermite-Simpson collocation scheme[2]:

  • On each sub-interval [ti,tt+1][t_i, t_{t+1}][ti​,tt+1​], state is interpolated using cubic Hermite polynomials
  • Dynamics defect constraints are enforced at the interval midpoint ti+1/2t_{i+1/2}ti+1/2​
  • State accuracy at collocation points is third-order, defect constraint accuracy is third-order

Comparison with A2PPO

Ul Haq et al. (2026) used trajectories generated by the A2PPO policy as initial guesses for direct collocation, verifying consistency between the two across four scenarios[2]:

ScenarioA2PPO ToF (days)Direct Collocation ToF (days)A2PPO Fuel (kg)Direct Collocation Fuel (kg)
S14.954.992.081.28
S28.387.265.005.29
S37.607.635.105.11
S433.633.120.970.97

Direct collocation typically achieves better fuel efficiency due to exploitation of complete continuous optimality conditions, but:

  • Requires good initial guesses (A2PPO provides high-quality initial solutions)
  • High computational cost, requiring re-solution for each transfer
  • Cannot be computed online in real-time

A2PPO, after training, can perform real-time inference, providing near-instantaneous trajectory solutions.

Direct vs. Indirect Methods

PropertyDirect CollocationIndirect Methods
Derivation difficultyLower (no analytical costate equations)Higher (requires PMP derivation)
Initial guessMore robustSensitive (to costate initial values)
Solution accuracyHigherExtremely high (satisfies first-order optimality)
ConvergenceBetterDepends on initial guess quality
Computational efficiencyModerate (NLP solvers like Ipopt)Higher (but narrow convergence basin)

NLP Solvers

NLPs resulting from direct collocation discretization are typically solved using Sequential Quadratic Programming (SQP) or interior point method solvers:

  • Ipopt: Large-scale nonlinear programming solver based on interior point methods
  • SNOPT: Sequential Quadratic Programming solver
  • CasADi: Symbolic computation framework for constructing NLPs and calling the above solvers

References

  • [1] Betts J T. Survey of numerical methods for trajectory optimization[J]. Journal of Guidance, Control, and Dynamics, 1998.
  • [2] Ul Haq I U, Dai H, Du C. Autonomous low-thrust trajectory optimization in cislunar space via attention-augmented reinforcement learning[J]. Aerospace Science and Technology, 2026.
  • [3] Hargraves C R, Paris S W. Direct trajectory optimization using nonlinear programming and collocation[J]. Journal of Guidance, Control, and Dynamics, 1987.
Improve this page
Last Updated: 4/27/26, 8:30 AM
Contributors: Hermes Agent
Prev
Generalized Advantage Estimation (GAE)
Next
Birkhoff-Gustavson Normal Form
地月空间入门指南
Cislunar Space Beginner's GuideYour guide to cislunar space
View on GitHub

Navigate

  • Home
  • About
  • Space News
  • Glossary

Content

  • Cislunar Orbits
  • Research
  • Resources
  • Blue Team

English

  • Home
  • About
  • Space News
  • Glossary

Follow Us

© 2026 Cislunar Space Beginner's Guide  |  湘ICP备2026006405号-1
Related:智慧学习助手 UStudy航天任务工具箱 ATK
支持我
鼓励和赞赏我感谢您的支持