Cislunar Space Beginner's GuideCislunar Space Beginner's Guide
  • Satellite Simulation
Cislunar Glossary
Resources & Tools
Space News
AI Q&A
Forum
Home
Gitee
GitHub
  • 简体中文
  • English
  • Satellite Simulation
Cislunar Glossary
Resources & Tools
Space News
AI Q&A
Forum
Home
Gitee
GitHub
  • 简体中文
  • English
  • Site map

    • Home (overview)
    • What is cislunar space
    • Spacecraft trajectories
    • Directions & labs
    • Glossary · terms & definitions
    • Data & code
    • Space industry archive
  • Cislunar glossary (terms & definitions)

    • Cislunar Space Glossary
    • Fundamentals

      • Absolute Range
      • Aerodynamic Coefficient
      • Aerodynamic Moment
      • Aerospace Vehicle
      • Allan Deviation (ADEV)
      • Ballistic Coefficient
      • Bi-Elliptic Transfer
      • Body Frame
      • Celestial Coordinate System
      • Celestial Sphere
      • Characteristic Velocity
      • Coverage Angle
      • Dual One-Way Ranging (DOWR)
      • Earth Ellipsoid
      • Earth Oblateness Perturbation
      • Earth-Centered Earth-Fixed Frame (ECEF)
      • Einstein Equivalence Principle (EEP)
      • Energy Parameter
      • Earth Observation (EO)
      • Finite Thrust Maneuver
      • Free-Flight Phase
      • Free-Flight Trajectory
      • Frozen Orbit
      • Gaussian Perturbation Equations
      • Geocentric Inertial Frame
      • GPS Time
      • Gravitational Potential
      • Gravitational Redshift
      • Gravity Turn
      • Gravity vs Gravitation
      • High Altitude Airship (HAA)
      • Hit Equation
      • Hohmann Transfer
      • Inertial Navigation System
      • Instantaneous Balance Assumption
      • In-Situ Resource Utilization (ISRU)
      • Julian Date
      • Kepler's Equation
      • Korea Multi-Purpose Satellite (KOMPSAT)
      • Lagrangian Perturbation Equations
      • Launch Azimuth
      • Launch Window
      • Lift-to-Drag Ratio
      • Load Factor
      • Longitudinal and Lateral Motion
      • Lunar Lander
      • Minimum Energy Trajectory
      • Near-space
      • Newton's Iteration Method
      • Nuri (KSLV-II)
      • Nutation
      • Optimal Velocity Inclination
      • Orbit Capture
      • Orbit Insertion Conditions
      • Orbital Elements
      • Orbital Equation
      • Orbital Maneuver
      • Orbital Phase
      • Orbital Transfer Vehicle
      • Passive Hydrogen Maser (PHM)
      • Perturbation Motion
      • Phasing Orbit
      • Pitch Program Angle
      • Powered Phase
      • Precession
      • Center of Pressure
      • Range Error Coefficient
      • Reentry Corridor
      • Reentry Phase
      • Repeat Ground Track Orbit
      • Reusable Launch Vehicle
      • Synthetic Aperture Radar (SAR)
      • Satellite Ring
      • Sequential Quadratic Programming
      • Skip Reentry
      • Solar Exposure Factor
      • Specific Angular Momentum
      • Specific Impulse
      • Stagnation Heat Flux
      • Standard Atmosphere
      • Stratospheric Airship
      • Subsatellite Track
      • Sun-Synchronous Orbit
      • Thrust-to-Weight Ratio
      • Thrust
      • Total Angle of Attack
      • Trajectory Equation
      • Trajectory Optimization
      • Trim Angle of Attack
      • True Anomaly
      • Tsiolkovsky Rocket Equation
      • Powered Phase Turning Process
      • Two-Body Problem
      • Coordinated Universal Time
      • Variation of Parameters
      • Velocity Frame
      • Velocity Inclination Angle
      • Vis-Viva Equation
      • Very Low Earth Orbit (VLEO)
      • Walker Constellation
      • Zero-Angle-of-Attack Reentry
    • Dynamics & math

      • A* Search Algorithm (A* Search)
      • A2PPO (Attention-Augmented Proximal Policy Optimization)
      • Action-Angle Variables
      • Backstepping Sliding Mode Control
      • Backward Stability Set
      • Bang-bang Control (Bang-bang Control)
      • Barycentric Synodic Coordinate System
      • Batch Deployment (Batch Deployment)
      • Bicircular Four-Body Problem
      • Birkhoff-Gustavson Normal Form
      • Buoyancy-weight Imbalance
      • Capture Set
      • Central Manifold
      • Chaos Effect
      • Clohessy-Wiltshire (CW) Equation
      • Co-state Normalization (Co-state Normalization)
      • Co-state Variables
      • Coasting Arc (Coasting Arc)
      • Continuation Method (Parameter Continuation)
      • Continuation
      • Cooperative Agent (CA)
      • CR3BP with Low-Thrust (CR3BP-LT)
      • Circular Restricted Three-Body Problem (CR3BP)
      • Curriculum Learning
      • Deep Deterministic Policy Gradient (DDPG)
      • Deep Reinforcement Learning
      • Detection Graph
      • Differential Correction
      • Differential Evolution (DE) Algorithm
      • Differential Games (Differential Games)
      • Direct Collocation
      • Dynamic Programming (Dynamic Programming)
      • Dynamic Target Method
      • Ephemeris Model
      • Equinoctial Orbital Elements (Equinoctial Orbital Elements)
      • Earth Restricted Three-Body Problem (ERTBP)
      • Fuel-optimal Control
      • Fuzzy Backstepping Control
      • Generalized Advantage Estimation (GAE)
      • Gaussian Process Regression
      • Geocentric Rotating Coordinate System (GRC)
      • Hamiltonian
      • Hybrid Cluster Particle Swarm Optimization (HCPSO)
      • Heteroclinic Orbit Transfer (Heteroclinic Orbit Transfer)
      • Hill Three-Body Problem
      • Homotopy Method (Homotopy Method)
      • Improved Baseline Control-Point Method (Improved Baseline Control-Point Method)
      • Impulsive Maneuver
      • Initial Value Optimization
      • Invariant Manifold (Invariant Manifold)
      • J2000 Geocentric Equatorial Coordinate System (J2000 Geocentric Equatorial Coordinate System)
      • Jacobi Constant (Jacobi Integral)
      • K-Means Clustering (K-Means Clustering)
      • K-Medoids Clustering (K-Medoids Clustering)
      • KD-Tree (KD-Tree)
      • Libration Point (Equilibrium Point)
      • Libration Point Spacecraft Body Coordinate System (Libration Point Spacecraft Body Coordinate System)
      • Libration Point Spacecraft Orbital Coordinate System (Libration Point Spacecraft Orbital Coordinate System)
      • Lindstedt-Poincare Method (Lindstedt-Poincare Method)
      • L2-centered Rotating Coordinate System (L2-centered Rotating Coordinate System, LRC)
      • LSTM Neural Network
      • Low-Thrust Transfer MDP Formulation
      • Mass Discontinuity (Mass Discontinuity)
      • Multi-Objective Monte Carlo Tree Search (MO-MCTS)
      • Modal Analysis
      • Monodromy Matrix
      • Monte Carlo Tree Search
      • Newton-Euler Equations
      • NSGA II (Non-dominated Sorting Genetic Algorithm II)
      • Pareto Optimality
      • Particle Swarm Optimization
      • Patch Point (Splicing Point)
      • Patched Method
      • Poincaré Map
      • Poincaré Section
      • Pontryagin's Maximum Principle
      • Pseudo-Arclength Continuation
      • Spacecraft Pursuit-Evasion Game
      • Q-Law Control Law
      • Quasi-Bicircular Problem (QBCP)
      • Quasi-Bicircular Four-Body Problem
      • Reachable Set
      • Reduced-Order Dynamic Equations
      • Regional Station-keeping Control
      • Regularization
      • Reinforcement Learning Enhanced Particle Swarm Optimization (RLEPSO)
      • Saddle-Point Strategy
      • Seven-node Model
      • Shooting Method
      • Six-DOF Motion Equations
      • Sliding Mode Control
      • Solar Radiation Pressure (SRP)
      • Stability Index
      • Stability Set
      • State-Dependent Traveling Salesman Problem (SDTSP)
      • State Transition Matrix (STM)
      • Static Lift
      • Strobe Map
      • Switching Function
      • Targeting Method
      • Thermo-mechanical Coupling Model
      • Thermodynamic Model
      • Two-Point Boundary Value Problem (TPBVP)
      • Trim Condition
      • Two-Dominant Invariant Manifold Method
      • Two-Level Differential Correction Method
      • Two-node Model
      • Variational Mode Decomposition
      • Zero-Effort Miss (ZEM)
      • Zero-Velocity Surface
    • Mission orbits

      • Apolune
      • Axial Orbit
      • Ballistic Capture Orbit
      • Butterfly Orbit
      • Cycler Trajectory
      • Distant Prograde Orbit (DPO)
      • DRO Constellation
      • Distant Retrograde Orbit (DRO)
      • Earth-Moon L1/L2 Halo Orbit (EML1/EML2 Halo)
      • Free-Return Trajectory
      • Full Lunar Surface Coverage Orbit
      • Halo Orbit
      • Heteroclinic Connection
      • Horseshoe Orbit
      • Hub-and-Spoke
      • Lissajous Orbit
      • Long Period Orbit
      • Low Prograde Orbit (LoPO)
      • Low-Energy Transfer Orbit
      • Low-Thrust Transfer Orbit
      • Lyapunov Orbit
      • Multi-Revolution Halo Orbit
      • Near-Rectilinear Halo Orbit (NRHO)
      • Orbit Identification
      • Orbit Keeping (Station-Keeping)
      • Parking Orbit
      • Perilune
      • Polynomial Constraint Station-Keeping
      • Primary Impulse Orbit Transfer
      • Prograde
      • Quasi-Periodic Orbit
      • Resonance Orbit
      • Retrograde
      • Short Period Orbit
      • Transfer Orbit
      • Triangular Libration Points
      • Vertical Orbit
    • Navigation & systems

      • Altitude Regulation
      • Autonomous Navigation
      • Cislunar Spatiotemporal Reference
      • Earth-Moon Hybrid Navigation
      • Extended Kalman Filter (EKF)
      • GPS Aided GEO Augmented Navigation (GAGAN)
      • Earth GNSS Weak Signal Navigation
      • Inter-Satellite Link Navigation
      • Indian Regional Navigation Satellite System (IRNSS)
      • LEO Navigation Augmentation
      • LiAISON Navigation
      • LunaNet (Lunar Network)
      • Lunar Navigation Constellation
      • Moonlight Initiative
      • Observability
      • Positioning, Navigation, and Timing (PNT)
      • Sun-Earth-Moon Autonomous Navigation
      • Tiandu-1
      • Trajectory Planning
      • X-ray Pulsar Navigation
    • Astronomy & observation

      • Astrometry
      • Background Star Elimination
      • Cislunar Moving Objects
      • Continuous Coverage (CP)
      • Earth Albedo
      • Ephemeris Correlation
      • Hot Pixel
      • Illumination Constraint
      • Image Registration
      • Image Stacking
      • Infrared Radiation
      • Lunar Glare Zone
      • Pointing Constraint
      • Quasi-zero Wind Layer
      • Segmentation Map
      • Shift-and-Add (SAA)
      • Sidereal Tracking
      • Signal-to-Noise Ratio (SNR)
      • Solar Radiation
      • Source Extraction
      • Synthetic Tracking
      • Zonal Wind
    • Military space doctrine

      • Anti-Satellite Test (ASAT)
      • Cislunar Space Situational Awareness
      • Civil-Military Integration
      • Competitive Endurance
      • Component Field Commands
      • Commander, Space Forces (COMSPACEFOR)
      • Counterspace Operations
      • Directed Energy Weapon (DEW)
      • Distributed Architecture
      • DOTMLPF-P Framework
      • Force Design
      • Force Development
      • Force Employment
      • Force Generation
      • Golden Dome
      • Kinetic Weapon
      • Mission Command
      • Mission Delta (MD)
      • Operational Test and Training Infrastructure (OTTI)
      • Persistent Detection Corridor (PDC)
      • Resilience Map
      • Resilient/Disaggregated Architecture
      • Space Domain Awareness (SDA)
      • Space Mission Task Force (SMTF)
      • Space Superiority
      • Space Force Generation Process (SPAFORGEN)
      • System Delta (SYD)
    • Organizations

      • Anduril Industries
      • Booz Allen Hamilton
      • Danuri Lunar Orbiter
      • General Dynamics Mission Systems
      • GITAI USA
      • Indian Space Research Organisation
      • Korea Aerospace Administration
      • Lockheed Martin
      • Northrop Grumman
      • Quindar
      • Raytheon Missiles & Defense
      • Sci-Tec
      • SpaceX
      • Satish Dhawan Space Centre SHAR
      • True Anomaly
      • Turion Space

Dynamic Programming (Dynamic Programming)

Editor Source: 胡敏, 肖金伟, 张天天, 陶雪峰 (2026) "面向中高轨小卫星批量部署的轨道转移飞行器任务规划"

Bellman R. Dynamic Programming[M]. Princeton University Press, 2010.

Website: https://cislunarspace.cn

Definition

Dynamic Programming (DP) is a mathematical method for solving multi-stage decision process optimization by decomposing complex problems into interrelated subproblems, solving bottom-up and gradually constructing optimal solutions.

The core characteristics of dynamic programming are:

  • Optimal substructure: Global optimal solution contains optimal solutions of local subproblems
  • No aftereffect: Current state choices only affect subsequent decisions, not past choices
  • Overlapping subproblems: Subproblems are repeatedly calculated multiple times and can be stored for reuse

Bellman's Principle of Optimality

Principle Statement

Bellman's Principle of Optimality is the theoretical foundation of dynamic programming:

"An optimal policy has the property that whatever the initial state and initial decision are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision."

Mathematical Expression

For the State-Dependent Traveling Salesman Problem (SDTSP), Bellman's equation is:

J(S,j)=min⁡i∈S∖{j}{J(S∖{j},i)+C(i,j,k)}J(S, j) = \min_{i \in S \setminus \{j\}} \{J(S \setminus \{j\}, i) + C(i, j, k)\} J(S,j)=i∈S∖{j}min​{J(S∖{j},i)+C(i,j,k)}

Where:

  • J(S,j)J(S, j)J(S,j): Minimum cumulative cost for planning node (S,j)(S, j)(S,j)
  • SSS: Set of visited orbits
  • jjj: Current orbit
  • C(i,j,k)C(i, j, k)C(i,j,k): Cost from orbit iii to orbit jjj (dependent on state kkk)

Physical Significance

The optimal path to reach the current state (S,j)(S, j)(S,j) must contain an optimal path to some predecessor state (S∖{j},i)(S \setminus \{j\}, i)(S∖{j},i), then transit from orbit iii to orbit jjj.

Application in Orbit Deployment

Problem Modeling

胡敏等 (2026) modeled the OTV batch deployment mission as SDTSP and solved it using dynamic programming:

  1. Planning node: Binary (S,j)(S, j)(S,j) uniquely determines state

    • S⊆VS \subseteq VS⊆V: Subset of visited orbits
    • j∈Sj \in Sj∈S: Current orbit of OTV
  2. State transition: After deploying a satellite, state updates

    k=∣S∣−1k = |S| - 1 k=∣S∣−1

    Where kkk is the number of deployed satellites

  3. Value function: J(S,j)J(S, j)J(S,j) represents the minimum cumulative cost from the starting orbit, visiting all orbits in set SSS, and ending at orbit jjj

Initial Subproblem

The basis of recursion is the smallest subproblem visiting only the starting orbit:

J({istart},istart)=0J(\{i_{start}\}, i_{start}) = 0 J({istart​},istart​)=0

Final Return

After completing all deployments, the cost of returning to the starting orbit must be calculated:

Jfinal=min⁡u∈V(J(V,u)+C(u,istart,N))J_{final} = \min_{u \in V} (J(V, u) + C(u, i_{start}, N)) Jfinal​=u∈Vmin​(J(V,u)+C(u,istart​,N))

Algorithm Implementation

State Compression Technique

To improve computational efficiency, 胡敏等 (2026) used state compression:

  • Orbital subset SSS is mapped to binary integer bitmask
  • Set inclusion test becomes efficient bitwise operation
  • Algorithm complexity reduces from exponential to polynomial

Algorithm Flow

Input: Total number of targets N, State-dependent cost matrix C, Starting orbit i_start
1. Initialize J to infinity, J[i_start, i_start] = 0
2. Iterate through all possible orbital subsets m and current node u
3. Update cumulative cost J[S_new, v]
4. Backtrack to extract optimal deployment sequence Π
Output: Optimal cost J and deployment sequence Π

Complexity Analysis

MetricComplexityDescription
TimeO(N2⋅2N)O(N^2 \cdot 2^N)O(N2⋅2N)Iterate through all state combinations
SpaceO(N⋅2N)O(N \cdot 2^N)O(N⋅2N)Store optimal cost array

Performance Comparison

Research results (胡敏等, 2026) show that in medium-orbit batch deployment missions:

AlgorithmN=8 Propellant (kg)N=12 Propellant (kg)Optimality
DP (This method)487.7632.1Exact optimal
GA507.8632.1Heuristic
Greedy502.2669.1*Locally optimal

*Exceeds initial propellant capacity

Dynamic programming performs optimally in medium-scale missions with N≤12, with advantages:

  • Global optimality guarantee: Ensures finding the optimal deployment sequence
  • High computational efficiency: CPU time only requires milliseconds
  • Good robustness: Not affected by initial solutions

Comparison with Other Algorithms

CharacteristicDynamic ProgrammingGenetic AlgorithmGreedy Algorithm
OptimalityExact optimalProbabilistic convergenceLocally optimal
Time complexityO(N2⋅2N)O(N^2 \cdot 2^N)O(N2⋅2N)AdjustableO(N2)O(N^2)O(N2)
Space complexityO(N⋅2N)O(N \cdot 2^N)O(N⋅2N)MediumO(N)O(N)O(N)
Applicable scaleN < 15Any scaleN < 10

Related Concepts

  • State-Dependent TSP
  • Bellman's Principle of Optimality
  • Batch Deployment
  • Mass Discontinuity
  • Q-law Control Law

References

  • 胡敏, 肖金伟, 张天天, 陶雪峰. 面向中高轨小卫星批量部署的轨道转移飞行器任务规划[J]. 航天器工程, 2026, 25(3): 634-646.
  • Bellman R. Dynamic Programming[M]. Princeton: Princeton University Press, 2010.
  • Narayanaswamy S, Wu B, Ludivig P, et al. Low-thrust rendezvous trajectory generation for multi-target active space debris removal using the RQ-law[J]. Advances in Space Research, 2023, 71(10): 4276-4287.
Improve this page
Last Updated: 5/3/26, 9:38 AM
Contributors: Cron Job
Prev
Direct Collocation
Next
Dynamic Target Method
地月空间入门指南
Cislunar Space Beginner's GuideYour guide to cislunar space
View on GitHub

Navigate

  • Home
  • About
  • Space News
  • Glossary

Content

  • Cislunar Orbits
  • Research
  • Resources

English

  • Home
  • About
  • Space News
  • Glossary

Follow Us

© 2026 Cislunar Space Beginner's Guide  |  湘ICP备2026006405号-1
Related:智慧学习助手 UStudy航天任务工具箱 ATK
微信公众号
欢迎关注天疆说扫码关注,手机获取航天资讯