Undergraduate Literature Review Guide
Example Topic: Reconstruction and Forecast of Spatiotemporal Data in Physics: Multi-Scale Challenges
π― Objective
You will conduct a structured literature review to support the introduction section of a research publication.
Your goal is to explore how scientists and engineers reconstruct and forecast multi-scale spatiotemporal physical data, and to compare the state of the art with and without machine learning (ML) approaches.
Your final review will:
- Explain why the problem (e.g., reconstructing and compressing multi-scale wavefields) is important, difficult, and impactful if it were resolved..
- Summarize traditional (non-ML), ML-based approaches, and whatever is the state of the art.
- Highlight what advanced architectures (e.g., SHRED, CNNs, Transformers, PINNs) or curated data sets (e.g., balanced and diverse, data volumes; e.g., The Well) can do β and where their limitations and opportunities.
π§ Step-by-Step Instructions
1. Define the Problem
- Write a short paragraph explaining what the problem (e.g., reconstruction and forecasting) mean in the context of physical systems (e.g., wavefields, turbulence, seismic data, or climate), and ensure to write a clear statement for why it will be impactful if solved for society, for engineering, etc.
- Explain what aspects of the problems (e.g., multi-scale problems) are challenging β mention scale separation, nonlinearity, and high-dimensionality.
- State that your goal is to understand how data-driven and physics-based approaches address these challenges.
2. Structure of the Literature Review
Your report should have the following sections:
A. Introduction and Motivation
- Define the general problem and societal impact.
- Describe why the problem (e.g., reconstructing and forecasting spatiotemporal) matters in geophysics.
- Identify multi-scale examples (e.g., wave propagation, turbulence, geophysical imaging).
B. Traditional / Physics-Based Approaches
- Discuss non-ML methods (e.g., PDE solvers, spectral methods, compressed sensing, and Kalman filters).
- Note their successes and limitations in high-dimensional or multi-scale systems.
C. Machine Learning-Based Approaches
- Summarize ML frameworks (CNNs, RNNs, Transformers, autoencoders, GANs, SHRED architectures).
- Discuss how they model spatial and temporal correlations.
- Highlight examples and benchmarks from physics or geosciences.
D. Limitations and Open Challenges
- Describe what remains difficult: generalization, interpretability, physical consistency, scalability.
- Identify hybrid methods (e.g., physics-informed networks) and what problems they solve.
E. Synthesis and Research Gaps
- Summarize 2β3 key research gaps that motivate further work.
π How to Find and Select Literature
πΉ Google Scholar
- Start with Google Scholar.
- Use targeted search terms such as:
spatiotemporal reconstruction physics
multi-scale forecasting physical systems
wavefield reconstruction machine learning
physics-informed neural networks multiscale
SHRED architecture geophysical data
- Sort by recent papers (e.g., since 2018) to find the latest work.
- Read the introductions of these papers to find more papers to read or use the βCited byβ feature to discover related or follow-up studies.
πΉ ASTA (Allen Institute for AI)
- Explore the ASTA search engine.
- It provides semantic access to scientific papers and can help identify connected ideas across disciplines.
- Use it to find papers related to physics-based ML, wavefield modeling, and spatiotemporal forecasting.
πΉ Researchersβ Websites
- Visit websites of researchers who publish in this area (e.g., in geophysics, applied math, AI for physics).
- Look for:
- Preprints or open-access PDFs not yet indexed by databases.
- Project pages or GitHub links that provide code, figures, and datasets.
- Identify multiple research groups from different institutions and countries to ensure your review reflects a broad perspective.
2. Read Strategically
-
Follow the citation trail:
When reading a paper, examine what it cites in its introduction. These cited works often explain foundational ideas or widely accepted methods β read them next.
-
Diversify your sources:
Cite papers from at least 3β4 different research groups in different institutions or countries to ensure breadth and representation of the global research landscape.
-
Assess quality and relevance:
Prioritize peer-reviewed papers but include influential arXiv preprints when relevant to recent machine learning advances.
-
Read efficiently:
Start with abstracts, introductions, and figures to grasp the method and findings before reading the entire paper.
3. Keep Notes Systematically
Create a table or spreadsheet like this to track your findings:
| Paper |
Year |
Data |
Method |
Application |
ML/Non-ML |
Key Results |
Limitations |
| Pathak et al., 2018 |
2018 |
Synthetic - from benchmark datasets |
ConvLSTM |
Turbulence |
ML |
Good short-term forecast |
Fails on chaotic regimes |
βοΈ Writing Guidelines
- Length: 4β6 pages (β1500β2500 words)
- Include:
- Clear introduction and motivation
- Well-organized review sections
- At least one figure or conceptual diagram (with citation)
- Consistent reference style (APA, IEEE, or similar)
- Use formal academic writing β avoid bullet lists in the final text.