Main-sequence stars with brighter than expected mid-infrared emission can signal the presence of a debris disk, rubble from planetary collisions, or even a theorized sign of a technologically advanced civilization. New research demonstrates a data-driven method to identify mid-infrared excesses in main-sequence stars.
An Excess of Emission

This image combines observations from the Hubble Space Telescope and the Atacama Large Millimeter/submillimeter Array to show the dusty debris disk surrounding the star Fomalhaut. [ALMA (ESO/NAOJ/NRAO). Visible light image: the NASA/ESA Hubble Space Telescope A. Fujii/Digitized Sky Survey 2. Acknowledgment: Davide De Martin (ESA/Hubble); CC BY 4.0]
Mid-infrared excesses are hypothesized to signal something even wilder: the presence of a Dyson sphere — a hypothetical artificial structure created by an advanced civilization to harness the power of their home star. Similar to dust and rubble, the components of a Dyson sphere would collect starlight and re-emit it at infrared wavelengths, potentially producing a mid-infrared bump.
Regardless of the cause, excess mid-infrared emission from mature Sun-like stars is something to investigate. But how do we find stars with this feature?
Taking Cues from Data
Gabriella Contardo (International School for Advanced Studies, Italy) and David Hogg (New York University; Flatiron Institute; Max Planck Institute for Astronomy) began their search for mid-infrared excesses with an expansive set of observations from the Gaia spacecraft, the Two Micron All Sky Survey, and the Wide-field Infrared Survey Explorer (WISE). After trimming these data sets down to include only main-sequence Sun-like stars, and to exclude objects that might be contaminated by close neighbors or are too dusty, they reduced the number of stars in their sample from 18,751,187 to 4,898,812.

Difference between observed and predicted magnitude in the WISE W1 and W2 bands (3.4 and 4.6 microns, respectively). Click to enlarge. [Adapted from Contardo & Hogg 2024]
Their data-driven method involves splitting the five million stars into eight sub-samples, each of which is used to train a separate random forest algorithm. Each algorithm “learns” what the mid-infrared emission “should” be from the stars in its sample, then predicts the mid-infrared emission of the stars in the other seven sub-samples. When a star’s actual mid-infrared emission is brighter than predicted, it gets flagged.
To Be Continued

Locations of the 53 stars in the final mid-infrared excess sample. Click to enlarge. [Contardo & Hogg 2024]
What happens now? To identify the stars that are the most promising hosts of extreme debris disks, Contardo and Hogg listed ways to pin down the ages of the stars in their sample, which may rule out stars whose mid-infrared excess is due to their youth. They also proposed to compare the mid-infrared behavior of their stellar sample to Dyson sphere models, exploring whether the observed stellar behavior matches the predictions for these hypothetical structures.
Citation
“A Data-Driven Search for Mid-infrared Excesses Among Five Million Main-Sequence FGK Stars,” Gabriella Contardo and David W. Hogg 2024 AJ 168 157. doi:10.3847/1538-3881/ad6b90