TLDR: Researchers have developed a new generative AI method to simplify complex galaxy images into “skeletonized” forms, highlighting only their spiral arms. This approach, using Conditional Generative Adversarial Networks (cGANs), allows for more precise and flexible analysis of galaxy shapes, overcoming limitations of traditional classification methods. The technique has been applied to 125,000 galaxies from the DESI Legacy Survey, creating a public catalog for detailed morphological studies.
The universe is vast, and with modern digital sky surveys, astronomers are collecting images of billions of galaxies. While these images are incredibly detailed, analyzing such a massive volume of data accurately and efficiently poses a significant challenge. Traditional methods often rely on machine learning that classifies galaxies into a set of predefined categories, which can be limiting.
A new research paper introduces an innovative approach to galaxy image analysis that leverages generative artificial intelligence (AI). Instead of categorizing galaxies, this method simplifies their complex images into a “skeletonized” form, essentially highlighting just the fundamental shape of their spiral arms. This simplification allows for more precise measurements of galaxy shapes and enables analysis that isn’t restricted to a fixed set of classes.
The challenge with existing machine learning solutions is that they are typically trained on pre-labeled galaxies, meaning they can only classify new galaxies into those same predefined categories. This can be an oversimplification, as galaxies within the same type can still have unique features. Moreover, these classification methods don’t provide precise quantitative measurements of specific shape elements, and their ability to explore questions outside their training scope is limited. They can also be susceptible to biases from data collection.
The new method employs a Conditional Generative Adversarial Network (cGAN). A cGAN consists of two main components: a generator and a discriminator. The generator learns to create simplified galaxy images, while the discriminator evaluates how realistic these generated images are. Through an adversarial training process, both components improve, with the generator becoming adept at producing images that accurately represent the galaxy’s arms.
To train this system, the researchers used a dataset of spiral galaxy images from the Dark Energy Spectroscopic Instrument (DESI) Legacy Imaging Survey. Each original galaxy image was paired with a manually annotated image where the galaxy arms were marked with white lines. This allowed the cGAN to learn the transformation from a complex galaxy image to its simplified, “skeletonized” counterpart. Data augmentation techniques like blurring, zooming, and flipping were used to enhance the training data’s robustness.
After the initial simplification, some generated images still had broken or shaky arm lines. To address this, a second cGAN was used in a post-processing step. This network was trained to connect and smooth these lines, iteratively refining the output to ensure continuous and clear representations of the galaxy arms. Finally, standard image processing techniques were applied to further isolate and enhance these skeletonized lines, resulting in clean, binary images showing only the arm shapes.
The ultimate goal of this research is to create catalogs of simplified galaxy images that researchers can easily analyze. The method was applied to 125,000 galaxies from the DESI Legacy Survey, resulting in a publicly available catalog. These simplified images, which are essentially binary masks of the galaxy arms, are much easier to process for studying various aspects of galaxy morphology. For instance, researchers can now easily measure the degree of arm curvature, their length, or even count the number of arms, which are difficult to determine from original images.
Also Read:
- A Foundational AI Model for Universal Physics Simulation Emerges
- Building Specialized Image Datasets: The MNIST-Gen Approach
Such detailed analysis can provide new insights into the mass distribution within galaxies, track galaxy evolution over time when redshift data is available, and help understand the large-scale structure of the universe. While the method simplifies images by focusing on the arms, it does lose some information like the galaxy’s bulge or arm thickness. However, for researchers specifically interested in the intricate shapes of galaxy arms, this generative AI approach offers a powerful new tool for astronomical discovery. The code and data for this project are publicly available, including the catalog of simplified images, which can be found at this link.


