MGED Workgroups and the MAGE Standard: Structuring Microarray Data for the Future
Microarray technology made it possible to measure thousands of genes in a single experiment. But as laboratories started generating more and more microarray data, one big problem appeared:
This is where the MGED Society workgroups and the MAGE standard come in. They created models and formats that help scientists describe microarray experiments clearly and exchange data between different databases and tools.
In this blog, we explain:
- What MGED workgroups are
- What MAGE means
- The role of MAGE-OM, MAGE-ML, and MAGE-TAB
- Why these standards still matter for labs today
1. MGED Society and Its Workgroups
The Microarray Gene Expression Data (MGED) Society was an international community of biologists, bioinformaticians, software developers, and data curators. Their mission was simple but ambitious:
- Improve the quality, annotation, and sharing of microarray data
- Create standards that everyone could use
To achieve this, MGED formed several workgroups, each focused on a specific area:
- MIAME workgroup – defined the minimal information needed about a microarray experiment
- MAGE workgroup – developed data models and formats for storing and exchanging microarray data
- Ontology workgroup – worked on controlled vocabularies and terms for sample and experiment description
- Database and tools workgroups – helped implement standards in real software and repositories
Among these, the MAGE workgroup became especially important for how microarray data is represented in computers.
2. What Is MAGE?
MAGE stands for MicroArray Gene Expression.
It is not a single file or software, but a family of standards designed to describe microarray experiments in a structured way. The idea is to capture:
- What was done (protocols and steps)
- To which samples (biomaterials, labels)
- On which arrays (design, probes, spots)
- With what results (raw and processed data)
MAGE is mainly composed of three components:
- MAGE-OM – Object Model
- MAGE-ML – XML Markup Language
- MAGE-TAB – Tab-delimited format for practical use
Let’s look at each of them.
3. MAGE-OM: The Object Model
MAGE-OM (MicroArray Gene Expression Object Model) is the conceptual heart of the MAGE standard.
You can think of it as a detailed blueprint of all the elements involved in a microarray experiment and how they relate to each other.
MAGE-OM describes objects such as:
-
Biomaterials
- Sources (patients, cell lines, organisms)
- Extracts (RNA, DNA)
- Labeled targets
-
Array design
- Probes and features
- Sequences and annotations
- Layout of spots on the slide
-
Hybridization and scanning
- Hybridization events
- Scanning conditions
- Detectors and channels
-
Data
- Raw intensities
- Normalized values
- Quality measures
By having this common object model, different tools and databases can “speak the same language” when they deal with microarray experiments.
4. MAGE-ML: XML for Data Exchange
Once the object model is defined, we need a way to store and exchange this information. That’s where MAGE-ML comes in.
MAGE-ML (MicroArray Gene Expression Markup Language) is an XML-based format that implements MAGE-OM. It allows:
- Repositories, analysis platforms, and laboratories to export and import microarray data in a common structure
- Storage of both annotations (who did what, to which samples) and data (intensities, log ratios, etc.)
- Long-term archiving of microarray experiments in a machine-readable format
Advantages of MAGE-ML:
- Very rich and detailed
- Capture everything from sample history to data processing steps
- Designed for interoperability between systems
Limitations:
- Complex and verbose
- Not always convenient for everyday manual editing by biologists
Because of this complexity, the community later created a more user-friendly format: MAGE-TAB.
5. MAGE-TAB: Simple Spreadsheets for Real Life
MAGE-TAB was designed to make microarray data annotation easier for normal users.
Instead of writing XML, you can describe experiments using tab-delimited text files (e.g. created in Excel). MAGE-TAB typically consists of:
-
IDF (Investigation Description Format)
- High-level information about the experiment: title, authors, summary, protocols, etc.
-
SDRF (Sample and Data Relationship Format)
- A table describing how samples, arrays, and data files are connected
- Rows often represent “paths” from biological material to data file
Sometimes, there is also:
-
ADF (Array Design Format)
- Describes probes, spots, and array layout
Benefits of MAGE-TAB:
- Human-readable and easy to edit
- Simple to share as text or spreadsheet files
- Widely used in public repositories and analysis tools
In practice, many microarray labs build their MIAME- and MAGE-compliant submissions using MAGE-TAB and then send them to repositories such as ArrayExpress or GEO.
6. How MAGE Workgroups Helped the Community
The MAGE workgroups did more than just define formats. They also:
- Encouraged MIAME-compliant reporting (Minimum Information About a Microarray Experiment)
- Worked with database teams to integrate MAGE into public repositories
- Provided examples, tools and documentation to help labs adopt the standard
- Laid the foundation for future omics data standards, not only microarrays
Their work helped solve several key problems:
- Inconsistent annotation – before standards, each lab documented experiments differently
- Difficult data exchange – each database and software used its own format
- Poor reproducibility – without clear documentation, it was hard to reproduce or compare results
With MAGE, it became much easier to structure and share microarray data.
7. Why MAGE Still Matters in a Post-NGS World
Today, next-generation sequencing (NGS) is everywhere, and microarrays are not as “trendy” as they once were. But MAGE and MGED workgroup ideas remain important:
- Many legacy microarray datasets are still valuable and can be reused in meta-analyses. MAGE-like structures make that possible.
- The principles of clear metadata, minimal information standards, and structured formats influenced later standards for RNA-Seq and other omics technologies.
- In some applications, microarrays are still widely used (for example, certain expression signatures, CGH arrays, or high-throughput screening). For these labs, MAGE/TAB remains very relevant.
So even if your lab is moving toward NGS, the MAGE mindset—well-structured experiments, clear relationships between samples and data files, and consistent annotation—is still essential.
8. What This Means for Your Lab and Equipment Website
If your website sells microarray instruments and genomics equipment, mentioning MAGE and MGED standards can show that you understand not just the hardware, but also the data and standards side.
You can emphasize that your solutions:
- Support MIAME-like documentation
- Fit into workflows where data can be structured using MAGE-TAB/SDRF
- Help researchers generate well-annotated, reusable datasets
For example:
- On your microarray scanner product page, you can mention integration with data extraction and annotation workflows.
9. Conclusion
The MGED workgroups and the MAGE standard were key milestones in the history of microarray data management. Through MAGE-OM, MAGE-ML and MAGE-TAB, they provided:
- A clear structure for describing experiments
- A common language for databases and software
- Practical tools to help labs create well-annotated, shareable data
Even in the era of NGS and multi-omics, the ideas behind MAGE remain highly relevant: if your data is not well described and structured, its long-term value is limited.
By understanding and referencing MAGE in your content, you position your lab or company as not only an equipment provider, but also a partner in good data practice.