Skip to main content
Home

Navigation Menu

  • Back
  • About
    • Back
    • About

      Contact Us

      Business Address
      5241 Broad Branch Rd. NW

      Washington , DC 20015
      United States place Map
      Call Us (202) 387-640
    • Who We Are
      • Back
      • Leadership
      • Our Blueprint For Discovery
      • Board of Trustees
      • Financial Stewardship
      • Awards & Accolades
      • History
    • Connect with Us
      • Back
      • Outreach & Education
      • Newsletter
      • Yearbook
    • Working at Carnegie
      • Back
      • Applications Open: Postdoctoral Fellowships

    Contact Us

    Business Address
    5241 Broad Branch Rd. NW

    Washington , DC 20015
    United States place Map
    Call Us (202) 387-6400
  • Research
    • Back
    • Research Areas & Topics
    • Research Areas & Topics
      • Back
      • Research Areas
      • From genomes to ecosystems and from planets to the cosmos, Carnegie Science is an incubator for cutting-edge, interdisciplinary research.
      • Astronomy & Astrophysics
        • Back
        • Astronomy & Astrophysics
        • Astrophysical Theory
        • Cosmology
        • Distant Galaxies
        • Milky Way & Stellar Evolution
        • Planet Formation & Evolution
        • Solar System & Exoplanets
        • Telescope Instrumentation
        • Transient & Compact Objects
      • Earth Science
        • Back
        • Earth Science
        • Experimental Petrology
        • Geochemistry
        • Geophysics & Geodynamics
        • Mineralogy & Mineral Physics
      • Ecology
        • Back
        • Ecology
        • Atmospheric Science & Energy
        • Adaptation to Climate Change
        • Water Quality & Scarcity
      • Genetics & Developmental Biology
        • Back
        • Genetics & Developmental Biology
        • Adaptation to Climate Change
        • Developmental Biology & Human Health
        • Genomics
        • Model Organism Development
        • Nested Ecosystems
        • Symbiosis
      • Matter at Extreme States
        • Back
        • Matter at Extreme States
        • Extreme Environments
        • Extreme Materials
        • Mineralogy & Mineral Physics
      • Planetary Science
        • Back
        • Planetary Science
        • Astrobiology
        • Cosmochemistry
        • Mineralogy & Mineral Physics
        • Planet Formation & Evolution
        • Solar System & Exoplanets
      • Plant Science
        • Back
        • Plant Science
        • Adaptation to Climate Change
        • Nested Ecosystems
        • Photosynthesis
        • Symbiosis
    • Divisions
      • Back
      • Divisions
      • Biosphere Sciences & Engineering
        • Back
        • Biosphere Sciences & Engineering
        • About

          Contact Us

          Business Address
          5241 Broad Branch Rd. NW

          Washington , DC 20015
          United States place Map
          Call Us (202) 387-640
        • Research
        • Culture
      • Earth & Planets Laboratory
        • Back
        • Earth & Planets Laboratory
        • About

          Contact Us

          Business Address
          5241 Broad Branch Rd. NW

          Washington , DC 20015
          United States place Map
          Call Us (202) 387-640
        • Research
        • Culture
        • Campus
      • Observatories
        • Back
        • Observatories
        • About

          Contact Us

          Business Address
          5241 Broad Branch Rd. NW

          Washington , DC 20015
          United States place Map
          Call Us (202) 387-640
        • Research
        • Culture
        • Campus
    • Instrumentation
      • Back
      • Instrumentation
      • Our Telescopes
        • Back
        • Our Telescopes
        • Magellan Telescopes
        • Swope Telescope
        • du Pont Telescope
      • Observatories Machine Shop
      • EPL Research Facilities
      • EPL Machine Shop
      • Mass Spectrometry Facility
      • Advanced Imaging Facility
  • People
    • Back
    • People
      Observatory Staff

      Featured Staff Member

      Staff Member

      Staff Member

      Professional Title

      Learn More
      Observatory Staff

      Search For

    • Search All People
      • Back
      • Staff Scientists
      • Leadership
      • Biosphere Science & Engineering People
      • Earth & Planets Laboratory People
      • Observatories People
    Observatory Staff
    Dr. Michael Blanton
    Observatories Director and Crawford M. Greenewalt Chair

    Featured Staff Member

    Observatories Director Michael Blanton

    Dr. Michael Blanton

    Observatories Director and Crawford M. Greenewalt Chair

    Learn More
    Observatory Staff
    Dr. Michael Blanton
    Observatories Director and Crawford M. Greenewalt Chair

    Astronomer Michael R. Blanton joined the Carnegie Science Observatories as its 12th director in January 2026. In this role he oversees astronomical research in Pasadena and telescope operations at Las Campanas Observatory in Chile.

    Search For

    Search All Staff
  • Events
    • Back
    • Events
    • Search All Events
      • Back
      • Public Events
      • Biosphere Science & Engineering Events
      • Earth & Planets Laboratory Events
      • Observatories Events

    Upcoming Events

    Events

    Events

    Lava exoplanet
    Seminar

    Ryan Rubenzahl (Flatiron Institute)

    Isolating astrophysical from instrumental variability at the pixel level in EPRV solar spectra

    April 3

    12:15pm PDT

    Colloquium

    Prof. Kevin Burdge (MIT)

    From 7 minutes to 70000 years: the Renaissance of compact objects in binary (and triple) systems

    April 7

    11:00am PDT

    Seminar

    Natasha Abrams (UC Berkeley)

    Probing Binaries and Black Holes with Microlensing

    April 10

    12:15pm PDT

  • News
    • Back
    • News
    • Search All News
      • Back
      • Biosphere Science & Engineering News
      • Earth & Planets Laboratory News
      • Observatories News
      • Carnegie Science News
    News

    Recent News

    News

    Latest

    • - Any -
    • Biosphere Sciences & Engineering
    • Carnegie Science
    • Earth & Planets Laboratory
    • Observatories
    expand_more
    Read all News
    An ancient immigrant: an artist's conception (not to scale) of the red giant SDSS J0915-7334, which was born near the Large Magellanic Cloud and has now journeyed to reside in the Milky Way. Credit: Navid Marvi/Carnegie Science.
    Breaking News
    April 03, 2026

    Found: Most pristine star in the universe

    This picture of Neptune was produced from the last whole planet images taken through the green and orange filters on NASA's Voyager 2 narrow angle camera. Credit: JPL
    Breaking News
    April 03, 2026

    The depths of Neptune and Uranus may be “superionic”

    An artistic rendering of TOI-5205 b courtesy of NASA
    Breaking News
    April 02, 2026

    How did this get made? Giant planet orbits small star

  • Resources
    • Back
    • Resources
    • Search All
      • Back
      • Employee Resources
      • Scientific Resources
      • Postdoc Resources
      • Media Resources
      • Archival Resources
    • Quick Links
      • Back
      • Employee Intranet
      • Dayforce
      • Careers
      • Observing at LCO
      • Locations and Addresses
  • Donate
    • Back
    • Donate
      - ,

    • Make a Donation
      • Back
      • Support Scientific Research
      • The Impact of Your Gift
      • Carnegie Champions
      • Planned Giving
    Jo Ann Eder

    I feel passionately about the power of nonprofits to bolster healthy communities.

    - Jo Ann Eder , Astronomer and Alumna

    Header Text

    Postdoctoral alumna Jo Ann Eder is committed to making the world a better place by supporting organizations, like Carnegie, that create and foster STEM learning opportunities for all. 

    Learn more arrow_forward
  • Home

Abstract
The complicated story of the Cetus Stream (CS) is recently revealed by its newly discovered similar to 150 members with 6D kinematics from the cross-matched catalog of the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) DR5 K giants and Gaia DR2. It exhibits a very diffuse structure at heliocentric distances between 20 and 50 kpc, extending over at least 100 degrees, and crossing the Galactic plane. Interestingly, The CS is dynamically linked to a massive globular cluster, NGC 5824. A suggestive scenario is that NGC 5824 was the nuclear star cluster of the dwarf progenitor of the CS. We explore this scenario by modeling the disruption process of a dwarf galaxy in the Milky Way potential, on the orbit of NGC 5824, using a suite of N-body simulations. Our results show that the simulated stream can marginally recover the main component of the CS, which is the densest part of the observed stream. Inspired by this mismatch, we use a dwarf progenitor following the representative orbit of the main component members, and find it can reproduce the general morphology of the CS. This gives us a more favorable scenario of the CS progenitor, in which NGC 5824 was not the core, but located off-center. Our fiducial model also predicts a vast extension of the CS in the South, surprisingly coincident with a newly discovered wide southern stream "Palca." Another more diffuse substructure, the Eridanus-Phoenix overdensity is also likely to be related to the CS progenitor.
View Full Publication open_in_new
Xiaobin Zheng 2021 headshot

Xiaobin Zheng

Bioinformatician

 Ross Pedersen 2021 headshot

Ross Pedersen

Postdoctoral Associate

Minjie Hu 2021 headshot

Minjie Hu

Postdoctoral Associate

Abstract
Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F-1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain similar to 10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community.
View Full Publication open_in_new
Abstract
Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F-1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain similar to 10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community.
View Full Publication open_in_new
Abstract
Gene Ontology (GO) annotation is a common task among model organism databases (MODs) for capturing gene function data from journal articles. It is a time-consuming and labor-intensive task, and is thus often considered as one of the bottlenecks in literature curation. There is a growing need for semiautomated or fully automated GO curation techniques that will help database curators to rapidly and accurately identify gene function information in full-length articles. Despite multiple attempts in the past, few studies have proven to be useful with regard to assisting real-world GO curation. The shortage of sentence-level training data and opportunities for interaction between text-mining developers and GO curators has limited the advances in algorithm development and corresponding use in practical circumstances. To this end, we organized a text-mining challenge task for literature-based GO annotation in BioCreative IV. More specifically, we developed two subtasks: (i) to automatically locate text passages that contain GO-relevant information (a text retrieval task) and (ii) to automatically identify relevant GO terms for the genes in a given article (a concept-recognition task). With the support from five MODs, we provided teams with >4000 unique text passages that served as the basis for each GO annotation in our task data. Such evidence text information has long been recognized as critical for text-mining algorithm development but was never made available because of the high cost of curation. In total, seven teams participated in the challenge task. From the team results, we conclude that the state of the art in automatically mining GO terms from literature has improved over the past decade while much progress is still needed for computer-assisted GO curation. Future work should focus on addressing remaining technical challenges for improved performance of automatic GO concept recognition and incorporating practical benefits of text-mining tools into real-world GO annotation.
View Full Publication open_in_new
Abstract
Gene Ontology (GO) annotation is a common task among model organism databases (MODs) for capturing gene function data from journal articles. It is a time-consuming and labor-intensive task, and is thus often considered as one of the bottlenecks in literature curation. There is a growing need for semiautomated or fully automated GO curation techniques that will help database curators to rapidly and accurately identify gene function information in full-length articles. Despite multiple attempts in the past, few studies have proven to be useful with regard to assisting real-world GO curation. The shortage of sentence-level training data and opportunities for interaction between text-mining developers and GO curators has limited the advances in algorithm development and corresponding use in practical circumstances. To this end, we organized a text-mining challenge task for literature-based GO annotation in BioCreative IV. More specifically, we developed two subtasks: (i) to automatically locate text passages that contain GO-relevant information (a text retrieval task) and (ii) to automatically identify relevant GO terms for the genes in a given article (a concept-recognition task). With the support from five MODs, we provided teams with >4000 unique text passages that served as the basis for each GO annotation in our task data. Such evidence text information has long been recognized as critical for text-mining algorithm development but was never made available because of the high cost of curation. In total, seven teams participated in the challenge task. From the team results, we conclude that the state of the art in automatically mining GO terms from literature has improved over the past decade while much progress is still needed for computer-assisted GO curation. Future work should focus on addressing remaining technical challenges for improved performance of automatic GO concept recognition and incorporating practical benefits of text-mining tools into real-world GO annotation.
View Full Publication open_in_new
Abstract
We measure how the properties of star-forming central galaxies correlate with large-scale environment, delta, measured on 10 h(-1) Mpc scales. We use galaxy group catalogues to isolate a robust sample of central galaxies with high purity and completeness. The galaxy properties we investigate are star formation rate (SFR), exponential disc scale length R-exp, and Sersic index of the galaxy light profile, n(S). We find that, at all stellar masses, there is an inverse correlation between SFR and delta, meaning that above-average star-forming centrals live in underdense regions. For n(S) and R-exp, there is no correlation with delta at M-* less than or similar to 10(10.5) M-circle dot, but at higher masses there are positive correlations; a weak correlation with R-exp and a strong correlation with n(S). These data are evidence of assembly bias within the star-forming population. The results for SFR are consistent with a model in which SFR correlates with present-day halo accretion rate, (M) over dot(h). In this model, galaxies are assigned to haloes using the abundance-matching ansatz, which maps galaxy stellar mass onto halo mass. At fixed halo mass, SFR is then assigned to galaxies using the same approach, but. (M) over dot(h) is used to map onto SFR. The best-fitting model requires some scatter in the (M) over dot(h)-SFR relation. The R-exp and n(S) measurements are consistent with a model in which both of these quantities are correlated with the spin parameter of the halo, lambda. Halo spin does not correlate with delta at low halo masses, but for higher mass haloes, high-spin haloes live in higher density environments at fixed M-h. Put together with the earlier instalments of this series, these data demonstrate that quenching processes have limited correlation with halo formation history, but the growth of active galaxies, as well as other detailed galaxies properties, are influenced by the details of halo assembly.
View Full Publication open_in_new
Abstract
We measure how the properties of star-forming central galaxies correlate with large-scale environment, delta, measured on 10 h(-1) Mpc scales. We use galaxy group catalogues to isolate a robust sample of central galaxies with high purity and completeness. The galaxy properties we investigate are star formation rate (SFR), exponential disc scale length R-exp, and Sersic index of the galaxy light profile, n(S). We find that, at all stellar masses, there is an inverse correlation between SFR and delta, meaning that above-average star-forming centrals live in underdense regions. For n(S) and R-exp, there is no correlation with delta at M-* less than or similar to 10(10.5) M-circle dot, but at higher masses there are positive correlations; a weak correlation with R-exp and a strong correlation with n(S). These data are evidence of assembly bias within the star-forming population. The results for SFR are consistent with a model in which SFR correlates with present-day halo accretion rate, (M) over dot(h). In this model, galaxies are assigned to haloes using the abundance-matching ansatz, which maps galaxy stellar mass onto halo mass. At fixed halo mass, SFR is then assigned to galaxies using the same approach, but. (M) over dot(h) is used to map onto SFR. The best-fitting model requires some scatter in the (M) over dot(h)-SFR relation. The R-exp and n(S) measurements are consistent with a model in which both of these quantities are correlated with the spin parameter of the halo, lambda. Halo spin does not correlate with delta at low halo masses, but for higher mass haloes, high-spin haloes live in higher density environments at fixed M-h. Put together with the earlier instalments of this series, these data demonstrate that quenching processes have limited correlation with halo formation history, but the growth of active galaxies, as well as other detailed galaxies properties, are influenced by the details of halo assembly.
View Full Publication open_in_new

Pagination

  • Previous page chevron_left
  • …
  • Page 634
  • Page 635
  • Page 636
  • Page 637
  • Current page 638
  • Page 639
  • Page 640
  • Page 641
  • Page 642
  • …
  • Next page chevron_right
Subscribe to

Get the latest

Subscribe to our newsletters.

Privacy Policy
Home
  • Instagram instagram
  • Twitter twitter
  • Youtube youtube
  • Facebook facebook

Science

  • Biosphere Sciences & Engineering
  • Earth & Planets Laboratory
  • Observatories
  • Our Research Areas
  • Our Blueprint For Discovery

Legal

  • Financial Statements
  • Conflict of Interest Policy
  • Privacy Policy

Careers

  • Working at Carnegie
  • Scientific and Technical Jobs
  • Administrative & Support Jobs
  • Postdoctoral Program
  • Carnegie Connect (For Employees)

Contact Us

  • Contact Administration
  • Media Contacts

Business Address

5241 Broad Branch Rd. NW

Washington, DC 20015

place Map

© Copyright Carnegie Science 2026