Who We Are

About us

Semantify helps research organizations, repositories, and funders design metadata and PID systems that interoperate — and make sure they get adopted. Sara El-Gebali brings semantic architecture from EMBO, EMBL-EBI, SciLifeLab and DataCite. Xiaoli Chen brings programme leadership from CERN, DataCite, FORCE11 and RDA. We work remotely, globally.

Sara El-Gebali

Semantic infrastructure architect

Semantic infrastructure architect with hands-on production curation experience at EMBL-EBI (Pfam / InterPro), now a DataCite Metadata Specialist driving FAIR Digital Object work and Bioschemas / DataCite schema harmonisation.

Current

  • Metadata Specialist, DataCite (2023–present)
  • Steering Committee, FAIR Digital Objects Forum
  • Co-founder, FAIRPoints; founder, OpenCIDER
  • RDA / EOSC Future Life Sciences Ambassador
  • Governance Committee, Open Life Science

Past

  • Project Leader, Metadata & Curation — SciLifeLab Data Centre / DDLS
  • Head of Research Data Management — MDC Berlin
  • Scientific Database Curator — EMBL-EBI (Pfam, InterPro)
  • Scientific Data Editor — EMBO (SourceData)
  • PhD Biochemistry & Molecular Biology, University of Bern; MSc Queen Mary University of London; BSc Lund

Training & advocacy

  • Open Data module lead — NASA TOPS Open Science 101
  • Carpentries certified instructor
  • Keynote, BOSC 2023; speaker, FOSDEM 2023
  • Featured: FAIRDataPodcast, RSEng Dev Stories

Recognition

  • Wellcome Genome Campus Award for Best Practice in Equality and Diversity (2019)
  • eLife Innovation Leaders 2020 mentee

Representative publications

  • The Pfam protein families database in 2019 — Nucleic Acids Research
  • InterPro in 2019 / 2017 — Nucleic Acids Research
  • SourceData: a semantic platform for curating and searching figures — Nature Methods
  • Ten simple rules for pushing boundaries of inclusion at academic events — PLOS Comp. Biol., 2024
  • Harmonizing Metadata Across Disciplines — Bioschemas and the DataCite Metadata Schema — DataCite blog, 2024

Technical depth

  • RDF
  • RDFS
  • OWL
  • SKOS
  • SPARQL
  • JSON-LD
  • Bioschemas profiles (Sample, BioSample)
  • DataCite linked-data schema
  • Controlled vocabularies & terminology services
  • Scientific curation at scale
  • API-based metadata quality analytics (DataCite, Crossref)
  • Python
  • ElasticSearch / Kibana
  • Git / GitLab
  • Protégé
  • Professional Scrum Product Owner

Xiaoli (Li) Chen

Research infrastructure strategist

Based in Edinburgh, UK

Research infrastructure strategist who leads DataCite's Templeton-funded FAIR Workflows programme across seven international partners, with a research career rooted in user studies of CERN's open-data and analysis-preservation services.

Current

  • FAIR Workflows Project Lead, DataCite (Templeton WCF-funded, 3 years, 7 partners, 38 outputs)
  • Co-chair, RDA “Working with PIDs in Tools” Interest Group

Past

  • Co-chair, DataCite APAC Expert Group
  • Doctoral researcher / data-management collaborator at CERN Scientific Information Service — communications and engagement lead on the EC THOR PID project; user research and information architecture on the CERN Open Data Portal and CERN Analysis Preservation
  • DataCite schema contributor — Award and Preregistration resource types; CSTR as alternative PID
  • FORCE11 summer school instructor (PIDs, RDM)
  • PhD Information Studies, University of Sheffield iSchool (data reuse practices among high-energy physicists, based at CERN); MS, Syracuse iSchool; BA, Beijing Institute of Clothing Technology

Representative publications

  • Chen, X., Dallmeier-Tiessen, S. et al. Open is not enough. Nature Physics 15, 113–119 (2019)
  • Chen, X., Dallmeier-Tiessen, S. et al. CERN Analysis Preservation: A Novel Digital Library Service. TPDL 2016
  • Guide for funders to support FAIR workflows & enable research tracking. Zenodo, 2023 (DataCite × Crossref × ORCID)
  • The Role of Funders in Building a Robust and Trustworthy Output Tracking Mechanism Using PIDs and Open Metadata. Zenodo, 2023

Methods depth

  • Qualitative user research (semi-structured interviews, contextual inquiry)
  • Usability evaluation of scholarly tools
  • Information architecture
  • Stakeholder mapping
  • Multi-partner programme leadership
  • Funder engagement & guidance authoring
  • Adoption strategy & change management
  • Cross-organisational coordination (DataCite × Crossref × ORCID × MPI)
  • Library & information science methodology
  • APAC ecosystem access (CNIC, ARDC, China research-data community)

Want to work with us?

hello@semantify.co