Replication Data for: Dataset1 (doi:10.82210/AVHTY0)

Document Description

Citation

Title:

Replication Data for: Dataset1

Identification Number:

doi:10.82210/AVHTY0

Distributor:

Repositório Polen QTY

Date of Distribution:

2025-11-24

Version:

1

Bibliographic Citation:

Pontes, Eduardo, 2025, "Replication Data for: Dataset1", https://doi.org/10.82210/AVHTY0, Repositório Polen QTY, V1, UNF:6:GwmrGo7ALf2s8FUc9nWTaA== [fileUNF]

Study Description

Citation

Title:

Replication Data for: Dataset1

Identification Number:

doi:10.82210/AVHTY0

Authoring Entity:

Pontes, Eduardo (Instituto Politécnico de Tomar)

Other identifications and acknowledgements:

João Pereira

Other identifications and acknowledgements:

Marta Costa

Producer:

Ana Silva

Date of Production:

2025-11-24

Grant Number:

2023/11341/PEX

Distributor:

Repositório Polen QTY

Access Authority:

Pontes, Eduardo

Depositor:

Pontes, Eduardo

Date of Deposit:

2025-11-24

Holdings Information:

https://doi.org/10.82210/AVHTY0

Study Scope

Keywords:

Exact Sciences - Chemical Sciences, Type 2 Diabetes Mellitus, Clinical Data, Anthropometry, Glycated Hemoglobin A, Fasting Blood Glucose, Hepatic Metabolism

Abstract:

This dataset contains clinical, anthropometric, and biochemical information for 160,000 synthetic patients with Type 2 Diabetes Mellitus, generated for testing research data management tools. Variables include age, sex, body mass index (BMI), HbA1c levels, fasting glucose, and LDL cholesterol. The dataset simulates an observational study linked to the project “Measuring hepatic polyol pathway activity and connecting it with lipogenic glucose metabolism in Type 2 Diabetes patients” (ref. 2023.11517.PEX). All data are fully anonymized and synthetic, with no relation to real individuals.

Notes:

This dataset consists exclusively of synthetic data, generated to test upload, ingestion, interoperability, and metadata validation features in research data management platforms and repositories. It does not represent real patient data and does not require ethical approval. Values were generated randomly within physiologically plausible ranges.
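The note above says the values were drawn at random within physiologically plausible ranges. The following is a minimal sketch of how comparable records could be produced, not the depositor's actual generator. The ranges are the Min. and Max. values listed under Variable Description; the uniform distribution, the patient_id format, the sex coding, and the one-decimal rounding are all assumptions made for illustration.

```python
import random

# Ranges copied from the Min./Max. values in the Variable Description.
# Uniform sampling is an assumption; the record does not name a distribution.
AGE_RANGE = (30, 84)         # whole years
BMI_RANGE = (20.0, 40.0)
HBA1C_RANGE = (5.5, 12.5)    # percent


def make_patient(index, rng):
    """Return one synthetic record keyed by the codebook's variable names."""
    return {
        "patient_id": f"P{index:06d}",           # hypothetical ID format
        "age_years": rng.randint(*AGE_RANGE),    # inclusive on both ends
        "sex": rng.choice(["F", "M"]),           # hypothetical coding
        "bmi": round(rng.uniform(*BMI_RANGE), 1),
        "hba1c_percent": round(rng.uniform(*HBA1C_RANGE), 1),
    }


def make_dataset(n, seed=0):
    """Build n records from a seeded generator so runs are repeatable."""
    rng = random.Random(seed)
    return [make_patient(i + 1, rng) for i in range(n)]


patients = make_dataset(1000)
```

Seeding the generator keeps a test fixture stable between runs, which is convenient when the same file is re-ingested to compare repository behaviour.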

Methodology and Processing

Sources Statement

Data Access

Notes:

CC0 1.0 (http://creativecommons.org/publicdomain/zero/1.0)

Other Study Description Materials

File Description--f388

File: T2D_baseline_clinical_large.tab

  • Number of cases: 160000

  • No. of variables per record: 5

  • Type of File: text/tab-separated-values

Notes:

UNF:6:GwmrGo7ALf2s8FUc9nWTaA==
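The case and variable counts above can be checked after download. The sketch below uses only the Python standard library and assumes the .tab file starts with a header row of variable names; the helper names read_tab and check_shape are hypothetical, and the sample rows are made up for illustration.

```python
import csv
import io

EXPECTED_VARIABLES = 5      # "No. of variables per record" above
EXPECTED_CASES = 160000     # "Number of cases" above


def read_tab(handle):
    """Parse tab-separated text into (header, rows); assumes a header row."""
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    rows = [row for row in reader if row]   # ignore blank lines
    return header, rows


def check_shape(header, rows, expected_variables, expected_cases):
    """Return a list of problems; an empty list means the shape matches."""
    problems = []
    if len(header) != expected_variables:
        problems.append(
            f"expected {expected_variables} variables, found {len(header)}")
    if len(rows) != expected_cases:
        problems.append(f"expected {expected_cases} cases, found {len(rows)}")
    for number, row in enumerate(rows, start=1):
        if len(row) != len(header):
            problems.append(f"data row {number}: {len(row)} fields")
    return problems


# Tiny in-memory stand-in for the real file; values are illustrative only.
sample = io.StringIO(
    "patient_id\tage_years\tsex\tbmi\thba1c_percent\n"
    "P000001\t57\tF\t30.0\t9.0\n"
    "P000002\t45\tM\t25.5\t7.2\n"
)
header, rows = read_tab(sample)
problems = check_shape(header, rows, EXPECTED_VARIABLES, expected_cases=2)
```

For the real file, the same two calls would take a handle from open("T2D_baseline_clinical_large.tab", newline="", encoding="utf-8") and EXPECTED_CASES as the case count; the UTF-8 encoding is an assumption.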

Variable Description

List of Variables:

Variables

patient_id

f388 Location:

Variable Format: character

Notes: UNF:6:gklrWfBlrkotHUpBKSIprA==

age_years

f388 Location:

Summary Statistics: Valid 160000.0; Min. 30.0; Max. 84.0; StDev 15.848012925816244; Mean 57.01638749999999

Variable Format: numeric

Notes: UNF:6:01uec+Rmrw/wo2vkJu0NVw==

sex

f388 Location:

Variable Format: character

Notes: UNF:6:apSLHaQS3JcWEJ2yUxJK6A==

bmi

f388 Location:

Summary Statistics: Valid 160000.0; Min. 20.0; Max. 40.0; StDev 5.774077900534897; Mean 30.00978

Variable Format: numeric

Notes: UNF:6:nNRa8vS/rNw64lX/loNsdQ==

hba1c_percent

f388 Location:

Summary Statistics: Valid 160000.0; Min. 5.5; Max. 12.5; StDev 2.0196868106103687; Mean 9.000553749999998

Variable Format: numeric

Notes: UNF:6:jLYTHrrJYDGxdFoAxxV2Yg==
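The summary statistics listed for the numeric variables can be recomputed from the data and compared with the codebook. This is a minimal sketch with hypothetical helper names (summarize, matches). The record does not say whether StDev is the sample or population form, so the sample form used here is an assumption; with 160000 cases the two differ only far down the decimals. The toy list is made up to exercise the functions.

```python
import statistics


def summarize(values):
    """Compute the five figures the codebook lists for a numeric variable."""
    return {
        "Valid": len(values),
        "Min.": min(values),
        "Max.": max(values),
        "Mean": statistics.fmean(values),
        "StDev": statistics.stdev(values),   # sample form; an assumption
    }


def matches(computed, documented, tolerance=1e-6):
    """True when every documented figure agrees within the tolerance."""
    return all(
        abs(computed[name] - documented[name]) <= tolerance
        for name in documented
    )


# Figures documented above for the bmi variable.
BMI_DOCUMENTED = {
    "Valid": 160000,
    "Min.": 20.0,
    "Max.": 40.0,
    "Mean": 30.00978,
    "StDev": 5.774077900534897,
}

# Toy values, not taken from the dataset.
toy = [20.0, 25.0, 30.0, 35.0, 40.0]
toy_summary = summarize(toy)
toy_agrees = matches(toy_summary, BMI_DOCUMENTED)   # False for this toy list
```

Passing the full bmi column from the downloaded file to summarize and then to matches with BMI_DOCUMENTED would show whether the file agrees with the documented figures.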

Other Study-Related Materials

Label:

T2D_baseline_labs_large.xlsx

Notes:

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Other Study-Related Materials

Label:

T2D_variables_codebook_large.pdf

Notes:

application/pdf