Title: Drugs Databases Parser
Version: 2.0.3
Description: This tool is for parsing public drug databases such as 'DrugBank' XML database https://go.drugbank.com/. The parsed data are then returned in a proper 'R' object called 'dvobject'.
License: MIT + file LICENSE
Encoding: UTF-8
Imports: dplyr, progress, purrr, tibble, tools, XML
RoxygenNote: 7.2.3
Suggests: canvasXpress, knitr, rmarkdown, testthat, tidyr
VignetteBuilder: knitr
URL: https://docs.ropensci.org/dbparser/, https://github.com/ropensci/dbparser
BugReports: https://github.com/ropensci/dbparser/issues
Depends: R (≥ 3.5)
NeedsCompilation: no
Packaged: 2024-04-20 06:40:43 UTC; LEGION
Author: Mohammed Ali [aut, cre], Ali Ezzat [aut], Hao Zhu [rev], Emma Mendelsohn [rev]
Maintainer: Mohammed Ali <moh_fcis@yahoo.com>
Repository: CRAN
Date/Publication: 2024-04-20 06:52:52 UTC

dbparser: A package for reading and parsing DrugBank xml database.

Description

The main purpose of the 'dbparser' package is to parse [DrugBank](https://go.drugbank.com/) database which is downloadable in XML format from [this link](https://go.drugbank.com/releases/latest).

Details

The parsed data can then be explored and analyzed.

To achieve this purpose, 'dbparser“ package provides three main categories of functions:

- xml db reader,

- DrugBank elements parsers,

For more information kindly check the reference/index (https://docs.ropensci.org/dbparser/reference/index.html)

xml db reader functions

Reads DrugBank xml database and build drug elements full tree in memory

parsers functions

Each parser function is responsible of parsing certain drug element and returning its tibble.

Check this tutorial (https://docs.ropensci.org/dbparser/articles/dbparser.html)

Author(s)

Maintainer: Mohammed Ali moh_fcis@yahoo.com

Authors:

Other contributors:

See Also

Useful links:


add_drugbank_info Add passed DrugBank db metadata to passed dvobject

Description

add_drugbank_info Add passed DrugBank db metadata to passed dvobject

Usage

add_drugbank_info(dvobject)

Value

dvobject


Drugs/ Carriers/ Enzymes/ Targets/ Transporters articles element parser

Description

Return a list of articles that were used as references for drugs carriers

Usage

drugs_articles()

carriers_articles()

enzymes_articles()

targets_articles()

transporters_articles()

Value

a tibble with 4 variables:

ref-id

Identifier for the article being referenced. This is unique across all reference types (books, links, article, attachments).

pubmed-id

The PubMed identifier for the article.

citation

Article citation in a standard format.

parent_id

drug/carrier/target/enzyme/transporter id


Drugs/ Carriers/ Enzymes/ Targets/ Transporters attachments element parser

Description

Return a list of attachment that were used as references for drugs carriers

Usage

drugs_attachments()

carriers_attachments()

enzymes_attachments()

targets_attachments()

transporters_attachments()

Value

a tibble with 4 variables:

ref-id

Identifier for the article being referenced. This is unique across all reference types (books, links, article, attachments).

title

The title of the attachment.

url

The url to download the attachment from.

parent_id

drug/carrier/target/enzyme/transporter id


Drugs/ Carriers/ Enzymes/ Targets/ Transporters books element parser

Description

Return a list of text books that were used as references for drugs, carriers, enzymes, targets or transporters

Usage

drugs_textbooks()

carriers_textbooks()

enzymes_textbooks()

targets_textbooks()

transporters_textbooks()

Value

a tibble with 4 variables:

ref-id

Identifier for the article being referenced. This is unique across all reference types (books, links, article, attachments).

isbn

ISBN identifying the textbook.

citation

A Textbook citation in a standard format.

parent_id

drug/ carrier/ target/ enzyme/ transporter id


Carriers/ Enzymes/ Targets/ Transporters Actions parsers

Description

Collection of related actions

Usage

carriers_actions()

enzymes_actions()

targets_actions()

transporters_actions()

Value

a tibble with 2 variables:

action

describe related action

parent_id

carrier/ target/ enzyme/ transporter id


Carriers/ Enzymes/ Targets/ Transporters parsers

Description

Protein targets of drug action, enzymes that are inhibited/induced or involved in metabolism, and carrier or transporter proteins involved in movement of the drug across biological membranes.

Usage

carriers()

enzymes()

targets()

transporters()

Value

a tibble with 6 variables (8 for enzymes):

id

Universal Protein Resource (UniProt) Identifier for the record

name

related name

organism

Organism that the protein comes from.

known_action

Whether the pharmacological action of the drug is due to this target interaction.

inhibition-strength

Whether the strength of enzyme inhibition is strong, moderate, or unknown. Only applies to enzymes

induction-strength

Whether the strength of enzyme induction is strong or unknown. Only applies to enzymes

position

related position

parent_id

drugbank id


Carriers/ Enzymes/ Targets/ Transporters Polypeptide External Identifiers parsers

Description

Extract descriptions of identified polypeptide external identifiers for targets, enzymes, carriers, or transporters.

Usage

carriers_polypep_ex_ident()

enzymes_polypep_ex_ident()

targets_polypep_ex_ident()

transporters_polypep_ex_ident()

Value

a tibble with 3 variables:

resource

Name of the source database.

identifier

Identifier for this drug in the given resource.

parent_key

polypeptide id


Carriers/ Enzymes/ Targets/ Transporters Polypeptide GO Classifier parsers

Description

Extract descriptions of identified polypeptide go classifier for targets, enzymes, carriers, or transporters.

Usage

carriers_polypeptides_go()

enzymes_polypeptides_go()

targets_polypeptides_go()

transporters_polypeptides_go()

Value

a tibble with 3 variables:

category
description
parent_key

polypeptide id


returns carriers, enzymes,targets and transporters node valid options.

Description

returns carriers, enzymes,targets and transporters node valid options.

Usage

cett_nodes_options()

Value

list of CETT valid options

See Also

Other parsers: drug_node_options(), parseDrugBank(), references_node_options()


Carriers/ Enzymes/ Targets/ Transporters Polypeptide parsers

Description

Extract descriptions of identified polypeptide targets, enzymes, carriers, or transporters.

Usage

carriers_polypeptides()

enzymes_polypeptides()

targets_polypeptides()

transporters_polypeptides()

Value

a tibble with 20 variables:

id

Universal Protein Resource (UniProt) identifier

source

Specifies whether the identified polypeptide ID is associated with any of the following UniProt knowledge bases: Swiss-Prot, which is manually annotated and reviewed, or TrEMBL, which is automatically annotated and not reviewed.

name
general_function

General summary of the physiological function of the polypeptide

specific_function

A more specific description of the polypeptide’s physiological function within the cell.

gene_name

The short name commonly associated with the associated gene. Eg. PTGS1.

locus

The specific chromosomal location or position of the gene’s sequence on a chromosome.

cellular_location

The cellular location of the polypeptide.

transmembrane_regions

Areas of the polypeptide sequence that span a biological membrane.

signal_regions

Location of any signal peptides within the polypeptide sequence.

theoretical_pi

Theoretical isoelectric point.

molecular_weight

The molecular weight of the polypeptide.

chromosome_location

The chromosomal location of the polypeptide gene

organism

The organism in which this polypeptide functions.

organism_ncbi_taxonomy_id
amino_acid_sequence

The amino acid sequence of the polypeptide

amino_acid_format
gene_sequence

The sequence of the associated gene.

gene_format
parent_key

carrier/ target/ enzyme/ transporter id


Carriers/ Enzymes/ Targets/ Transporters Polypeptide PFAMS parsers

Description

Extract descriptions of identified polypeptide PFAMS targets, enzymes, carriers, or transporters.

Usage

carriers_polypeptides_pfams()

enzymes_polypeptides_pfams()

targets_polypeptides_pfams()

transporters_polypeptides_pfams()

Value

a tibble with 3 variables:

name

The sequence of the associated gene.

identifier
parent_key

polypeptide id


Carriers/ Enzymes/ Targets/ Transporters Polypeptide Synonyms parsers

Description

Extract descriptions of identified polypeptide synonyms for targets, enzymes, carriers, or transporters.

Usage

carriers_polypeptides_syn()

enzymes_polypeptides_syn()

targets_polypeptides_syn()

transporters_polypeptides_syn()

Value

a tibble with 2 variables:

synonym
parent_key

polypeptide id


Drug Affected Organism parser

Description

Organisms in which the drug may display activity; activity may depend on local susceptibility patterns and resistance.

Usage

drug_affected_organisms()

Value

a tibble with 2 variables:

affected-organism

affected-organism name

drugbank_id

drugbank id


Drug ahfs-codes parser

Description

The American Hospital Formulary Service (AHFS) identifier for this drug.

Usage

drug_ahfs_codes()

Value

a tibble with the following variables:

ahfs-code
drugbank_id

drugbank id


Drug ATC Codes element parser

Description

The Anatomical Therapeutic Classification (ATC) code for the drug assigned by the World Health Organization Anatomical Chemical Classification System.

Usage

drug_atc_codes()

Details

Each 'atc-code“ row has one or more level. The atc-code and level> have a code the code assigned by the World Health Organization Anatomical Therapeutic Chemical Classification system.

Value

a tibble with 10 variables


Drug Calculated Properties parser

Description

Drug properties that have been predicted by ChemAxon or ALOGPS based on the imputed chemical structure. Associated links below will redirect to descriptions of the specific term.

Usage

drug_calc_prop()

Value

a tibble with 4 variables:

kind

Name of the property.

value

Predicted physicochemical properties; obtained by the use of prediction software such as ALGOPS and ChemAxon.

source

Name of the software used to calculate this property, either ChemAxon or ALOGPS.

drugbank_id

drugbank id


Drug Categories parser

Description

General categorizations of the drug.

Usage

drug_categories()

Value

a tibble with 2 variables:

category

category name

mesh-id

The Medical Subjects Headings (MeSH) identifier for the category.

drugbank_id

drugbank id


Drug Classification parser

Description

A description of the hierarchical chemical classification of the drug; imported from ClassyFire.

Usage

drug_classification()

Value

a tibble with 9 variables:

description
direct-parent
kingdom
superclass
class
subclass
alternative-parent

One or more alternative parents

substituent

One or more substituents

drugbank_id

drugbank id


Drug Dosages parser

Description

A list of the commercially available dosages of the drug.

Usage

drug_dosages()

Value

a tibble with the following variables:

form

The pharmaceutical formulation by which the drug is introduced into the body

route

The path by which the drug or product is taken into the body.

strength

The amount of active drug ingredient provided in the dosage

drugbank_id

drugbank id


Drug External Identifiers parser

Description

Identifiers used in other websites or databases providing information about this drug.

Usage

drug_ex_identity()

Value

a tibble with the following variables:

resource

Name of the source database.

identifier

Identifier for this drug in the given resource.

drugbank_id

drugbank id


Drug Experimental Properties parser

Description

Drug properties that have been experimentally proven

Usage

drug_exp_prop()

Value

a tibble with the following variables:

kind

Name of the property.

value

Drug properties that have been experimentally proven.

source

Reference to the source of this experimental data.

drugbank_id

drugbank id

The following experimental properties are provided:

Water Solubility

The experimentally determined aqueous solubility of the molecule.

Molecular Formula

Protein formula of Biotech drugs

Molecular Weight

Protein weight of Biotech drugs.

Melting Point

The experimentally determined temperature at which the drug molecule changes from solid to liquid at atmospheric temperature.

Boiling Point

The experimentally determined temperature at which the drug molecule changes from liquid to gas at atmospheric temperature.

Hydrophobicity

The ability of a molecule to repel water rather than absorb or dissolve water.

Isoelectric Point

The pH value at which the net electric charge of a molecule is zero.

caco2 Permeability

A continuous line of heterogenous human epithelial colorectal adenocarcinoma cells, CAC02 cells are employed as a model of human intestinal absorption of various drugs and compounds. CAC02 cell permeability is ultimately an assay to measure drug absorption.

pKa

The experimentally determined pka value of the molecule

logP

The experimentally determined partition coefficient (LogP) based on the ratio of solubility of the molecule in 1-octanol compared to water.

logS

The intrinsic solubility of a given compound is the concentration in equilibrium with its solid phase that dissolves into solution, given as the natural logarithm (LogS) of the concentration.

Radioactivity

The property to spontaneously emit particles (alpha, beta, neutron) or radiation (gamma, K capture), or both at the same time, from the decay of certain nuclides.


Description

Links to other websites or databases providing information about this drug.

Usage

drug_external_links()

Value

a tibble with the following variables:

resource

Name of the source website.

identifier

Identifier for this drug in the given resource

drugbank_id

drugbank id


Drug Groups parser

Description

Food that may interact with this drug.

Usage

drug_food_interactions()

Value

a tibble with the following variables:

food-interaction
drugbank_id

drugbank id


Drugs General Information parser

Description

A description of the hierarchical chemical classification of the drug; imported from ClassyFire.

Usage

drug_general_information()

Value

a tibble with 15 variables:

primary_key

DrugBank id

other_keys

Other identifiers that may be associated with the drug

type

Either small molecule, or biotech. Biotech is used for any drug that is derived from living systems or organisms, usually composed of high molecular weight mixtures of protein, while small molecule describes a low molecular weight organic compound.

name
created

Date that this drug was first added to DrugBank.

updated

Denotes when this drug was last updated in DrugBank.

description

Descriptions of drug chemical properties, history and regulatory status.

cas_number

The Chemical Abstracts Service (CAS) registry number assigned to the drug.

unii

Unique Ingredient Identifier (UNII) of this drug.

average_mass

The weighted average of the isotopic masses of the drug

state

One of solid, liquid, or gas

monoisotopic_mass

The mass of the most abundant isotope of the drug

synthesis_reference

Citation for synthesis of the drug molecule.

fda_label

Contains a URL for accessing the uploaded United States Food and Drug Administration (FDA) Monograph for this drug.

msds

Contains a URL for accessing the Material Safety Data Sheet (MSDS) for this drug.


Drug Groups parser

Description

Groups that this drug belongs to. May include any of: approved, vet_approved, nutraceutical, illicit, withdrawn, investigational, and experimental.

Usage

drug_groups()

Value

a tibble with 2 variables:

group
drugbank_id

drugbank id


Drug Interactions parser

Description

Drug-drug interactions detailing drugs that, when administered concomitantly with the drug of interest, will affect its activity or result in adverse effects. These interactions may be synergistic or antagonistic depending on the physiological effects and mechanism of action of each drug.

Usage

drug_interactions()

Value

a tibble with the following variables:

drugbank-id

DrugBank ID of the interacting drug.

name

Name of the interacting drug.

description

Textual description of the physiological consequences of the drug interaction

drugbank_id

parent drugbank id


Drug International Brands parser

Description

The proprietary names used by the manufacturers for commercially available forms of the drug, focusing on brand names for products that are available in countries other than Canada and the Unites States.

Usage

drug_intern_brand()

Value

a tibble with 4 variables:

brand

The proprietary, well-known name for given to this drug by a manufacturer.

company

The company or manufacturer that uses this name.

drugbank_id

drugbank id


Drug Manufacturers parser

Description

A list of companies that are manufacturing the commercially available forms of this drug that are available in Canada and the Unites States.

Usage

drug_manufacturers()

Value

a tibble with the following variables:

generic

A list of companies that are manufacturing the generic form of the drug.

url

A link to the companies that are manufacturing the drug.

drugbank_id

drugbank id


Drug Mixtures parser

Description

All commercially available products in which this drug is available in combination with other drug molecules

Usage

drug_mixtures()

Value

a tibble with 4 variables:

name

The proprietary name provided by the manufacturer for this combination product.

ingredients

A list of ingredients, separated by addition symbols

supplemental-ingredients

List of additional active ingredients which are not clinically relevant to the main indication of the product, separated by addition symbols.

drugbank_id

drugbank id


returns drug node valid options.

Description

returns drug node valid options.

Usage

drug_node_options()

Value

list of drug valid options

See Also

Other parsers: cett_nodes_options(), parseDrugBank(), references_node_options()


Drug Packagers parser

Description

A list of companies that are packaging the drug for re-distribution.

Usage

drug_packagers()

Value

a tibble with 2 variables:

name
url

A link to any companies that are packaging the drug for re-distribution.

drugbank_id

drugbank id


Drug Patents parser A property right issued by the United States Patent and Trademark Office (USPTO) to an inventor for a limited time, in exchange for public disclosure of the invention when the patent is granted. Drugs may be issued multiple patents.

Description

Drug Patents parser A property right issued by the United States Patent and Trademark Office (USPTO) to an inventor for a limited time, in exchange for public disclosure of the invention when the patent is granted. Drugs may be issued multiple patents.

Usage

drug_patents()

Value

a tibble with the following variables:

number

The patent number(s) associated with the drug.

country

The country that issued the patent rights.

approved

The date that the patent request was filed.

expires

The date that the patent rights expire.

pediatric-extension

Indicates whether or not a pediatric extension has been approved for the patent. Granted pediatric extensions provide an additional 6 months of market protection.

drugbank_id

drugbank id


Drug Pathway parser

Description

Metabolic, disease, and biological pathways that the drug is involved in, as identified by the Small Molecule Protein Database (SMPDB).

Usage

drug_pathway()

Value

a tibble with the following variables:

smpdb-id

Small Molecule Pathway Database identifier for this pathway.

name

Pathway name

category

Pathway category

drugbank_id

drugbank id


Drug Pathway Drugs parser

Description

Drugs involved in this pathway.

Usage

drug_pathway_drugs()

Value

a tibble with pathway drugs properties


Drug Pathway Enzymes parser

Description

Enzymes involved in this pathway.

Usage

drug_pathway_enzyme()

Value

a tibble with pathway properties


Drug pdb-entries parser

Description

Protein Data Bank (PDB) identifiers for this drug.

Usage

drug_pdb_entries()

Value

a tibble with the following variables:

pdb-entry
drugbank_id

drugbank id


Drug Pharmacology parser

Description

Describes the use, mechanism of action, pharmacokinetics, pharmacodynamics, and physiological or biochemical effects in the body.

Usage

drug_pharmacology()

Value

a tibble with the following variables:

indication

The approved conditions, diseases, or states for which a drug can safely and effectively be used. An indication is considered to be FDA-approved when it has any of the following designations: NDA, ANDA, BLA, or OTC. May also include indications in other countries, such as Canada (through Health Canada) or in Europe (through the European Medicines Agency).

pharmacodynamics

A description of how the drug modifies or affects the organism it is being used in. May include effects in the body that are desired (enzyme or protein targets for example) and undesired (also known as “side effects”). This is in contrast to pharmacokinetics, which describes how the body modifies the drug being used.

mechanism_of_action

A component of pharmacodynamics that describes the biochemical interaction through which a drug produces its intended effect. May include the exact molecular protein or enzyme targets and/or a description of the physiological effects produced.

toxicity

Any adverse reaction, or side effect, that may or may not occur with use of the drug. May be attributed to a number of effects including: an enhanced therapeutic effect, rare anaphylactic reactions, interactions with other medications, or unanticipated binding of the molecule at different sites within the body.

metabolism

A description of the chemical degradation of the drug molecule within the body; most commonly by enzymes from the Cytochrome P450 (CYP) system in the liver.

absorption

A description of the movement of the drug from the site of administration into the bloodstream or target tissue. Common pharmacokinetic metrics used to evaluate absorption include Area Under the Curve (AUC), bioavailability (F), maximum concentration (Cmax), and time to maximum concentration (Tmax).

half-life

The period of time it takes for the amount of drug in the body to be reduced by one half. Provides a description of how quickly the drug is being eliminated and how much is available in the bloodstream.

protein-binding

A description of the drug’s affinity for plama proteins and the proportion of the drug that is bound to them when in circulation within the body.

route_of_elimination

A description of the pathway that is used to excrete the drug from the body. Common pharmacokinetic parameters used to evaluate excretion include elimination half life, renal clearance, and tracking of radiolabelled compounds through the renal and GI system.

volume_of_distribution

The Vd of a drug represents the degree to which it is distributed into body tissue compared to the plasma.

clearance

A pharmacokinetic measurement of the rate of removal of the drug from plasma, expressed as mL/min; reflects the rate of elimination of the drug.

drugbank_id

drugbank id


Drug Prices Parsers

Description

Unit drug prices

Usage

drug_prices()

Value

a tibble with 5 variables:

description
cost

Drug price per unit

currency

Currency of price, example: US.

unit
parent_id

drugbank id


Drug Products parser

Description

A list of commercially available products in Canada and the United States that contain the drug.

Usage

drug_products()

Value

a tibble with 32 variables:

name

The proprietary name(s) provided by the manufacturer for any commercially available products containing this drug.

labeller

The corporation responsible for labelling this product.

ndc-id

The National Drug Code (NDC) identifier of the drug

ndc-product-code

The National Drug Code (NDC) product code from the FDA National Drug Code directory.

dpd-id

Drug Product Database (DPD) identification number (a.k.a. DIN) from the Canadian Drug Product Database. Only present for drugs that are marketed in Canada

ema-product-code

EMA product code from the European Medicines Agency Database. Only present for products that are authorized by central procedure for marketing in the European Union.

ema-ma-number

EMA marketing authorization number from the European Medicines Agency Database. Only present for products that are authorized by central procedure for marketing in the European Union.

started-marketing-on

The starting date for market approval.

ended-marketing-on

The ending date for market approval.

dosage-form

The pharmaceutical formulation by which the drug is introduced into the body.

strength

The amount of active drug ingredient provided in the dosage

route

The path by which the drug or product is taken into the body

fda-application-number

The New Drug Application [NDA] number assigned to this drug by the FDA.

over-the-counter

A list of Over The Counter (OTC) forms of the drug.

generic

Whether this product is a generic drug.

approved

Indicates whether this drug has been approved by the regulating government.

country

The country where this commercially available drug has been approved.

source

Source of this product information. For example, a value of DPD indicates this information was retrieved from the Canadian Drug Product Database.

standing

One of good, discordant, or deprecated. Distinguishes products with up to date ingredient information (good) from products with conflicting information (discordant) or products that have been removed from an active label (deprecated).

standing-updated-on

The date on which the standing was last updated

standing-reason

Explains the non-good standing of the product. One of: ingredient_change, code_duplication, invalid, or removed.

jurisdiction-marketing-category

The marketing category of this product in its jurisdiction

branded

Whether this product has a named brand

prescription

Whether this product is only available with a prescription

unapproved

Whether this product is not approved in its jurisdiction

vaccine

Whether this product is a vaccine

allergenic

Whether this product is used in allergenic testing

cosmetic

Whether this product is a cosmetic, such as sunscreen

kit

Whether this product is a kit composed of multiple distinct parts

solo

Whether this product has only a single active ingredient

available

Whether this product can be sold in its jurisdiction

drugbank_id

drugbank id


Drug Reactions Parsers

Description

Extract the sequential representation of the metabolic reactions that this drug molecule is involved in. Depending on available information, this may include metabolizing enzymes, reaction type, substrates, products, pharmacological activity of metabolites, and a structural representation of the biochemical reactions.

Usage

drug_reactions()

Value

a tibble with 5 variables:

sequence

Reactions are displayed within a numerical sequence

left_drugbank_name

The substrate of the reaction. Maybe a drug or a metabolite.

rightt_drugbank_name

The product of the reaction. Maybe a drug or a metabolite.

left_drugbank_id
right_drugbank_id
parent_id

drugbank id


Drug Reactions Enzymes Parsers

Description

Enzymes involved in metabolizing this drug

Usage

drug_reactions_enzymes()

Value

a tibble with 3 variables:

name
uniprot-id
parent_id

drugbank id


Drug Salts parser

Description

Available salt forms of the drug. Ions such as hydrochloride, sodium, and sulfate are often added to the drug molecule to increase solubility, dissolution, or absorption.

Usage

drug_salts()

Value

a tibble with 1 variables:

drugbank-id

DrugBank identifiers of the available salt form(s).

name

Name of the available salt form(s)

unii

Unique Ingredient Identifier (UNII) of the available salt form(s).

cas-number

Chemical Abstracts Service (CAS) registry number assigned to the salt form(s) of the drug.

inchikey

IUPAC International Chemical Identifier (InChi) key identifier for the available salt form(s).

average-mass

Average molecular mass: the weighted average of the isotopic masses of the salt.

monoisotopic-mass

The mass of the most abundant isotope of the salt

smiles

The simplified molecular-input line-entry system (SMILES) is a line notation used for describing the structure of chemical species using short ASCII strings; calculated by ChemAxon.

inchi

A prediction of the IUPAC International Chemical Identifier (InChI); imported by ChemAxon.

formula

Indicates the simple numbers of each type of atom within the molecule; calculated by ChemAxon.

drugbank_id

parent drugbank id


Drug Sequences parser

Description

The amino acid sequence; provided if the drug is a peptide.

Usage

drug_sequences()

Details

Describes peptide sequences of biotech drugs

Value

a tibble with the following variables:

sequence

a textual representation of the sequence

format

Currently, only the FASTA format is used

drugbank_id

drugbank id


Drug SNP Adverse Drug Reactions parser

Description

The adverse drug reactions that may occur as a result of the listed single nucleotide polymorphisms (SNPs)

Usage

drug_snp_adverse_reactions()

Value

a tibble with the following variables:

protein-name

Proteins involved in this SNP.

gene-symbol

Genes involved in this SNP.

uniprot-id

Universal Protein Resource (UniProt) identifiers for proteins involved in this pathway.

rs-id

The SNP Database identifier for this single nucleotide polymorphism.

allele

The alleles associated with the identified SNP.

adverse-reaction
description
pubmed-id

Reference to PubMed article.

drugbank_id

drugbank id


Drug SNP Effects parser

Description

A list of single nucleotide polymorphisms (SNPs) relevant to drug activity or metabolism, and the effects these may have on pharmacological activity. SNP effects in the patient may require close monitoring, an increase or

Usage

drug_snp_effects()

Value

a tibble with the following variables:

protein-name

Proteins involved in this SNP.

gene-symbol

Genes involved in this SNP.

uniprot-id

Universal Protein Resource (UniProt) identifiers for proteins involved in this pathway.

rs-id

The SNP Database identifier for this single nucleotide polymorphism.

allele

The alleles associated with the identified SNP.

defining-change
description

A written description of the SNP effects.

pubmed-id

Reference to PubMed article.

drugbank_id

drugbank id


Drug Synonyms parser

Description

Other names or identifiers that are associated with this drug.

Usage

drug_syn()

Value

a tibble with 3 variables:

language

Names of the drug in languages other than English.

coder

Organization or source providing the synonym. For example, INN indicates the synonym is an International Nonproprietary Name, while IUPAC indicates the synonym is the nomenclature designated by the International Union of Pure and Applied Chemistry.

drugbank_id

drugbank id


init_dvobject initialize dvobject

Description

init_dvobject initialize dvobject

Usage

init_dvobject()

Value

dvobject


Description

Return a list of websites that were used as references for Drugs/ Carriers/ Enzymes/ Targets/ Transporters

Usage

drugs_links()

carriers_links()

enzymes_links()

targets_links()

transporters_links()

Value

a tibble with 4 variables:

ref-id

Name of the source website

title

Identifier for this drug in the given resource

url

The url of the website

parent_id

drug/ carrier/ target/ enzyme/ transporter id


parseDrugBank

Description

parses given DrugBank XML database into a dvobject. dvobject is a list of data.frames in which each data.frame represents a part of parsed data (i.e drugs, prices, carriers, ...)

Usage

parseDrugBank(
  db_path,
  drug_options = NULL,
  parse_salts = FALSE,
  parse_products = FALSE,
  references_options = NULL,
  cett_options = NULL
)

Arguments

db_path

string, full path for the DrugBank xml or zip file.

drug_options

character vector, list of sub drug related nodes names options to parse (default = NULL). Check drug_node_options() for all available options. If its value is 'NULL' ONLY 'drug_general_information' will be placed in the returned dvobject.

parse_salts

boolean, parse salts info (default = FALSE)

parse_products

boolean, parse products info (default = FALSE)

references_options

character vector, list of sub references related nodes names options to parse (default = NULL). Check references_node_options() for all available options.

cett_options

character vector, list of sub cett related nodes names options to parse (default = NULL). Check cett_nodes_options() for all available options.

Value

dvobject

See Also

Other parsers: cett_nodes_options(), drug_node_options(), references_node_options()


Run all CETT related parsers

Description

Run all parsers that retrieve carriers, enzymes, targets and transporters related information

Usage

parse_cett_node(cett_options)

Value

a list of all drugs parsed tibbles


Run all drug related parsers

Description

Run all parsers that retrieve drugs related information

Usage

parse_drug_nodes(drug_options)

Arguments

drug_options

- list of options to parse

Value

a list of all drugs parsed tibbles


Drugs/ Carriers/ Enzymes/ Targets/ Transporters references element parser

Description

Drugs/ Carriers/ Enzymes/ Targets/ Transporters references element parser

Usage

parse_references_node(references_options)

Value

a list of all references for drugs, carriers, enzymes, targets or transporters


Reads DrugBank xml database and load it into memory.

Description

read_drugbank_xml_db loads DrugBank xml database full tree into memory.

Usage

read_drugbank_xml_db(db_path)

Arguments

db_path

string, full path for the DrugBank xml or zip file.

Details

This functions reads DrugBank xml database and load it into memory for later processing. Hence; this method must be called before any other function in the package and it needs to be called one time only.

It takes one single mandatory argument which is the location of DrugBank db.

Value

loaded DB or NULL


returns references node valid options.

Description

returns references node valid options.

Usage

references_node_options()

Value

list of references valid options

See Also

Other parsers: cett_nodes_options(), drug_node_options(), parseDrugBank()


init_dvobject Returns data.frame with two columns (key, value) of dvobject attributes

Description

init_dvobject Returns data.frame with two columns (key, value) of dvobject attributes

Usage

show_dvobject_metadata(dvobject)

Arguments

dvobject

- dvobject list to show related metadata

Value

data.frame