https://github.com/nrnb/mirna-pathway-finder
Tip revision: 2a75f42dd8d65b42c60e38bffbab2e96f520a401 authored by Alexander Pico on 09 March 2018, 01:18:03 UTC
Create README.md
Create README.md
Tip revision: 2a75f42
README-input-file-sources.txt
################
## SOURCE INPUTS
* wikipathways-YYYYMMDD-gmt-Homo_sapiens -- downloaded via http://data.wikipathways.org/current/gmt/. Contains pathway names, links and gene sets unified to entrez gene.
* hsa_MTI_r#.#.xlsx -- downloaded from http://mirtarbase.mbc.nctu.edu.tw/php/download.php. Contains miRTarBase IDs for targeting miRNAs and Entrez for gene targets. Also includes strong/weak and pmid refs. PROCESSING: Release number added to filename.
* [IGNORE] mart_export_v##.csv -- exported from BioMART at http://www.ensembl.org/biomart/. See screenshot "mart_export.png". Filter for lincRMA, miRNA and processed_transcript; Attributes: entrez and mirbase ids. Export as CSV, unique results only. PROCESSING: Version number added to filename and extension set to CSV. Removed rows with blanks in Excel and renamed header rows: "entrez", "mirbase"
* [OPTIONAL] mirna_mature.txt -- downloaded from ftp://mirbase.org/pub/mirbase/CURRENT/database_files/ (log in as Guest). This file can be processed into a CSV to be used to prepare a _direct_ mapping from mirtarbase ids to pathway elements, i.e., via an additional MIMAT->Entrez mapping set by BridgeDb. PROCESSING: Open in Excel. Filter for mirtarbase mature names that begin with "hsa"; copy mirtarbase and mimat id columns to new tab and export as mirna_mature.csv.
###################
## PROCESSED INPUTS
* hsa-mirtarbase-entrez-targets-***.csv
1. open hsa_MTIr#.#.xlsx in Excel
2. rename "miRNA" to "mirtarbase" and "Target Gene (Entrez Gene ID)" to "entrez targets"
3. filter for both strong and weak support types: "Functional MTI" or "Functional MIT (Weak)"
4. copy "mirtarbase" and "entrez targets" columns to new tab called "funcional": miRNA, Target Gene (Entrez)
5. filter original tab for only strong support type: "Functional MTI" (i.e., excluding weak)
6. copy same two columns to new tab called "strong": miRNA, Target Gene (Entrez)
7. [important!] confirm paste lengths
8. remove rows with blanks (if any)
8. save hsa_MTI.xlsx
9. save each tab as: hsa-mirtarbase-entrez-targets-(strong|funcional).csv
* hsa_mirt_targets_***_hash.csv/robj
1. source mirt_targets-builder.R (Run App -> shiny)
* hsa_direct_entrez_wp_hash.csv/robj
1. source direct-entrez-builder.R (Source -> runs all lines)
* hsa_mirt_targeting_***_hash.csv/robj
1. source mirt-targeting-builder.R (Run App -> shiny)
* hsa_wp_annot_hash.csv/robj
1. source wp-annotation-builder.R (Source -> runs all lines)