https://github.com/nrnb/mirna-pathway-finder
Raw File
Tip revision: 2a75f42dd8d65b42c60e38bffbab2e96f520a401 authored by Alexander Pico on 09 March 2018, 01:18:03 UTC
Create README.md
Tip revision: 2a75f42
README-input-file-sources.txt
################
## SOURCE INPUTS


* wikipathways-YYYYMMDD-gmt-Homo_sapiens -- downloaded via http://data.wikipathways.org/current/gmt/. Contains pathway names, links and gene sets unified to entrez gene.

* hsa_MTI_r#.#.xlsx -- downloaded from http://mirtarbase.mbc.nctu.edu.tw/php/download.php. Contains miRTarBase IDs for targeting miRNAs and Entrez for gene targets. Also includes strong/weak and pmid refs. PROCESSING: Release number added to filename.

* [IGNORE] mart_export_v##.csv -- exported from BioMART at http://www.ensembl.org/biomart/. See screenshot "mart_export.png". Filter for lincRMA, miRNA and processed_transcript; Attributes: entrez and mirbase ids. Export as CSV, unique results only. PROCESSING: Version number added to filename and extension set to CSV. Removed rows with blanks in Excel and renamed header rows: "entrez", "mirbase"

* [OPTIONAL] mirna_mature.txt -- downloaded from ftp://mirbase.org/pub/mirbase/CURRENT/database_files/ (log in as Guest). This file can be processed into a CSV to be used to prepare a _direct_ mapping from mirtarbase ids to pathway elements, i.e., via an additional MIMAT->Entrez mapping set by BridgeDb.  PROCESSING: Open in Excel. Filter for mirtarbase mature names that begin with "hsa"; copy mirtarbase and mimat id columns to new tab and export as mirna_mature.csv.


###################
## PROCESSED INPUTS

* hsa-mirtarbase-entrez-targets-***.csv
  1. open hsa_MTIr#.#.xlsx in Excel
  2. rename "miRNA" to "mirtarbase" and "Target Gene (Entrez Gene ID)" to "entrez targets"
  3. filter for both strong and weak support types: "Functional MTI" or "Functional MIT (Weak)"
  4. copy "mirtarbase" and "entrez targets" columns to new tab called "funcional": miRNA, Target Gene (Entrez)
  5. filter original tab for only strong support type: "Functional MTI" (i.e., excluding weak)
  6. copy same two columns to new tab called "strong": miRNA, Target Gene (Entrez)
  7. [important!] confirm paste lengths
  8. remove rows with blanks (if any) 
  8. save hsa_MTI.xlsx
  9. save each tab as: hsa-mirtarbase-entrez-targets-(strong|funcional).csv


* hsa_mirt_targets_***_hash.csv/robj
  1. source mirt_targets-builder.R (Run App -> shiny)

* hsa_direct_entrez_wp_hash.csv/robj
  1. source direct-entrez-builder.R (Source -> runs all lines)

* hsa_mirt_targeting_***_hash.csv/robj
  1. source mirt-targeting-builder.R (Run App -> shiny)
  
* hsa_wp_annot_hash.csv/robj
  1. source wp-annotation-builder.R (Source -> runs all lines)

back to top