Skip to main content
  • Home
  • Development
  • Documentation
  • Donate
  • Operational login
  • Browse the archive

swh logo
SoftwareHeritage
Software
Heritage
Archive
Features
  • Search

  • Downloads

  • Save code now

  • Add forge now

  • Help

https://github.com/darylgohl/TaG-EM
12 November 2024, 10:46:02 UTC
  • Code
  • Branches (1)
  • Releases (0)
  • Visits
Revision a21990b0b122d53a1185cc89194d4653e600f279 authored by darylgohl on 01 November 2024, 18:12:33 UTC, committed by GitHub on 01 November 2024, 18:12:33 UTC
Add files via upload
Uploading script to correlate cell barcodes and TaG-EM barcodes from an enriched library.
1 parent b113fa9
  • Files
  • Changes
    • Branches
    • Releases
    • HEAD
    • refs/heads/main
    • a21990b0b122d53a1185cc89194d4653e600f279
    No releases to show
  • 3c227ec
  • /
  • README.md
Raw File Download
Take a new snapshot of a software origin

If the archived software origin currently browsed is not synchronized with its upstream version (for instance when new commits have been issued), you can explicitly request Software Heritage to take a new snapshot of it.

Use the form below to proceed. Once a request has been submitted and accepted, it will be processed as soon as possible. You can then check its processing state by visiting this dedicated page.
swh spinner

Processing "take a new snapshot" request ...

Permalinks

To reference or cite the objects present in the Software Heritage archive, permalinks based on SoftWare Hash IDentifiers (SWHIDs) must be used.
Select below a type of object currently browsed in order to display its associated SWHID and permalink.

  • revision
  • directory
  • content
  • snapshot
origin badgerevision badge
swh:1:rev:a21990b0b122d53a1185cc89194d4653e600f279
origin badgedirectory badge Iframe embedding
swh:1:dir:3c227ec9849dc66ccfee95cb86d147db91ab7fae
origin badgecontent badge Iframe embedding
swh:1:cnt:3c5ed8f318352674c00151f279028c9fe42cad20
origin badgesnapshot badge
swh:1:snp:ba45a25be196f1d13de5a06e808756c2aaf35a34
Citations

This interface enables to generate software citations, provided that the root directory of browsed objects contains a citation.cff or codemeta.json file.
Select below a type of object currently browsed in order to generate citations for them.

  • revision
  • directory
  • content
  • snapshot
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Tip revision: a21990b0b122d53a1185cc89194d4653e600f279 authored by darylgohl on 01 November 2024, 18:12:33 UTC
Add files via upload
Tip revision: a21990b
README.md
# TaG-EM Code and Files
This repository contains code and reference files related to the following publication:
Mendana Jorge Blanco, Donovan Margaret, Gengelbach Lindsey, Auch Benjamin, Garbe John, Gohl Daryl M. (2023) Deterministic Genetic Barcoding for Multiplexed Behavioral and Single-Cell Transcriptomic Studies eLife 12:RP88334
https://doi.org/10.7554/eLife.88334.1

## TaG-EM Barcode Analysis Script
TaG-EM analysis scripts and files

### Prerequisites
Python 3

BioPython

cutadapt

### Usage
usage: TaG-EM_barcode_analysis.py [-h] [-i] [-r] [-l] [-o] [-m]

TaG-EM barcode counting script (v1.0) by Daryl Gohl This program takes in a
FASTQ file and barcode reference FASTA file and outputs a plot and a .txt file
of barcode counts.

options:

  -h, --help            show this help message and exit

  -i , --input_folder   Input folder containing raw FASTQ files [required].
  
  -r , --reference_file 
                        Input reference FASTA file [required].
  
  -l , --length         Barcode length (default: 14).
  
  -o , --output_dir     Output directory for barcode count file (default: same
                        folder as input file)
  
  -m , --mismatches_allowed 
                        Number of mismatches to barcode reference sequences
                        allowed (default 2)
### Usage example
TaG-EM_barcode_analysis.py -i <PathToFile/InputFileName> -r <PathToFile/TagEMReferenceFileName>


## TaG-EM Barcode/Cell Barcode Correlation Script
TaG-EM barcode and 10x cell barcode correlation script

### Prerequisites
Python 3

BioPython

### Usage
usage: TaG-EM_barcode_Cell_barcode_correlation.py [-h] [-r1] [-r2] [-x] [-w]
                                                  [-o] [-s] [-m]

TaG-EM barcode and 10x cell barcode correlation script (v1.0) by Daryl Gohl
This program takes in read 1 and read 2 FASTQ files from an TaG-EM barcode
enriched 10x Genomics library, as well as a filtered metadata .csv file and a
barcode reference FASTA file (whitelist) and outputs an updated .csv metadata
file that associates TaG-EM barcode and UMI counts with cell barcodes.

options:

  -h, --help            show this help message and exit
  
  -r1 , --r1_input_file 
                        Input read 1 FASTQ or FASTQ.gz files from enriched
                        TaG-EM library. For reading in 10x cell barcode and
                        UMI [required].
                        
  -r2 , --r2_input_file 
                        Input read 1 FASTQ or FASTQ.gz files from enriched
                        TaG-EM library. For reading in TaG-EM barcode
                        [required].
                        
  -x , --metadata_file 
                        Filtered scRNA-Seq metadata .csv file [required].
                        
  -w , --TagEM_barcode_file 
                        TaG-EM barcode reference FASTA file
                        (whitelist)[required].
                        
  -o , --output_dir     Output directory for barcode count file (default: Home
                        directory)
                        
  -s , --subsample      Number of subsampled reads to analyze (default
                        1000000)
                        
  -m , --mismatches_allowed 
                        Number of mismatches to barcode reference sequences
                        allowed (default 1)

### Usage example
TaG-EM_barcode_Cell_barcode_correlation.py -r1 <PathToFile/Read1FastqFile> -r2 <PathToFile/Read2FastqFile> -x <PathToFile/metadatacsvFile> -o <PathToFile/OutputDirectory> -w <PathToFile/TagEMReferenceFileName> -s <DesiredSubsamplingDepth>
<PathToFile/ReferenceFileName>


## TaG-EM Barcode Reference Files
### Original TaG-EM barcode reference file (initial 20 lines)
TaG-EM_barcodes_v1.fasta

### Extended TaG-EM barcode reference file (176 lines)
TaG-EM_barcodes_v2_extended.fasta
The diff you're trying to view is too large. Only the first 1000 changed files have been loaded.
Showing with 0 additions and 0 deletions (0 / 0 diffs computed)
swh spinner

Computing file changes ...

Software Heritage — Copyright (C) 2015–2025, The Software Heritage developers. License: GNU AGPLv3+.
The source code of Software Heritage itself is available on our development forge.
The source code files archived by Software Heritage are available under their own copyright and licenses.
Terms of use: Archive access, API— Contact— JavaScript license information— Web API

back to top