Skip to main content
  • Home
  • Development
  • Documentation
  • Donate
  • Operational login
  • Browse the archive

swh logo
SoftwareHeritage
Software
Heritage
Archive
Features
  • Search

  • Downloads

  • Save code now

  • Add forge now

  • Help

https://github.com/darylgohl/TaG-EM
12 March 2025, 06:07:10 UTC
  • Code
  • Branches (1)
  • Releases (0)
  • Visits
    • Branches
    • Releases
    • HEAD
    • refs/heads/main
    No releases to show
  • 464155c
  • /
  • README.md
Raw File Download
Take a new snapshot of a software origin

If the archived software origin currently browsed is not synchronized with its upstream version (for instance when new commits have been issued), you can explicitly request Software Heritage to take a new snapshot of it.

Use the form below to proceed. Once a request has been submitted and accepted, it will be processed as soon as possible. You can then check its processing state by visiting this dedicated page.
swh spinner

Processing "take a new snapshot" request ...

Permalinks

To reference or cite the objects present in the Software Heritage archive, permalinks based on SoftWare Hash IDentifiers (SWHIDs) must be used.
Select below a type of object currently browsed in order to display its associated SWHID and permalink.

  • content
  • directory
  • revision
  • snapshot
origin badgecontent badge Iframe embedding
swh:1:cnt:e6eac9bfde87b38203185073ac5667316839a94c
origin badgedirectory badge Iframe embedding
swh:1:dir:464155c60640d9297a9d6b468d4066e6703f1eb8
origin badgerevision badge
swh:1:rev:5dc9aa94c42e759c219de322ae067e817a5edf4c
origin badgesnapshot badge
swh:1:snp:fbc8bc92296dffbad081604f2bc6832e962f044e
Citations

This interface enables to generate software citations, provided that the root directory of browsed objects contains a citation.cff or codemeta.json file.
Select below a type of object currently browsed in order to generate citations for them.

  • content
  • directory
  • revision
  • snapshot
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Generate software citation in BibTex format (requires biblatex-software package)
Generating citation ...
Tip revision: 5dc9aa94c42e759c219de322ae067e817a5edf4c authored by darylgohl on 20 February 2025, 18:24:16 UTC
Add files via upload
Tip revision: 5dc9aa9
README.md
# TaG-EM Code and Files
This repository contains code and reference files related to the following publication:
Mendana, Jorge Blanco, Donovan, Margaret, O'Brien, Lindsey Gengelbach, Auch, Benjamin, Garbe, John, Gohl, Daryl M. (2025) Deterministic Genetic Barcoding for Multiplexed Behavioral and Single-Cell Transcriptomic Studies. eLife2025;12:RP88334 DOI: https://doi.org/10.7554/eLife.88334.3

## TaG-EM Barcode Analysis Script
TaG-EM analysis scripts and files

### Prerequisites
Python 3

BioPython

cutadapt

### Usage
usage: TaG-EM_barcode_analysis.py [-h] [-i] [-r] [-l] [-o] [-m]

TaG-EM barcode counting script (v1.0) by Daryl Gohl This program takes in a
FASTQ file and barcode reference FASTA file and outputs a plot and a .txt file
of barcode counts.

options:

  -h, --help            show this help message and exit

  -i , --input_folder   Input folder containing raw FASTQ files [required].
  
  -r , --reference_file 
                        Input reference FASTA file [required].
  
  -l , --length         Barcode length (default: 14).
  
  -o , --output_dir     Output directory for barcode count file (default: same
                        folder as input file)
  
  -m , --mismatches_allowed 
                        Number of mismatches to barcode reference sequences
                        allowed (default 2)
### Usage example
TaG-EM_barcode_analysis.py -i <PathToFile/InputFileName> -r <PathToFile/TagEMReferenceFileName>


## TaG-EM Barcode/Cell Barcode Correlation Script
TaG-EM barcode and 10x cell barcode correlation script

### Prerequisites
Python 3

BioPython

### Usage
usage: TaG-EM_barcode_Cell_barcode_correlation.py [-h] [-r1] [-r2] [-x] [-w]
                                                  [-o] [-s] [-m]

TaG-EM barcode and 10x cell barcode correlation script (v1.0) by Daryl Gohl
This program takes in read 1 and read 2 FASTQ files from an TaG-EM barcode
enriched 10x Genomics library, as well as a filtered metadata .csv file and a
barcode reference FASTA file (whitelist) and outputs an updated .csv metadata
file that associates TaG-EM barcode and UMI counts with cell barcodes.

options:

  -h, --help            show this help message and exit
  
  -r1 , --r1_input_file 
                        Input read 1 FASTQ or FASTQ.gz files from enriched
                        TaG-EM library. For reading in 10x cell barcode and
                        UMI [required].
                        
  -r2 , --r2_input_file 
                        Input read 1 FASTQ or FASTQ.gz files from enriched
                        TaG-EM library. For reading in TaG-EM barcode
                        [required].
                        
  -x , --metadata_file 
                        Filtered scRNA-Seq metadata .csv file [required].
                        
  -w , --TagEM_barcode_file 
                        TaG-EM barcode reference FASTA file
                        (whitelist)[required].
                        
  -o , --output_dir     Output directory for barcode count file (default: Home
                        directory)
                        
  -s , --subsample      Number of subsampled reads to analyze (default
                        1000000)
                        
  -m , --mismatches_allowed 
                        Number of mismatches to barcode reference sequences
                        allowed (default 1)

### Usage example
TaG-EM_barcode_Cell_barcode_correlation.py -r1 <PathToFile/Read1FastqFile> -r2 <PathToFile/Read2FastqFile> -x <PathToFile/metadatacsvFile> -o <PathToFile/OutputDirectory> -w <PathToFile/TagEMReferenceFileName> -s <DesiredSubsamplingDepth>
<PathToFile/ReferenceFileName>


## TaG-EM Barcode Reference Files
### Original TaG-EM barcode reference file (initial 20 lines)
TaG-EM_barcodes_v1.fasta

### Extended TaG-EM barcode reference file (176 lines)
TaG-EM_barcodes_v2_extended.fasta

Software Heritage — Copyright (C) 2015–2025, The Software Heritage developers. License: GNU AGPLv3+.
The source code of Software Heritage itself is available on our development forge.
The source code files archived by Software Heritage are available under their own copyright and licenses.
Terms of use: Archive access, API— Contact— JavaScript license information— Web API

back to top