; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004208 (gene) of Snake gourd v1 genome

Gene IDTan0004208
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionfibroin heavy chain-like
Genome locationLG01:15927034..15928005
RNA-Seq ExpressionTan0004208
SyntenyTan0004208
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577377.1 hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sororia]1.2e-6452.48Show/hide
Query:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGP---------RAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIV----AGPK
        M SLKYFLL PFVFLCLSCT AN V N D+GSGF  G   G+ P AGP         RAGP  EG V+ V AG KAGP+AGPG EG VS +     AGPK
Subjt:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGP---------RAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIV----AGPK

Query:  AGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGG-----------VD
        AGP+AGPG E  VS + AGP+AGP+AGP  EG VS             V AG +AGP+AGPG EG VS++ AGP  GPRA P   GG           VD
Subjt:  AGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGG-----------VD

Query:  PIVN--GIGVGVDVGFRSEFGPGMG---FWPDPIIG-GGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQ
        P++N  G+G+GVD+G+RS F  G+G    W  P IG GGGG  ++C LGY CP    R C+KFS+  CD+Y F+PL  +M +HEV++ WAK SKP    Q
Subjt:  PIVN--GIGVGVDVGFRSEFGPGMG---FWPDPIIG-GGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQ

Query:  HGV
        +GV
Subjt:  HGV

KGN56231.1 hypothetical protein Csa_011503 [Cucumis sativus]2.2e-4245.23Show/hide
Query:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV
        MASLKYFLL PF+FLCLS T AN VFN D+G GFG      S P   P AGP V+ GVS+   GPKAGPRAG GV GG+S++  G   GP+AGPGV+  +
Subjt:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV

Query:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEF-GPGMG
        S++ AGP+                               PK           GVSSI AGP+AGP+       GVDPIV G+GVGV V     F GP MG
Subjt:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEF-GPGMG

Query:  FWPDPIIGGGGGYD---------DDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVS
          P P    GG Y          ++C+LGY CP +    C K  +  C+SY+F PL+ + ++H+VKINWAKSK V  AQHG S
Subjt:  FWPDPIIGGGGGYD---------DDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVS

XP_022929340.1 fibroin heavy chain-like [Cucurbita moschata]1.6e-5651.97Show/hide
Query:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV
        M SLKYFLL PFVFLCLSCT AN V N D+GSGF    +VG+GP A P AGP VE GVS+V        RAGP  EG V+ + AG KAGP+AGPG EG V
Subjt:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV

Query:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVN--GIGVGVDVGFRSEFGPGM
        S + AG +AGP+AGP  EG VS            +V AGP  GPRA PG EGGVSS   G +            VDP++N  G+G+GVD+G+RS F  G+
Subjt:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVN--GIGVGVDVGFRSEFGPGM

Query:  G---FWPDPIIG-GGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV
        G    W  P IG GGGG  ++C LGY CP    R C+KFS+  CD+Y F+PL  +M +HEV++ WAK SKP    Q+GV
Subjt:  G---FWPDPIIG-GGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV

XP_023551823.1 fibroin heavy chain-like isoform X1 [Cucurbita pepo subsp. pepo]1.1e-4949.61Show/hide
Query:  GSGPKAGP---------RAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAG
        G+ P AGP         RAGP  EG V+ V AG KAGP+AGPG EG VS + AGP+AG +AGPG E  VS + AGP+AGP+AGP  EG VS         
Subjt:  GSGPKAGP---------RAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAG

Query:  PRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGG-----------VDPIVN--GIGVGVDVGFRSEFGPGMG---FWPDPIIGGGGGYDDD
            V AGP+AGP+AGPG EG V+++ AGP  GPRA P   GG           VDP++N  G+G+GVD+G+RS F  G+G    W  P I GG G  ++
Subjt:  PRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGG-----------VDPIVN--GIGVGVDVGFRSEFGPGMG---FWPDPIIGGGGGYDDD

Query:  CILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV
        C LGY CP    R C+KFS+  CDSY F+PL  +M +HEV++ WAK SKP    Q+GV
Subjt:  CILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV

XP_023551824.1 fibroin heavy chain-like isoform X2 [Cucurbita pepo subsp. pepo]1.9e-3847.66Show/hide
Query:  PKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAG
        P A P AGP VE GVS+V        RAGP  EG V+ + AGPK    AGPG EG VS + AGP+AGP+AGP  EG V+            +V AGP  G
Subjt:  PKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAG

Query:  PRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVN--GIGVGVDVGFRSEFGPGMG---FWPDPIIGGGGGYDDDCILGYACPVDEGRRCNKFSFETC
        PRA PG EGGVSS   G +            VDP++N  G+G+GVD+G+RS F  G+G    W  P I GG G  ++C LGY CP    R C+KFS+  C
Subjt:  PRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVN--GIGVGVDVGFRSEFGPGMG---FWPDPIIGGGGGYDDDCILGYACPVDEGRRCNKFSFETC

Query:  DSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV
        DSY F+PL  +M +HEV++ WAK SKP    Q+GV
Subjt:  DSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV

TrEMBL top hitse value%identityAlignment
A0A0A0L7X7 Uncharacterized protein1.1e-4245.23Show/hide
Query:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV
        MASLKYFLL PF+FLCLS T AN VFN D+G GFG      S P   P AGP V+ GVS+   GPKAGPRAG GV GG+S++  G   GP+AGPGV+  +
Subjt:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV

Query:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEF-GPGMG
        S++ AGP+                               PK           GVSSI AGP+AGP+       GVDPIV G+GVGV V     F GP MG
Subjt:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEF-GPGMG

Query:  FWPDPIIGGGGGYD---------DDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVS
          P P    GG Y          ++C+LGY CP +    C K  +  C+SY+F PL+ + ++H+VKINWAKSK V  AQHG S
Subjt:  FWPDPIIGGGGGYD---------DDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVS

A0A3Q3A0M9 Uncharacterized protein4.4e-1237.79Show/hide
Query:  GPKAGPRAGPVVEGGVSSVVAGPKAGPRAGP---GVEGGVSSIVAGPKAGPRAGP---GVEGGVSSIVAGPKAGPRAGPV---VEGGVSSIVAGPKAGPR
        GP AGP AGP  EG  +   AGP  GP AGP     EG  +   AGP  GP AGP     EG  +   AGP  GP AGP     EG  +   AGP  GP 
Subjt:  GPKAGPRAGPVVEGGVSSVVAGPKAGPRAGP---GVEGGVSSIVAGPKAGPRAGP---GVEGGVSSIVAGPKAGPRAGPV---VEGGVSSIVAGPKAGPR

Query:  ASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRS--EFGPGMGFWPDPIIGGGGGYDDDCILGYACPVDEGRRCN
        A   AGP  GP AGP   G      AGP AGP  GP AG    P   G   G   G       GP  G    P  G   G  +    G      EG    
Subjt:  ASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRS--EFGPGMGFWPDPIIGGGGGYDDDCILGYACPVDEGRRCN

Query:  KFSFETCDSYSFYPLTP
            +TC       L P
Subjt:  KFSFETCDSYSFYPLTP

A0A5A7V4J6 Uncharacterized protein1.2e-3842.91Show/hide
Query:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV
        MASLKYFLL PF+FLCLS T A+ VFN D G  FG                          ++ P   P AGPGV+ GVS+I  GPKAGPRAG G+ GG+
Subjt:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV

Query:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEFG-----
        S +   P                               GPKAGP+A  G + GVS I AGP+AGP+            VNG GVGV V     FG     
Subjt:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEFG-----

Query:  --PGMGFW--PDPIIGGGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVS
          PG G W  P PII    G   +C+LGY CP +    C+KF++  CDSY+F+PL+ + D+HEVKINWAKSKP   AQHG S
Subjt:  --PGMGFW--PDPIIGGGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVS

A0A671FXG7 Uncharacterized protein1.8e-1345.25Show/hide
Query:  VGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAG
        +G GP AGPR+ P +  G  S V GP AGPR+ PG   G  S   GP AGPR+GPG   G+ S V GP AGPR+GP           GP AGPR+  V G
Subjt:  VGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAG

Query:  PKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGG----VDP---IVNGIGVGVDVGFRSEFGPGMGFWPDPIIGGGGG
        P AGPR+GPG   G  S   GP+AGPR+GP  G G    + P   +  G+ VG  +G  +  G G G   +P  G G G
Subjt:  PKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGG----VDP---IVNGIGVGVDVGFRSEFGPGMGFWPDPIIGGGGG

A0A6J1EU53 fibroin heavy chain-like7.6e-5751.97Show/hide
Query:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV
        M SLKYFLL PFVFLCLSCT AN V N D+GSGF    +VG+GP A P AGP VE GVS+V        RAGP  EG V+ + AG KAGP+AGPG EG V
Subjt:  MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGV

Query:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVN--GIGVGVDVGFRSEFGPGM
        S + AG +AGP+AGP  EG VS            +V AGP  GPRA PG EGGVSS   G +            VDP++N  G+G+GVD+G+RS F  G+
Subjt:  SSIVAGPKAGPRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVN--GIGVGVDVGFRSEFGPGM

Query:  G---FWPDPIIG-GGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV
        G    W  P IG GGGG  ++C LGY CP    R C+KFS+  CD+Y F+PL  +M +HEV++ WAK SKP    Q+GV
Subjt:  G---FWPDPIIG-GGGGYDDDCILGYACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAK-SKPVVAAQHGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGCTTAAGCTGCACCCTCGCCAACACAGTCTTCAACATTGATGAAGGATCTGGTTTTGGTTTCGG
TCTCAATGTCGGGTCTGGACCGAAAGCTGGACCAAGAGCTGGGCCAGTAGTTGAGGGAGGAGTAAGTAGTGTCGTGGCTGGACCGAAAGCTGGACCAAGAGCCGGGCCTG
GAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGACCAAGAGCCGGGCCAGGAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGA
CCAAGAGCCGGGCCAGTAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGACCAAGAGCTAGTGTCGTGGCTGGACCGAAAGCTGGACCAAGAGCCGG
GCCAGGAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGACCGAGAGCCGGACCGAGAGCTGGGGGAGGTGTTGATCCAATTGTTAATGGAATCGGAG
TAGGAGTCGATGTCGGGTTTAGGTCAGAATTTGGTCCAGGGATGGGGTTTTGGCCTGACCCAATAATTGGTGGTGGTGGGGGGTATGATGATGATTGCATATTGGGCTAT
GCTTGTCCAGTCGATGAAGGTAGGAGATGCAATAAATTTTCTTTTGAAACTTGCGACTCTTATAGCTTCTATCCATTGACGCCTGCAATGGACATGCACGAAGTTAAAAT
CAATTGGGCCAAAAGCAAGCCTGTTGTGGCGGCCCAACATGGCGTTTCTGCCATCGACCACCCTCAGCTCACTAACAAGGCCCAAAATGGTGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGCTTAAGCTGCACCCTCGCCAACACAGTCTTCAACATTGATGAAGGATCTGGTTTTGGTTTCGG
TCTCAATGTCGGGTCTGGACCGAAAGCTGGACCAAGAGCTGGGCCAGTAGTTGAGGGAGGAGTAAGTAGTGTCGTGGCTGGACCGAAAGCTGGACCAAGAGCCGGGCCTG
GAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGACCAAGAGCCGGGCCAGGAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGA
CCAAGAGCCGGGCCAGTAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGACCAAGAGCTAGTGTCGTGGCTGGACCGAAAGCTGGACCAAGAGCCGG
GCCAGGAGTTGAGGGAGGAGTAAGCAGCATCGTGGCTGGACCGAAAGCTGGACCGAGAGCCGGACCGAGAGCTGGGGGAGGTGTTGATCCAATTGTTAATGGAATCGGAG
TAGGAGTCGATGTCGGGTTTAGGTCAGAATTTGGTCCAGGGATGGGGTTTTGGCCTGACCCAATAATTGGTGGTGGTGGGGGGTATGATGATGATTGCATATTGGGCTAT
GCTTGTCCAGTCGATGAAGGTAGGAGATGCAATAAATTTTCTTTTGAAACTTGCGACTCTTATAGCTTCTATCCATTGACGCCTGCAATGGACATGCACGAAGTTAAAAT
CAATTGGGCCAAAAGCAAGCCTGTTGTGGCGGCCCAACATGGCGTTTCTGCCATCGACCACCCTCAGCTCACTAACAAGGCCCAAAATGGTGTCTAATTAAACGTCTCAT
CTCATGCTCATCTCCTTCCAATCATGCAATGTTAAATAATCTTATTTGACCCTCTTAAATAAAATAAGCTTAAAAAAAGCTGCAATATTTAC
Protein sequenceShow/hide protein sequence
MASLKYFLLCPFVFLCLSCTLANTVFNIDEGSGFGFGLNVGSGPKAGPRAGPVVEGGVSSVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPGVEGGVSSIVAGPKAG
PRAGPVVEGGVSSIVAGPKAGPRASVVAGPKAGPRAGPGVEGGVSSIVAGPKAGPRAGPRAGGGVDPIVNGIGVGVDVGFRSEFGPGMGFWPDPIIGGGGGYDDDCILGY
ACPVDEGRRCNKFSFETCDSYSFYPLTPAMDMHEVKINWAKSKPVVAAQHGVSAIDHPQLTNKAQNGV