CuGenDBv2

Tan0020006 (gene) of Snake gourd v1 genome

Gene ID: Tan0020006
Organism: Trichosanthes anguina (Snake gourd v1)
Description: circumsporozoite protein-like
Genome location: LG01:103065185..103067068
RNA-Seq Expression: Tan0020006
Synteny: Tan0020006
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits
KGN60236.1 hypothetical protein Csa_001434 [Cucumis sativus] (e value: 1.8e-41; % identity: 77.7)
Query:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG
        MIDPETAILFT APLPGV+ GDGA AGG E+E DGTGV  GESEGAVGGAGGE AGA AGGEV V+G GA P GA  ED    GGVATVGG ADLGEEAG
Subjt:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG

Query:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAV-KNAKSREKK
         DLVGEEAGAAPGAWPRE A KAKRT KTKAEEEA+ V KN K REK+
Subjt:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAV-KNAKSREKK

XP_022136125.1 circumsporozoite protein-like [Momordica charantia] (e value: 3.2e-43; % identity: 73.12)
Query:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG
        MIDP  A LF AAPLPGVEAGDGAC GG  IEV G GV  GESEGAVGGAGGE AGAEAGGEVPVEG G  PAGAGDEDGV+AGGVAT  G A  G++AG
Subjt:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG

Query:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAVKNAKSREKKMKMEDGREVLSCR
        G LVGEEAGAAPGAWP+EPATKA  TK TKA EEA+AVKNA+ RE+  +    R   SCR
Subjt:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAVKNAKSREKKMKMEDGREVLSCR

XP_023520820.1 uncharacterized protein LOC111784314 [Cucurbita pepo subsp. pepo] (e value: 3.2e-06; % identity: 53.41)
Query:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVAT
        MID ET+ILFTA PLPGVEAGD                      GAV  A GE+ GA A GEVP++ EG  P GA D+DGVEAG VAT
Subjt:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVAT

TrEMBL top hits
A0A0A0LJM1 Uncharacterized protein (e value: 8.6e-42; % identity: 77.7)
Query:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG
        MIDPETAILFT APLPGV+ GDGA AGG E+E DGTGV  GESEGAVGGAGGE AGA AGGEV V+G GA P GA  ED    GGVATVGG ADLGEEAG
Subjt:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG

Query:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAV-KNAKSREKK
         DLVGEEAGAAPGAWPRE A KAKRT KTKAEEEA+ V KN K REK+
Subjt:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAV-KNAKSREKK

A0A6J1C301 circumsporozoite protein-like (e value: 1.6e-43; % identity: 73.12)
Query:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG
        MIDP  A LF AAPLPGVEAGDGAC GG  IEV G GV  GESEGAVGGAGGE AGAEAGGEVPVEG G  PAGAGDEDGV+AGGVAT  G A  G++AG
Subjt:  MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAG

Query:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAVKNAKSREKKMKMEDGREVLSCR
        G LVGEEAGAAPGAWP+EPATKA  TK TKA EEA+AVKNA+ RE+  +    R   SCR
Subjt:  GDLVGEEAGAAPGAWPREPATKAKRTKKTKAEEEAIAVKNAKSREKKMKMEDGREVLSCR

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGATAGATCCGGAGACGGCGATTCTGTTCACAGCAGCGCCATTGCCAGGGGTTGAGGCCGGCGACGGAGCCTGTGCCGGAGGAGTTGAGATCGAAGTTGATGGAACCGG
AGTGGCAACTGGTGAGTCCGAAGGAGCAGTAGGCGGAGCTGGTGGAGAAGTAGCCGGCGCCGAAGCTGGTGGTGAAGTTCCAGTGGAAGGAGAGGGAGCAGCGCCTGCCG
GAGCAGGAGATGAAGACGGTGTCGAAGCAGGAGGTGTCGCCACTGTAGGCGGAAGAGCCGATTTAGGCGAGGAGGCCGGCGGCGACTTGGTTGGAGAAGAAGCCGGTGCG
GCGCCAGGAGCCTGGCCCAGAGAGCCGGCAACCAAAGCGAAGAGGACGAAGAAGACAAAAGCGGAGGAGGAAGCCATAGCCGTGAAGAATGCGAAGAGCAGAGAGAAGAA
GATGAAAATGGAAGATGGAAGGGAAGTTTTGAGTTGCAGAGGTTAA

mRNA sequence
GGTTGCTTAGAGCGCTAAATGAGTTATAATAATTTATGGAGTTATATAATTTGTGTTTAAGAGATATAGTTATTTAGTATGGTCATATAAGTCTGTTTTTTGGTGTAAAG
TTTTTCATAGATTATTATCACACATGTTATTACATGCACCATAAACAAACAAGCTCTTAGTTATTACTGTTTTGTAGCCAGACTTATTAGAGACTAAATCTAAAGACTTC
TATTTAGTTTTAAAAGAAGAAAGAAAATCAAGACTTATTTTTTAGATATGTTTTAGGGAGTACAAAAAGCAACATCTTTTTCTTTAATTTGTTTTAATATAAATCCTTGA
CATTGTAAAATTTCATCTTCGGGCCTCTAGATTTAACATCATACACAATCTAACATCAATTCCCCACTTTACTATGTTATTCAAACAAACTGTAGAAAAATAGTTAAAAT
ACAATTTAGATCTCTTTCATCATGCATTCTCCAAAATATTCATCACAGAGAAACAAAAGGCAGATTACAAAGGGAAAAAAAAAAAGAGAAAATTTGTTGGCTAAAAGTTT
TAAAAACAAACGTTAGAAAAAATGAAATTTTATTTCCAGGAAACCCTTTCATTGTATATTGAAAACCCCTCATTTTGTAGTCATTTCTGTTATTTTTTTTTAAACAAGAA
ATAATTCAGAAAATCGATCCAATGAATGGGCCAATTGGGCCCATGGCTCCAAAAGCTGTGGGCCCAAAGGGAACCTGTTAGCGTTTTCTTTTGAAAAAGACGAAGTACTG
TTGAAATGCACATGGGGAAAGCGTAGGCGCGCACAAAGCGACAGCCTACGCTACAATCCCTTTGCCTTTGTTTTTGTTCCTTTTTCATTATGAAACCCCAAATTTTTTTA
GAAAAGAGAAAAAAAAAAGCAATAATCACCAGCTGTCTTCAACTCCATTATTATCACCTTCCATAAAATTATTTCTAAAGCTTCAAGAATTACAGTTCAGTTCTGTACCT
TCGCCGCCGTAGCGCCGCCGGTATCGTCTTTCCGATGCGCGGAATTCAGTCGGAGGAGGTAAAAGACAAAACAGAGATCCGGCAGATTAACATATAAAATGAGATCTAAT
TAGTAATAATAGTGACAATAAGAGTGAAGCAAATCGACTCAACCACAGTAAAACCAAGAAATCCAAATAAATAACAATAATCATCAAGCGGCGCCACCGTCTAATCCTCT
CGGTTCTTCCTAAATACACCAAAAATAAGAGTAAAATAAAAAATATACTAAGAAAGCTACAGCAACGACTAATGCTGATAATGAAACCCTAAGATCTGAGTCTCCAGAGG
AGAGCGAAGACGAAGACAGACTGATCGCATTTAAAGCAGTAGAGCGGAGGAAATTACAGCCGCGATGATAGATCCGGAGACGGCGATTCTGTTCACAGCAGCGCCATTGC
CAGGGGTTGAGGCCGGCGACGGAGCCTGTGCCGGAGGAGTTGAGATCGAAGTTGATGGAACCGGAGTGGCAACTGGTGAGTCCGAAGGAGCAGTAGGCGGAGCTGGTGGA
GAAGTAGCCGGCGCCGAAGCTGGTGGTGAAGTTCCAGTGGAAGGAGAGGGAGCAGCGCCTGCCGGAGCAGGAGATGAAGACGGTGTCGAAGCAGGAGGTGTCGCCACTGT
AGGCGGAAGAGCCGATTTAGGCGAGGAGGCCGGCGGCGACTTGGTTGGAGAAGAAGCCGGTGCGGCGCCAGGAGCCTGGCCCAGAGAGCCGGCAACCAAAGCGAAGAGGA
CGAAGAAGACAAAAGCGGAGGAGGAAGCCATAGCCGTGAAGAATGCGAAGAGCAGAGAGAAGAAGATGAAAATGGAAGATGGAAGGGAAGTTTTGAGTTGCAGAGGTTAA
AGATGAGAGAAGAA

Protein sequence
MIDPETAILFTAAPLPGVEAGDGACAGGVEIEVDGTGVATGESEGAVGGAGGEVAGAEAGGEVPVEGEGAAPAGAGDEDGVEAGGVATVGGRADLGEEAGGDLVGEEAGA
APGAWPREPATKAKRTKKTKAEEEAIAVKNAKSREKKMKMEDGREVLSCRG
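
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate_cds` is hypothetical and is not part of CuGenDB. For brevity it is run only on the first and last eight codons of the CDS shown above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (an assumption for this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace left over from the page layout.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the Tan0020006 CDS.
print(translate_cds("ATGATAGATCCGGAGACGGCGATT"))  # MIDPETAI
# Last eight codons of the Tan0020006 CDS, including the TAA stop.
print(translate_cds("GAAGTTTTGAGTTGCAGAGGTTAA"))  # EVLSCRG
```

Both fragments agree with the start (MIDPETAI) and end (EVLSCRG) of the protein sequence listed above. Passing the full 486-nucleotide CDS should give a 161-residue protein, since the final codon is a stop.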