; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014590 (gene) of Snake gourd v1 genome

Gene IDTan0014590
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
Genome locationLG09:57674697..57675498
RNA-Seq ExpressionTan0014590
SyntenyTan0014590
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]4.9e-5652.26Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE
        MTG SK SKH W+KVEDARLVE+L+YLV  G     GTFRPGY+QHL+++L EK+P  +L  NTI+CKVR+LK QYNAV+EML    SGF WNEEFKCV+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE

Query:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT
         E+E+FD WV      KGM  KPFPHYDDL+ VFGKDRA                            D    E R  E+P   D  +EE  E  T R + 
Subjt:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT

Query:  SGTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSWR
           SSR  GSKRKRS FQ EMID+V++ +++Q+THM RL SW+
Subjt:  SGTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSWR

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida]3.1e-5051.1Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE
        M G  K SKH W+KVEDA+LVE+L+YLV  G     GTFRPGY+QHL+++L EK+P  +L  NTI+CKVR+LK QYNAV+EML    SG GWNEEFKCV 
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE

Query:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT
         E+E+FD WV      K M NKPFPHYDDL+ +FGKDRA G  +E P  M                 D    E R  E+P   D  +EE  E  T R + 
Subjt:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT

Query:  SGTSSRCTGSKRKRSCFQTEMIDVVRT
           SSR   +KR RS FQ EMID++R+
Subjt:  SGTSSRCTGSKRKRSCFQTEMIDVVRT

XP_038895852.1 uncharacterized protein LOC120084021 [Benincasa hispida]9.2e-4754.84Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE
        M G  K SKH W+KVEDA+L+E+L+YLV  G     GTF+PGY+QHL+++L EK+   +L  NTI+CKVR+LK QYNAV+EML    SGF WNEEFKCV+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE

Query:  AEKEVFDAWVKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPE
        +         KGM NK FPHYDDL+ VFGKDRA G  +E P  MA+NA  + E++IRLGSQD    E R  ++P   D  +EEL E
Subjt:  AEKEVFDAWVKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPE

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]8.0e-5149.79Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE
        M G+ K SKH W+KVED +LVE+L+YLV  G     GTFR GY+Q+L+++L EK+P  +L  NTI+CKVR+LK QYNAV+EML    SGFGWNEEFKCV+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE

Query:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT
         EKE+FD WV      KGM NK F HYDDL+ VFGKDRA      TP                         E    E+P   D  +EE  E  T R + 
Subjt:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT

Query:  SGTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSWR
           SSR  GSKRKR  FQ EMID++R+ +++Q+THM RL SW+
Subjt:  SGTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSWR

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida]5.4e-5555.76Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE
        MT   K SKH W+KVEDA+LVE+L+YLV  G     GTFRPGY+QHL+++L EK+P  +L  NTI+CKVR+LK QYN V+EML    SGF WNEEFKCV+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNGC----GTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVE

Query:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT
         E+E+FD WV      K M NKPFPHYDD + VFGKDR  G  +E P  MA+NA  + E+EIRLGSQD    E R  E+P   D  +EE  E  T R + 
Subjt:  AEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNT

Query:  SGTSSRCTGSKRKRSCF
           SSR  GSKRKR  F
Subjt:  SGTSSRCTGSKRKRSCF

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859538.7e-4341.06Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSL-ELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKC
        M   S+  KHTWTK E+ + VE LV LV +G      GTF+PGY+  LQ+M+AEKLP +++ E +TIDC V++LK  Y+A+AEM G  CSGFGWNEEF+C
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSL-ELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKC

Query:  VEAEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRR
        + AE+++FD+W+      KG+ +K FP+YDDL++VFGKDRATG  +ET   + SN +    + I LG       +    + P     G    P+     R
Subjt:  VEAEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRR

Query:  NTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW
            +  R C+  SKRKR   + E ++V+R++M+     ++ +  W
Subjt:  NTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW

A0A5A7U0H7 Retrotransposon protein8.7e-4341.06Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSL-ELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKC
        M   S+  KHTWTK E+ + VE LV LV +G      GTF+PGY+  LQ+M+AEKLP +++ E +TIDC V++LK  Y+A+AEM G  CSGFGWNEEF+C
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSL-ELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKC

Query:  VEAEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRR
        + AE+++FD+W+      KG+ +K FP+YDDL++VFGKDRATG  +ET   + SN +    + I LG       +    + P     G    P+     R
Subjt:  VEAEKEVFDAWV------KGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRR

Query:  NTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW
            +  R C+  SKRKR   + E ++V+R++M+     ++ +  W
Subjt:  NTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW

A0A5A7UME4 Retrotransposon protein1.1e-4243.57Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCV
        MT +S+  KHTWTK E+A LVE LV LV+ G      GTFRPGY+  L +M+A K+P S++  +TID +++ +K  ++A+AEM G  CSGFGWN+E KC+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAW----VKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNTS
         AEKEVFD W     KG+ NK F HYD+L++VFGKDRATG  AE+  ++ SN     + E      D    +  PM +P   ++  ++L ET T+R   S
Subjt:  EAEKEVFDAW----VKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNTS

Query:  GTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW
           +  +GSKRKR    T+  D+VRT ++     + R+  W
Subjt:  GTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW

A0A5D3CBF7 Retrotransposon protein3.3e-4243.39Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCV
        MT +S+  KHTWTK E+A LVE LV LV+ G      GTFRPGY+  L +M+A K+P S++  +TID +++ +K  ++A+AEM G  CSGFGWN+E KC+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAW----VKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLF-GMEQRPMENPCTADVGEEELPETPTSRRNT
         AEKEVFD W     KG+ NK F HYD+L++VFGKDRATG  AE+  ++ SN     +     G+ D     +  PM +P   ++  ++L ET T+R   
Subjt:  EAEKEVFDAW----VKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLF-GMEQRPMENPCTADVGEEELPETPTSRRNT

Query:  SGTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW
        S   +  +GSKRKR    T+  D+VRT ++     + R+  W
Subjt:  SGTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW

A0A5D3DPR5 Retrotransposon protein3.3e-4242.74Show/hide
Query:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCV
        MT +S+  KHTWTK E+A LVE LV LV+ G      GTFRPGY+  L +M+A K+P S++  +TID +++ +K  ++A+AEM G  CSGFGWN+E KC+
Subjt:  MTGTSKHSKHTWTKVEDARLVESLVYLVHNG-----CGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAW----VKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNTS
         AEKEVFD W     KG+ NK F HYD+L++VFGKDRATG  AE+  ++ SN     +    + +  +   +  PM +P   ++  ++L ET T+R   S
Subjt:  EAEKEVFDAW----VKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNTS

Query:  GTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW
           +  +GSKRKR    T+  D+VRT ++     + R+  W
Subjt:  GTSSRCTGSKRKRSCFQTEMIDVVRTIMDIQTTHMQRLLSW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCTATTTGGTACATAATGGGTGTGGAACATTCAGGCC
TGGATATGTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTACCAAATTCATCATTAGAACTGAATACAATAGATTGCAAAGTGAGAACTTTGAAAATGCAATACAATG
CTGTTGCAGAGATGCTTGGGAATGGTTGTAGCGGATTTGGATGGAACGAAGAATTTAAATGTGTTGAGGCAGAGAAGGAGGTATTTGATGCATGGGTCAAGGGGATGAGG
AACAAACCATTTCCACACTATGATGATCTTGCATTTGTATTTGGAAAAGACAGAGCAACGGGGATGGGCGCGGAAACCCCAGGGGAAATGGCCTCTAACGCTGCAGAACA
AATGGAGGAGGAGATCCGACTGGGATCGCAAGACTTATTCGGGATGGAGCAACGACCAATGGAGAATCCATGCACTGCTGATGTAGGGGAGGAAGAATTGCCAGAGACTC
CTACTAGTAGACGTAATACATCTGGCACGTCTTCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAATAATGGAC
ATCCAAACAACTCACATGCAACGCCTTCTATCGTGGCGCGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCTATTTGGTACATAATGGGTGTGGAACATTCAGGCC
TGGATATGTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTACCAAATTCATCATTAGAACTGAATACAATAGATTGCAAAGTGAGAACTTTGAAAATGCAATACAATG
CTGTTGCAGAGATGCTTGGGAATGGTTGTAGCGGATTTGGATGGAACGAAGAATTTAAATGTGTTGAGGCAGAGAAGGAGGTATTTGATGCATGGGTCAAGGGGATGAGG
AACAAACCATTTCCACACTATGATGATCTTGCATTTGTATTTGGAAAAGACAGAGCAACGGGGATGGGCGCGGAAACCCCAGGGGAAATGGCCTCTAACGCTGCAGAACA
AATGGAGGAGGAGATCCGACTGGGATCGCAAGACTTATTCGGGATGGAGCAACGACCAATGGAGAATCCATGCACTGCTGATGTAGGGGAGGAAGAATTGCCAGAGACTC
CTACTAGTAGACGTAATACATCTGGCACGTCTTCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAATAATGGAC
ATCCAAACAACTCACATGCAACGCCTTCTATCGTGGCGCGGATGA
Protein sequenceShow/hide protein sequence
MTGTSKHSKHTWTKVEDARLVESLVYLVHNGCGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKVRTLKMQYNAVAEMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKGMR
NKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAEQMEEEIRLGSQDLFGMEQRPMENPCTADVGEEELPETPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTIMD
IQTTHMQRLLSWRG