CuGenDBv2

Tan0017547 (gene) of Snake gourd v1 genome

Gene ID: Tan0017547
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: LG11:26882353..26885141
RNA-Seq Expression: Tan0017547
Synteny: Tan0017547
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domains: NA
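
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and measured as below. `Location` and `parse_location` are hypothetical names introduced for illustration and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG11:26882353..26885141")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2789 bp on LG11.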


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0063620.1 protein IQ-DOMAIN 14-like [Cucumis melo var. makuwa] | e-value: 8.1e-35 | % identity: 48.7
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ
        ER     L+N+NLE+VYKERLN+NLNE  K     +G                                        N PEYY++V KPTAETTL+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        PRHS FV DDY  YPNYMAKTESSKA  R Q                  RT +R+ LNDQIQ+SLQ+LKH GYENHN PW+MKL+Q  KTSKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

KAG7021149.1 Protein IQ-DOMAIN 14, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 2.8e-27 | % identity: 44.32
Query:  KTSERLNKLDLVNSNLEKVYKERLNMNLNET---------------------------C-------------KKISVSAGLNVPEYYVVVPKPTAETTLF
        K  ER     L N+NLEKVYKE LNMNL+ET                           C             K +S+S   N+ EYYV++ KPTA  TL 
Subjt:  KTSERLNKLDLVNSNLEKVYKERLNMNLNET---------------------------C-------------KKISVSAGLNVPEYYVVVPKPTAETTLF

Query:  SMDQPRHSYFVPDDYPFYPNYMAKTESSKATARLQRTVDRVSLND------QIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        SMD PRHS FVPD+YP YPNYMAKTESS+A  R Q    +   +       Q +++LQN+KH GYE+H+  W+MKL+Q+TK+SKN
Subjt:  SMDQPRHSYFVPDDYPFYPNYMAKTESSKATARLQRTVDRVSLND------QIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

XP_008455809.1 PREDICTED: uncharacterized protein LOC103495905 [Cucumis melo] | e-value: 1.4e-34 | % identity: 48.19
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ
        ER     L+N+NLE+ YKERLN+NLNE  K     +G                                        + PEYY++V KPTAETTL+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        PRHS FVPDDY  YPNYMAKTESSKA  R Q                  RT +R+ LNDQIQ+SLQ+LKH GYENHN PW+MKL+Q  KTSKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

XP_011648766.2 uncharacterized protein LOC101218293 [Cucumis sativus] | e-value: 7.9e-30 | % identity: 44.56
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ
        ER      +N+NLE++YKERLN+NLNE  K     +G                                        +  EYY++V KPTA+TTL+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQR------------------TVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
         RHS FVPDDY  YPNYMAKTESS+A  R Q                   T DR++LNDQI +SLQ  KH GYENHN PW+MKL+Q  KTSKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQR------------------TVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

XP_038890615.1 uncharacterized protein LOC120080123 [Benincasa hispida] | e-value: 5.8e-33 | % identity: 46.67
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------LNVP------------------EYYVVVPKPTAETTLFSMDQ
        ER  +  L++ NL++ YKERLNMN+NE  +     +G                     L++P                  EYY++V KPTAET L+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------LNVP------------------EYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQR--------------------TVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        PRHS FVPDDY FYPNYMAKTESS+A  R Q                     T DR+SLNDQIQSSLQ+LKH GYENHN+PW+MKL+Q  K SKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQR--------------------TVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LJD2 DUF4005 domain-containing protein | e-value: 3.8e-30 | % identity: 44.56
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ
        ER      +N+NLE++YKERLN+NLNE  K     +G                                        +  EYY++V KPTA+TTL+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQR------------------TVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
         RHS FVPDDY  YPNYMAKTESS+A  R Q                   T DR++LNDQI +SLQ  KH GYENHN PW+MKL+Q  KTSKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQR------------------TVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

A0A1S3C304 uncharacterized protein LOC103495905 | e-value: 6.7e-35 | % identity: 48.19
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ
        ER     L+N+NLE+ YKERLN+NLNE  K     +G                                        + PEYY++V KPTAETTL+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        PRHS FVPDDY  YPNYMAKTESSKA  R Q                  RT +R+ LNDQIQ+SLQ+LKH GYENHN PW+MKL+Q  KTSKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

A0A5A7V8W2 Protein IQ-DOMAIN 14-like | e-value: 3.9e-35 | % identity: 48.7
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ
        ER     L+N+NLE+VYKERLN+NLNE  K     +G                                        N PEYY++V KPTAETTL+SMDQ
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNETCKKISVSAG---------------------------------------LNVPEYYVVVPKPTAETTLFSMDQ

Query:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        PRHS FV DDY  YPNYMAKTESSKA  R Q                  RT +R+ LNDQIQ+SLQ+LKH GYENHN PW+MKL+Q  KTSKN
Subjt:  PRHSYFVPDDYPFYPNYMAKTESSKATARLQ------------------RTVDRVSLNDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

A0A6J1CPD3 protein IQ-DOMAIN 14-like | e-value: 1.8e-27 | % identity: 45.41
Query:  ERLNKLDLVNSNLEKVYKERLNMNLNE---------------------------TC-------------KKISVSAGLNVPEYYVVVPKPTAETTLFSMD
        ER     LVN NLEK YKERL MNLNE                            C             K  S+    N  EYYV+V KP AE+ LFSMD
Subjt:  ERLNKLDLVNSNLEKVYKERLNMNLNE---------------------------TC-------------KKISVSAGLNVPEYYVVVPKPTAETTLFSMD

Query:  QPRHSYFVPDDYPFYPNYMAKTESSKATARLQ-------------------RTVDRVSL--NDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSK
        QPR+S  +P DYP YP+YMAKTESS+A  R Q                    TV R SL  NDQIQS  QNLKHKGYENHNS W+MKL+Q+ K +K
Subjt:  QPRHSYFVPDDYPFYPNYMAKTESSKATARLQ-------------------RTVDRVSL--NDQIQSSLQNLKHKGYENHNSPWYMKLHQITKTSK

A0A6J1FBW0 protein IQ-DOMAIN 14-like | e-value: 7.0e-24 | % identity: 43.24
Query:  KTSERLNKLDLVNSNLEKVYKERLNMNLNET---------------------------C-------------KKISVSAGLNVPEYYVVVPKPTAETTLF
        K  ER     L N+NLE   KE LNMNL+ET                           C             K  S+S   N+ EYYV++ KPTA  TL 
Subjt:  KTSERLNKLDLVNSNLEKVYKERLNMNLNET---------------------------C-------------KKISVSAGLNVPEYYVVVPKPTAETTLF

Query:  SMDQPRHSYFVPDDYPFYPNYMAKTESSKATARLQRTVDRVSLND------QIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN
        SMD PRHS FVPD+YP YPNYMAKTESS+A  R Q    +   +       Q +S+LQN+KH GYE+H+  W+MKL+Q+TK+SKN
Subjt:  SMDQPRHSYFVPDDYPFYPNYMAKTESSKATARLQRTVDRVSLND------QIQSSLQNLKHKGYENHNSPWYMKLHQITKTSKN

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTCGCCTTGAAGGAAAACTTGCGGCTGGCACAGGAAAGGATGAAGAAACAAGCAGACAAGAAGAGGAGGGATGTGGAATTCCAGGTGGGGGACGCCGTGTTTTTGAA
GTTACAGCCCTATAGGCAACGTTCTCTAGCAAGGAAAAGGTGTGAAAAACTATCCCCGAAATTCTTTGGCCCATATATCATCACGGAGAAGATAGGTCTGTGTGGCTACA
CCGTTTGGAGTTACCTAATGAAGCATCTATACATGACGTTTTTCATCTGCTCGAGGAGGAAGAGCTTCATCAAGACATCTGAGAGACTAAACAAGTTGGATCTTGTTAAC
TCCAATCTCGAAAAAGTATACAAAGAGAGACTAAACATGAATCTCAATGAAACTTGCAAAAAAATCAGTGTCTCAGCAGGACTGAACGTGCCTGAGTACTACGTTGTAGT
GCCCAAGCCAACAGCTGAGACAACCTTATTTTCCATGGATCAGCCAAGACATTCATACTTTGTGCCTGATGACTATCCCTTCTATCCAAATTATATGGCTAAAACAGAAT
CCTCTAAGGCAACAGCCCGATTGCAGAGAACAGTTGACAGAGTGAGTTTAAATGATCAAATCCAAAGCTCTTTACAAAACCTGAAGCATAAAGGTTATGAAAATCACAAC
AGCCCTTGGTACATGAAGCTTCATCAGATCACAAAAACCTCCAAGAATTGA
mRNA sequence
ATGGTCGCCTTGAAGGAAAACTTGCGGCTGGCACAGGAAAGGATGAAGAAACAAGCAGACAAGAAGAGGAGGGATGTGGAATTCCAGGTGGGGGACGCCGTGTTTTTGAA
GTTACAGCCCTATAGGCAACGTTCTCTAGCAAGGAAAAGGTGTGAAAAACTATCCCCGAAATTCTTTGGCCCATATATCATCACGGAGAAGATAGGTCTGTGTGGCTACA
CCGTTTGGAGTTACCTAATGAAGCATCTATACATGACGTTTTTCATCTGCTCGAGGAGGAAGAGCTTCATCAAGACATCTGAGAGACTAAACAAGTTGGATCTTGTTAAC
TCCAATCTCGAAAAAGTATACAAAGAGAGACTAAACATGAATCTCAATGAAACTTGCAAAAAAATCAGTGTCTCAGCAGGACTGAACGTGCCTGAGTACTACGTTGTAGT
GCCCAAGCCAACAGCTGAGACAACCTTATTTTCCATGGATCAGCCAAGACATTCATACTTTGTGCCTGATGACTATCCCTTCTATCCAAATTATATGGCTAAAACAGAAT
CCTCTAAGGCAACAGCCCGATTGCAGAGAACAGTTGACAGAGTGAGTTTAAATGATCAAATCCAAAGCTCTTTACAAAACCTGAAGCATAAAGGTTATGAAAATCACAAC
AGCCCTTGGTACATGAAGCTTCATCAGATCACAAAAACCTCCAAGAATTGA
Protein sequence
MVALKENLRLAQERMKKQADKKRRDVEFQVGDAVFLKLQPYRQRSLARKRCEKLSPKFFGPYIITEKIGLCGYTVWSYLMKHLYMTFFICSRRKSFIKTSERLNKLDLVN
SNLEKVYKERLNMNLNETCKKISVSAGLNVPEYYVVVPKPTAETTLFSMDQPRHSYFVPDDYPFYPNYMAKTESSKATARLQRTVDRVSLNDQIQSSLQNLKHKGYENHN
SPWYMKLHQITKTSKN
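
The protein sequence is expected to be the conceptual translation of the CDS. The following is a minimal Python sketch for checking that, assuming the standard genetic code (NCBI translation table 1) applies and that the input is plain DNA without ambiguity codes. `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above, followed by its final codon (TGA).
print(translate("ATGGTCGCCTTGAAGGAAAACTTG" + "TGA"))
```

Passing the full CDS text (line breaks are ignored) should return a string that can be compared directly with the protein sequence listed above.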