; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001041 (gene) of Snake gourd v1 genome

Gene IDTan0001041
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon gag protein
Genome locationLG07:14096169..14100219
RNA-Seq ExpressionTan0001041
SyntenyTan0001041
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]9.4e-2942.5Show/hide
Query:  SQIENQDVAESSQTPGVVTHLKNQMRNLALCDSCQFRGHKR-----KDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKFSFTSY--------------
        SQ + + + +  +   VV H K +  +     S   + H +     K K +  K   + KP++  +    QPRQ +T  +F   S+              
Subjt:  SQIENQDVAESSQTPGVVTHLKNQMRNLALCDSCQFRGHKR-----KDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKFSFTSY--------------

Query:  -----------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESM
                   Y + EEVD+S E+KQRTSVFDRIKP TTR SVFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRLK+T+ Q +  
Subjt:  -----------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESM

Query:  MNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL
        M + KA+ F +E   +K+ S VPSRMKRK SV INTEG L
Subjt:  MNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]1.2e-2862.02Show/hide
Query:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELF-D
        Y + EEVD+S E+KQRTS+FDRIKP TTR  VFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRLK+T+ Q +  M + KA+ F +
Subjt:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELF-D

Query:  EVSHNKVQSTVPSRMKRKFSVLINTEGFL
        E   +K+ S VPSRMKRK SV INTEG L
Subjt:  EVSHNKVQSTVPSRMKRKFSVLINTEGFL

KAA0062746.1 Retrotransposon gag protein [Cucumis melo var. makuwa]2.7e-2858.46Show/hide
Query:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDE
        Y++++EVD+S E+K+RTS+FDRIKPSTTR SVF+R+SMA  EEEN  LTS+S +TS F+RL++STSKK RPS S FDRLK+T+ Q +  + T K + F E
Subjt:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDE

Query:  VSHN--KVQSTVPSRMKRKFSVLINTEGFL
         +++  K+ + VPSRMKRK SV I+TEG L
Subjt:  VSHN--KVQSTVPSRMKRKFSVLINTEGFL

KAA0063719.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-2856.67Show/hide
Query:  PRQLVTSTKFSFTSY------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRL
        P++++  T    TS       Y + +EVD+S E+KQRTSVFDRIKP TTR SVFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRL
Subjt:  PRQLVTSTKFSFTSY------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRL

Query:  KVTSGQDESMMNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL
        K+T+ Q +  M   KA+ F +E   +K+ S VPSRMKRK SV INTEG L
Subjt:  KVTSGQDESMMNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL

TYK14888.1 gag protease polyprotein [Cucumis melo var. makuwa]3.6e-2862.02Show/hide
Query:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELF-D
        Y +++EV++S E+ QRTSVFDRIKPSTTR SVFQR+SMA  EEEN   T    RTS F+RL++STSKK RPSTS FDRLK+T+ Q +  M +SKA+ F +
Subjt:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELF-D

Query:  EVSHNKVQSTVPSRMKRKFSVLINTEGFL
        E   +K+ S VPSRMKRK SV INTEG L
Subjt:  EVSHNKVQSTVPSRMKRKFSVLINTEGFL

TrEMBL top hitse value%identityAlignment
A0A5A7TQ06 Retrotransposon gag protein4.5e-2942.5Show/hide
Query:  SQIENQDVAESSQTPGVVTHLKNQMRNLALCDSCQFRGHKR-----KDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKFSFTSY--------------
        SQ + + + +  +   VV H K +  +     S   + H +     K K +  K   + KP++  +    QPRQ +T  +F   S+              
Subjt:  SQIENQDVAESSQTPGVVTHLKNQMRNLALCDSCQFRGHKR-----KDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKFSFTSY--------------

Query:  -----------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESM
                   Y + EEVD+S E+KQRTSVFDRIKP TTR SVFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRLK+T+ Q +  
Subjt:  -----------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESM

Query:  MNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL
        M + KA+ F +E   +K+ S VPSRMKRK SV INTEG L
Subjt:  MNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL

A0A5A7V8H3 Retrotransposon gag protein1.3e-2858.46Show/hide
Query:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDE
        Y++++EVD+S E+K+RTS+FDRIKPSTTR SVF+R+SMA  EEEN  LTS+S +TS F+RL++STSKK RPS S FDRLK+T+ Q +  + T K + F E
Subjt:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDE

Query:  VSHN--KVQSTVPSRMKRKFSVLINTEGFL
         +++  K+ + VPSRMKRK SV I+TEG L
Subjt:  VSHN--KVQSTVPSRMKRKFSVLINTEGFL

A0A5A7VDY3 Ty3-gypsy retrotransposon protein1.0e-2856.67Show/hide
Query:  PRQLVTSTKFSFTSY------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRL
        P++++  T    TS       Y + +EVD+S E+KQRTSVFDRIKP TTR SVFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRL
Subjt:  PRQLVTSTKFSFTSY------YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRL

Query:  KVTSGQDESMMNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL
        K+T+ Q +  M   KA+ F +E   +K+ S VPSRMKRK SV INTEG L
Subjt:  KVTSGQDESMMNTSKAELF-DEVSHNKVQSTVPSRMKRKFSVLINTEGFL

A0A5D3BBF9 Gag protease polyprotein5.9e-2962.02Show/hide
Query:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELF-D
        Y + EEVD+S E+KQRTS+FDRIKP TTR  VFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRLK+T+ Q +  M + KA+ F +
Subjt:  YINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELF-D

Query:  EVSHNKVQSTVPSRMKRKFSVLINTEGFL
        E   +K+ S VPSRMKRK SV INTEG L
Subjt:  EVSHNKVQSTVPSRMKRKFSVLINTEGFL

A0A5D3D209 Retrotransposon gag protein3.8e-2846.3Show/hide
Query:  HLKNQMRNLALCDSCQFRGHKRKDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKF-------------------------SFTSYYINTEEVDDSKEV
        H KN   N++         HK+K +R NKK   + KP++  +    QPR+ +T  +F                            + Y + EEVD+S E+
Subjt:  HLKNQMRNLALCDSCQFRGHKRKDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKF-------------------------SFTSYYINTEEVDDSKEV

Query:  KQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDEVS-HNKVQSTVPS
        KQRTSVFDRIKP TTR  VFQR+SMA  EEEN   TS+  RTS F+RL++STSKK RPSTS FDRLK+T+ Q +  M + KA+ F E +  NK+ S VPS
Subjt:  KQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLTSSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDEVS-HNKVQSTVPS

Query:  RMKRKFSVLINTEGFL
        RMKRK SV INTEG L
Subjt:  RMKRKFSVLINTEGFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAACTCATACATAGGTTCTATTACCCACAGTTGCTCCAATGAGTTGAGGTCGCAAGAAGATGAAATAATGACACTGTTGGGGGCATCAACTAACAACAAGTTCCT
GATCAAGTGTAATCCTTTGTTCGATTTTGATTCTGACGTAGTGTCCGTCATGATGACTGAAACAAACACTATGGAAGAAAGAATGGCGAGGTTGCAGGAACAAATCAACG
ACTTAATGAAGGTGATTAAAGAAAGAGATTCTCAAATCGCATACTTAAAGAGTCAGATTGAGAATCAAGATGTCGCTGAATCAAGTCAAACTCCTGGTGTTGTGACTCAT
TTGAAGAATCAAATGAGAAATTTAGCTTTATGCGATTCTTGTCAATTTAGAGGCCACAAACGCAAAGACAAACGTCAAAATAAAAAGGATTCAAGGAGATCTAAACCAAT
GAGAAAAGGGAATACAAAATCTTCTCAACCTCGCCAACTAGTTACCTCAACAAAGTTCTCATTTACTTCGTATTACATCAACACAGAAGAAGTTGATGATTCAAAAGAAG
TCAAGCAAAGGACTTCTGTCTTCGATCGCATCAAGCCTTCAACTACTCGACCTTCAGTCTTCCAAAGGATGAGTATGGCCGCGACGGAAGAAGAAAACCTAGATCTAACT
TCTAGCTCCATTCGAACTTCAGTCTTTCAAAGGCTAAATGTCTCTACTTCAAAGAAACGCCGACCTTCAACATCTGTTTTTGACCGCCTCAAAGTAACAAGTGGTCAAGA
TGAAAGCATGATGAACACCTCGAAAGCAGAACTGTTCGATGAGGTGAGCCACAACAAGGTCCAAAGTACCGTCCCTTCACGTATGAAAAGGAAGTTCTCCGTTCTCATAA
ATACAGAGGGCTTCTTGAAGTTCGAGGACTCGCATGCCGCTTCCTCTCAAAGTTCGAGGACTCGCCGCGTCGCTTCCTCTCAAAGTTCGAGGACTCCGTCGTCGCTTCCT
CTCAAAGTTCGAGGACTCCGTCGTCGCTTCCTCTCAAAGCTCGAGGGCTCCGTCGTCGCTTCTTCTCAAAGCTCGAAGACTCCATCGTCGCTTCTTTTCAAAGCTCGAGG
GCTCCTTCGAGGACTCTGTCGTCGCTTCCTCTCAAAGCTCGAAGACTCCGTCGTCGCTTCTTCTCAAAGCTTGAGGACTCCGTCGTCGCTTCTTCTCAAAACTCGAGGAC
TCCGTTGCCGCTTCCTCTCAACGTTCGAGGACTCCGTCGTCGCTTCTTCTCAAGTTTCGAGGACTTCGTCGTCGCTTCTTCTCAAAGCTCGAAGACTTCGTCGTCGCTTC
TTCTCAAAGCTCGAGGACTCCAACGTCTTCAAAGACTCTGGCGGTGCGTCGTCTTCAAAGACTTTGGCGGTGCTTCGTCTTCAAAGTATTTGGCAGTGCGGTCGCTTCGT
CTTCAAAGTCTCCGGCGATGCGGTAGCTTCGTCTTCAAAGTCTTTGGCAGTGCGGTCGCTTCGTCTTCAAAGTCTCCGGCGGTGCGGTCGCTTCGTCTTCAAAGTCTCCG
ACAGCGGCGGCCACATCTCCAAGGTCCGGTCTCCGGCGCGGCGACTACATCTCCAAGGTCCGGTCTTCTGCGGCGACTACAACTCCAAGGTCCGGTCTCCGGCGACGACG
GCTACATCTCCAAGGTCCGGTCTCCGGCGGTGGCGGCTTCATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAACTCATACATAGGTTCTATTACCCACAGTTGCTCCAATGAGTTGAGGTCGCAAGAAGATGAAATAATGACACTGTTGGGGGCATCAACTAACAACAAGTTCCT
GATCAAGTGTAATCCTTTGTTCGATTTTGATTCTGACGTAGTGTCCGTCATGATGACTGAAACAAACACTATGGAAGAAAGAATGGCGAGGTTGCAGGAACAAATCAACG
ACTTAATGAAGGTGATTAAAGAAAGAGATTCTCAAATCGCATACTTAAAGAGTCAGATTGAGAATCAAGATGTCGCTGAATCAAGTCAAACTCCTGGTGTTGTGACTCAT
TTGAAGAATCAAATGAGAAATTTAGCTTTATGCGATTCTTGTCAATTTAGAGGCCACAAACGCAAAGACAAACGTCAAAATAAAAAGGATTCAAGGAGATCTAAACCAAT
GAGAAAAGGGAATACAAAATCTTCTCAACCTCGCCAACTAGTTACCTCAACAAAGTTCTCATTTACTTCGTATTACATCAACACAGAAGAAGTTGATGATTCAAAAGAAG
TCAAGCAAAGGACTTCTGTCTTCGATCGCATCAAGCCTTCAACTACTCGACCTTCAGTCTTCCAAAGGATGAGTATGGCCGCGACGGAAGAAGAAAACCTAGATCTAACT
TCTAGCTCCATTCGAACTTCAGTCTTTCAAAGGCTAAATGTCTCTACTTCAAAGAAACGCCGACCTTCAACATCTGTTTTTGACCGCCTCAAAGTAACAAGTGGTCAAGA
TGAAAGCATGATGAACACCTCGAAAGCAGAACTGTTCGATGAGGTGAGCCACAACAAGGTCCAAAGTACCGTCCCTTCACGTATGAAAAGGAAGTTCTCCGTTCTCATAA
ATACAGAGGGCTTCTTGAAGTTCGAGGACTCGCATGCCGCTTCCTCTCAAAGTTCGAGGACTCGCCGCGTCGCTTCCTCTCAAAGTTCGAGGACTCCGTCGTCGCTTCCT
CTCAAAGTTCGAGGACTCCGTCGTCGCTTCCTCTCAAAGCTCGAGGGCTCCGTCGTCGCTTCTTCTCAAAGCTCGAAGACTCCATCGTCGCTTCTTTTCAAAGCTCGAGG
GCTCCTTCGAGGACTCTGTCGTCGCTTCCTCTCAAAGCTCGAAGACTCCGTCGTCGCTTCTTCTCAAAGCTTGAGGACTCCGTCGTCGCTTCTTCTCAAAACTCGAGGAC
TCCGTTGCCGCTTCCTCTCAACGTTCGAGGACTCCGTCGTCGCTTCTTCTCAAGTTTCGAGGACTTCGTCGTCGCTTCTTCTCAAAGCTCGAAGACTTCGTCGTCGCTTC
TTCTCAAAGCTCGAGGACTCCAACGTCTTCAAAGACTCTGGCGGTGCGTCGTCTTCAAAGACTTTGGCGGTGCTTCGTCTTCAAAGTATTTGGCAGTGCGGTCGCTTCGT
CTTCAAAGTCTCCGGCGATGCGGTAGCTTCGTCTTCAAAGTCTTTGGCAGTGCGGTCGCTTCGTCTTCAAAGTCTCCGGCGGTGCGGTCGCTTCGTCTTCAAAGTCTCCG
ACAGCGGCGGCCACATCTCCAAGGTCCGGTCTCCGGCGCGGCGACTACATCTCCAAGGTCCGGTCTTCTGCGGCGACTACAACTCCAAGGTCCGGTCTCCGGCGACGACG
GCTACATCTCCAAGGTCCGGTCTCCGGCGGTGGCGGCTTCATCTTTAA
Protein sequenceShow/hide protein sequence
MSNSYIGSITHSCSNELRSQEDEIMTLLGASTNNKFLIKCNPLFDFDSDVVSVMMTETNTMEERMARLQEQINDLMKVIKERDSQIAYLKSQIENQDVAESSQTPGVVTH
LKNQMRNLALCDSCQFRGHKRKDKRQNKKDSRRSKPMRKGNTKSSQPRQLVTSTKFSFTSYYINTEEVDDSKEVKQRTSVFDRIKPSTTRPSVFQRMSMAATEEENLDLT
SSSIRTSVFQRLNVSTSKKRRPSTSVFDRLKVTSGQDESMMNTSKAELFDEVSHNKVQSTVPSRMKRKFSVLINTEGFLKFEDSHAASSQSSRTRRVASSQSSRTPSSLP
LKVRGLRRRFLSKLEGSVVASSQSSKTPSSLLFKARGLLRGLCRRFLSKLEDSVVASSQSLRTPSSLLLKTRGLRCRFLSTFEDSVVASSQVSRTSSSLLLKARRLRRRF
FSKLEDSNVFKDSGGASSSKTLAVLRLQSIWQCGRFVFKVSGDAVASSSKSLAVRSLRLQSLRRCGRFVFKVSDSGGHISKVRSPARRLHLQGPVFCGDYNSKVRSPATT
ATSPRSGLRRWRLHL