CuGenDBv2

Tan0008427 (gene) of Snake gourd v1 genome

Gene ID: Tan0008427
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:39308731..39310409
RNA-Seq Expression: Tan0008427
Synteny: Tan0008427
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
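
The genome location field can be split into a sequence name and a coordinate pair to get the length of the gene span. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are illustrative only.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'SEQ:START..END' into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


name, start, end = parse_location("LG01:39308731..39310409")
print(name, span_length(start, end))  # LG01 1679
```

Under that assumption the gene spans 1679 bases on LG01.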


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0034863.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 6.0e-97; identity 59.88%
Query:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----
        S RVV+  ST   VV  +SS ++R    LG PRRSGRV   P  YM L ET  VI D +  DPLT+ +AM DVDKDEWIKAM+ +         W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI        G VQTFKARLVAKG+TQVEGVDYE+ FS VAM+KSIRI L+IAAY+DYE+W+M VKT  LN NL+ETIYM QP+GFI+
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH
                     Q Q +    + F+L     VW SIKQGCI DSTMEAEYVAAC+ AKE VWLRKF+T+LEVVPNM+ PITL+CDNSGA+AN REPRSH
Subjt:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH

Query:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        K GK IERKY LIREI+HRGDV VTQI S HN+ADPFT PLTAK
Subjt:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

KAA0049626.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 1.1e-93; identity 46.77%
Query:  SARVVDGASTSTSVVDPNSSSQVRSLE-LGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK-----WSRCTPILS-
        S RVVD    S+ V + ++S Q    + L MPRRSG++V Q  RY+ L ETQVVIPDD  EDPL+Y QAM DVDKD+W+KAMD +     ++    ++  
Subjt:  SARVVDGASTSTSVVDPNSSSQVRSLE-LGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK-----WSRCTPILS-

Query:  -------GSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQ-
               G  WI     +  G VQTFK RLVAKG+TQ EGVDYEETFSPVAM+KSIRI ++IA +YDYE+W+MDVKTAFLNGNL+E+I+M QP+GFI Q 
Subjt:  -------GSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQ-

Query:  ----------------------------------------------------------------------------------------GQEQKEIYF---
                                                                                                 +EQ +I +   
Subjt:  ----------------------------------------------------------------------------------------GQEQKEIYF---

Query:  --------------------------GSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSR
                                  GS F LNGG VVW SIKQGCI DSTMEAEYV ACE AKEVVWLRKF+ +LEVVPNM LPITL+CDNSGA+ANS+
Subjt:  --------------------------GSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSR

Query:  EPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        EPRSHKRGK IERKY LIRE+V +GDV VT+I S+HN+ADP T  L AK
Subjt:  EPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

KAA0067450.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]; e-value 2.3e-93; identity 48.06%
Query:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMD---------QKW-----
        S RVV+    S+ V +  +S Q   S  L MPRRSGRVV QP RY+ L ETQVVIPDD   DPL+Y QAM DVDKD+W+K++D           W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMD---------QKW-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI     +  G VQT K +LVAKG+TQ EGVD+EETFSPVAM+KSI+I L+IA + DYE+W+MDVKT FLN N +E+I+M QP+GFI 
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  QGQEQK----------------------------------------------------------------------------------------------
        QGQEQK                                                                                              
Subjt:  QGQEQK----------------------------------------------------------------------------------------------

Query:  --------EIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERK
                +   GS F LNGGA+VW SIKQGCI  STMEAEYVAACE AKE VWLRKF+ + EVVPNMNLPITL+CDNSGA+ANS+EPRSHKRGK IERK
Subjt:  --------EIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERK

Query:  YRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAKGLRV
        Y LIREIV +GDV VT+I S+ N+ADPFT  LTAK  RV
Subjt:  YRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAKGLRV

TYJ96675.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 2.7e-97; identity 60.17%
Query:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----
        S RVV+  ST   VV  +SS ++R    LG PRRSGRV   P  YM L ET  VI D +  DPLT+ +AM DVDKDEWIKAM+ +         W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI        G VQTFKARLVAKG+TQVEGVDYE+ FS VAM+KSIRI L+IAAY+DYE+W+M VKT  LN NL+ETIYM QP+GFI+
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH
                     Q Q +    + F+L     VW SIKQGCI DSTMEAEYVAAC+ AKE VWLRKF+T+LEVVPNM+ PITL+CDNSGA+AN REPRSH
Subjt:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH

Query:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        K GK IERKY LIREIVHRGDV VTQI S HN+ADPFT PLTAK
Subjt:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 8.9e-101; identity 51.72%
Query:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----
        S RVVD    S+ V +  +S Q   S  L MPRRSGRVV QP RY+ L ETQVVIPDD  EDPL+Y QAM DVDKD+W+KAMD +         W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI     +  G VQTFKARLVAKG+TQ EGVDYEETFSPVAM+KSIRI L+IA +YDYE+W+MDVKTAFLNGNL+E+I+M QP+ FI 
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  QGQEQK------EIY-------------------------------------------------------------------------------------
        QGQEQK       IY                                                                                     
Subjt:  QGQEQK------EIY-------------------------------------------------------------------------------------

Query:  -------------FGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIE
                      GS F LN GA+VW SIKQGCI DSTMEAEYVAACE AKE VWLRKF+ +LEVVPNMNLPITL+CDNSGA+ANS+EPRSHKRGK IE
Subjt:  -------------FGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIE

Query:  RKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        RKY LIREIV RGDV VT+I S+HN+ADPFT  LTAK
Subjt:  RKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SWF4 Gag/pol protein; e-value 2.9e-97; identity 59.88%
Query:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----
        S RVV+  ST   VV  +SS ++R    LG PRRSGRV   P  YM L ET  VI D +  DPLT+ +AM DVDKDEWIKAM+ +         W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI        G VQTFKARLVAKG+TQVEGVDYE+ FS VAM+KSIRI L+IAAY+DYE+W+M VKT  LN NL+ETIYM QP+GFI+
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH
                     Q Q +    + F+L     VW SIKQGCI DSTMEAEYVAAC+ AKE VWLRKF+T+LEVVPNM+ PITL+CDNSGA+AN REPRSH
Subjt:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH

Query:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        K GK IERKY LIREI+HRGDV VTQI S HN+ADPFT PLTAK
Subjt:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

A0A5A7U616 Gag/pol protein; e-value 5.1e-94; identity 46.77%
Query:  SARVVDGASTSTSVVDPNSSSQVRSLE-LGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK-----WSRCTPILS-
        S RVVD    S+ V + ++S Q    + L MPRRSG++V Q  RY+ L ETQVVIPDD  EDPL+Y QAM DVDKD+W+KAMD +     ++    ++  
Subjt:  SARVVDGASTSTSVVDPNSSSQVRSLE-LGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK-----WSRCTPILS-

Query:  -------GSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQ-
               G  WI     +  G VQTFK RLVAKG+TQ EGVDYEETFSPVAM+KSIRI ++IA +YDYE+W+MDVKTAFLNGNL+E+I+M QP+GFI Q 
Subjt:  -------GSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQ-

Query:  ----------------------------------------------------------------------------------------GQEQKEIYF---
                                                                                                 +EQ +I +   
Subjt:  ----------------------------------------------------------------------------------------GQEQKEIYF---

Query:  --------------------------GSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSR
                                  GS F LNGG VVW SIKQGCI DSTMEAEYV ACE AKEVVWLRKF+ +LEVVPNM LPITL+CDNSGA+ANS+
Subjt:  --------------------------GSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSR

Query:  EPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        EPRSHKRGK IERKY LIRE+V +GDV VT+I S+HN+ADP T  L AK
Subjt:  EPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

A0A5A7VJV4 Retrovirus-related pol polyprotein from transposon tnt 1-94; e-value 1.1e-93; identity 48.06%
Query:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMD---------QKW-----
        S RVV+    S+ V +  +S Q   S  L MPRRSGRVV QP RY+ L ETQVVIPDD   DPL+Y QAM DVDKD+W+K++D           W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMD---------QKW-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI     +  G VQT K +LVAKG+TQ EGVD+EETFSPVAM+KSI+I L+IA + DYE+W+MDVKT FLN N +E+I+M QP+GFI 
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  QGQEQK----------------------------------------------------------------------------------------------
        QGQEQK                                                                                              
Subjt:  QGQEQK----------------------------------------------------------------------------------------------

Query:  --------EIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERK
                +   GS F LNGGA+VW SIKQGCI  STMEAEYVAACE AKE VWLRKF+ + EVVPNMNLPITL+CDNSGA+ANS+EPRSHKRGK IERK
Subjt:  --------EIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERK

Query:  YRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAKGLRV
        Y LIREIV +GDV VT+I S+ N+ADPFT  LTAK  RV
Subjt:  YRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAKGLRV

A0A5D3BDY3 Gag/pol protein; e-value 1.3e-97; identity 60.17%
Query:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----
        S RVV+  ST   VV  +SS ++R    LG PRRSGRV   P  YM L ET  VI D +  DPLT+ +AM DVDKDEWIKAM+ +         W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVRS-LELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI        G VQTFKARLVAKG+TQVEGVDYE+ FS VAM+KSIRI L+IAAY+DYE+W+M VKT  LN NL+ETIYM QP+GFI+
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH
                     Q Q +    + F+L     VW SIKQGCI DSTMEAEYVAAC+ AKE VWLRKF+T+LEVVPNM+ PITL+CDNSGA+AN REPRSH
Subjt:  Q-----------GQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSH

Query:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        K GK IERKY LIREIVHRGDV VTQI S HN+ADPFT PLTAK
Subjt:  KRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

A0A5D3BUN8 Gag/pol protein; e-value 4.3e-101; identity 51.72%
Query:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----
        S RVVD    S+ V +  +S Q   S  L MPRRSGRVV QP RY+ L ETQVVIPDD  EDPL+Y QAM DVDKD+W+KAMD +         W     
Subjt:  SARVVDGASTSTSVVDPNSSSQVR-SLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQK---------W-----

Query:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV
             PI  G  WI     +  G VQTFKARLVAKG+TQ EGVDYEETFSPVAM+KSIRI L+IA +YDYE+W+MDVKTAFLNGNL+E+I+M QP+ FI 
Subjt:  -SRCTPILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIV

Query:  QGQEQK------EIY-------------------------------------------------------------------------------------
        QGQEQK       IY                                                                                     
Subjt:  QGQEQK------EIY-------------------------------------------------------------------------------------

Query:  -------------FGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIE
                      GS F LN GA+VW SIKQGCI DSTMEAEYVAACE AKE VWLRKF+ +LEVVPNMNLPITL+CDNSGA+ANS+EPRSHKRGK IE
Subjt:  -------------FGSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIE

Query:  RKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK
        RKY LIREIV RGDV VT+I S+HN+ADPFT  LTAK
Subjt:  RKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAK

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein; e-value 1.7e-17; identity 54.88%
Query:  SLWINLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKG
        S+  N +GN   +KARLVA+GFTQ   +DYEETF+PVA + S R  L++   Y+ +V +MDVKTAFLNG L E IYM  P+G
Subjt:  SLWINLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKG

P04146 Copia protein; e-value 5.6e-13; identity 33.93%
Query:  VVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQH
        + W + +Q  +  S+ EAEY+A  E  +E +WL+  +T++ +   +  PI ++ DN G I+ +  P  HKR K I+ KY   RE V    + +  I +++
Subjt:  VVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQH

Query:  NVADPFTNPLTA
         +AD FT PL A
Subjt:  NVADPFTNPLTA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 3.4e-18; identity 35.88%
Query:  RRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQKWS--------RCTPILSGS-----LWINLMG-----NVQTFKARLVA
        RRS R   +  RY     T+ V+  DD  +P +  + +   +K++ +KAM ++          +   +  G       W+  +       +  +KARLV 
Subjt:  RRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQKWS--------RCTPILSGS-----LWINLMG-----NVQTFKARLVA

Query:  KGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQGQE
        KGF Q +G+D++E FSPV  + SIR  L++AA  D EV ++DVKTAFL+G+L+E IYM+QP+GF V G++
Subjt:  KGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQGQE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 4.5e-15; identity 38.46%
Query:  GSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERKYRLIREIVHRGD
        G  F  +GGA+ W S  Q C+  ST EAEY+AA ET KE++WL++F+  L +         ++CD+  AI  S+    H R K I+ +Y  IRE+V    
Subjt:  GSTFILNGGAVVWWSIKQGCIGDSTMEAEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERKYRLIREIVHRGD

Query:  VTVTQIVSQHNVADPFT
        + V +I +  N AD  T
Subjt:  VTVTQIVSQHNVADPFT

P92520 Uncharacterized mitochondrial protein AtMg00820; e-value 8.0e-04; identity 51.16%
Query:  GNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIA
        G +   KARLVAKGF Q EG+ + ET+SPV    +IR  L +A
Subjt:  GNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e-value 4.5e-15; identity 47.78%
Query:  GSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFI
        G  WI     N  G++  +KARLVAKG+ Q  G+DY ETFSPV    SIRI L +A    + + ++DV  AFL G L + +YM QP GFI
Subjt:  GSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e-value 3.1e-16; identity 39.42%
Query:  DPLTYDQAMVDVDKDEWIKAM---------DQKWSRCTP-----ILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFL
        +P T  QAM D   D W +AM         +  W    P      + G  WI     N  G++  +KARLVAKG+ Q  G+DY ETFSPV    SIRI L
Subjt:  DPLTYDQAMVDVDKDEWIKAM---------DQKWSRCTP-----ILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFL

Query:  AIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFI
         +A    + + ++DV  AFL G L + +YM QP GF+
Subjt:  AIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFI

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e-value 5.7e-21; identity 40.58%
Query:  EDPLTYDQAMVDVDKDEWIKAMDQK---------WSRCT------PILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRI
        ++P TY++A   +    W  AMD +         W  CT      PI  G  W+     N  G ++ +KARLVAKG+TQ EG+D+ ETFSPV  + S+++
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQK---------WSRCT------PILSGSLWI-----NLMGNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRI

Query:  FLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGF
         LAI+A Y++ + ++D+  AFLNG+LDE IYM  P G+
Subjt:  FLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e-value 5.7e-05; identity 51.16%
Query:  GNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIA
        G +   KARLVAKGF Q EG+ + ET+SPV    +IR  L +A
Subjt:  GNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIA
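
The % identity values in the hit lists can be related to the middle line of each alignment, where a letter marks an identical residue, `+` a similar one, and a space a mismatch. The sketch below recomputes the figure for the ATMG00820.1 alignment above, assuming identity is the number of identical columns divided by the alignment length (the usual BLAST convention; this page does not confirm how the value was derived). The function name `percent_identity` is illustrative.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: letters in the midline mark identical residues, and the
    alignment length equals the length of the (gapped) query line.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query), 2)


# Query and midline copied from the ATMG00820.1 alignment.
query = "GNVQTFKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIA"
midline = "G +   KARLVAKGF Q EG+ + ET+SPV    +IR  L +A"

print(percent_identity(query, midline))  # 51.16
```

Here 22 of the 43 columns are identical, which agrees with the 51.16 reported for this hit.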


Sequences
CDS sequence
ATGGACAGTACATCAGCAAGAGTTGTTGATGGTGCTAGTACATCAACAAGTGTTGTTGATCCTAACTCGTCTAGTCAAGTCCGTTCCCTAGAGTTGGGAATGCCTCGACG
TAGTGGGAGAGTTGTGAGACAGCCTGAACGTTACATGGATTTAGATGAAACCCAAGTCGTCATTCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAAGCAATGG
TTGATGTTGACAAGGACGAATGGATTAAAGCTATGGACCAGAAATGGAGTCGATGTACTCCAATTCTGTCTGGGAGCTTGTGGATCAACCTGATGGGGAACGTGCAGACC
TTCAAAGCACGACTAGTGGCAAAGGGTTTTACCCAGGTCGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATCAGGATCTTTCTGGCCAT
TGCCGCGTATTATGACTACGAGGTATGGAAGATGGATGTCAAGACCGCCTTTCTGAATGGCAACCTTGACGAAACCATCTACATGGACCAGCCCAAAGGGTTCATTGTCC
AAGGCCAAGAGCAAAAGGAAATCTACTTCGGGTCAACCTTCATTCTGAATGGAGGAGCTGTAGTATGGTGGAGCATCAAGCAGGGATGCATCGGCGATTCCACTATGGAA
GCGGAGTACGTTGCGGCTTGTGAAACTGCAAAGGAAGTTGTTTGGCTAAGGAAGTTCATGACAAATTTGGAAGTTGTTCCAAATATGAACTTGCCGATCACACTGTTTTG
TGACAACAGTGGTGCAATAGCCAACTCCAGGGAACCTCGGAGCCATAAGAGAGGCAAACCCATTGAGCGGAAGTATCGCCTGATACGGGAGATTGTGCACCGCGGAGACG
TGACAGTCACGCAGATAGTGTCGCAGCACAACGTAGCTGATCCATTTACAAATCCCCTCACGGCTAAGGGTTTGAGGGTCACCTAG
mRNA sequence
ATGGACAGTACATCAGCAAGAGTTGTTGATGGTGCTAGTACATCAACAAGTGTTGTTGATCCTAACTCGTCTAGTCAAGTCCGTTCCCTAGAGTTGGGAATGCCTCGACG
TAGTGGGAGAGTTGTGAGACAGCCTGAACGTTACATGGATTTAGATGAAACCCAAGTCGTCATTCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAAGCAATGG
TTGATGTTGACAAGGACGAATGGATTAAAGCTATGGACCAGAAATGGAGTCGATGTACTCCAATTCTGTCTGGGAGCTTGTGGATCAACCTGATGGGGAACGTGCAGACC
TTCAAAGCACGACTAGTGGCAAAGGGTTTTACCCAGGTCGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATCAGGATCTTTCTGGCCAT
TGCCGCGTATTATGACTACGAGGTATGGAAGATGGATGTCAAGACCGCCTTTCTGAATGGCAACCTTGACGAAACCATCTACATGGACCAGCCCAAAGGGTTCATTGTCC
AAGGCCAAGAGCAAAAGGAAATCTACTTCGGGTCAACCTTCATTCTGAATGGAGGAGCTGTAGTATGGTGGAGCATCAAGCAGGGATGCATCGGCGATTCCACTATGGAA
GCGGAGTACGTTGCGGCTTGTGAAACTGCAAAGGAAGTTGTTTGGCTAAGGAAGTTCATGACAAATTTGGAAGTTGTTCCAAATATGAACTTGCCGATCACACTGTTTTG
TGACAACAGTGGTGCAATAGCCAACTCCAGGGAACCTCGGAGCCATAAGAGAGGCAAACCCATTGAGCGGAAGTATCGCCTGATACGGGAGATTGTGCACCGCGGAGACG
TGACAGTCACGCAGATAGTGTCGCAGCACAACGTAGCTGATCCATTTACAAATCCCCTCACGGCTAAGGGTTTGAGGGTCACCTAG
Protein sequence
MDSTSARVVDGASTSTSVVDPNSSSQVRSLELGMPRRSGRVVRQPERYMDLDETQVVIPDDDCEDPLTYDQAMVDVDKDEWIKAMDQKWSRCTPILSGSLWINLMGNVQT
FKARLVAKGFTQVEGVDYEETFSPVAMVKSIRIFLAIAAYYDYEVWKMDVKTAFLNGNLDETIYMDQPKGFIVQGQEQKEIYFGSTFILNGGAVVWWSIKQGCIGDSTME
AEYVAACETAKEVVWLRKFMTNLEVVPNMNLPITLFCDNSGAIANSREPRSHKRGKPIERKYRLIREIVHRGDVTVTQIVSQHNVADPFTNPLTAKGLRVT
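
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code, which this page does not state explicitly, and uses only the first 33 and last 18 bases of the CDS listed above to stay short; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 bases of the CDS: matches the start of the protein sequence.
print(translate("ATGGACAGTACATCAGCAAGAGTTGTTGATGGT"))  # MDSTSARVVDG
# Last 18 bases of the CDS, ending in the TAG stop codon.
print(translate("GGTTTGAGGGTCACCTAG"))  # GLRVT
```

Passing the full CDS as one string, with line breaks removed, should reproduce the protein sequence in the same way.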