; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC08G143200 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC08G143200
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCmU531Chr08:1277670..1282300
RNA-Seq ExpressionCmUC08G143200
SyntenyCmUC08G143200
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039128.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.3e-18737.08Show/hide
Query:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ
        N   EDEE    EE  E  +E + +++T +  + L ++     K TMKLKG + +KE+V+L+D G THNFIHQ L   + L + + T +G  +G G   +
Subjt:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ

Query:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI
        GKG  +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A +++
Subjt:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI

Query:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------
        + E T K    ++ DE     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                    
Subjt:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------

Query:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP
             GW             VS+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+P
Subjt:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP

Query:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL
        FL+R +LVFFDDIL YS D+  H+KHL ++F VL+DN L+AN KKCVF+ SRIQYLGH     GVEA+ DKI +M +W RP + T LRGFLGL GYYRR 
Subjt:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL

Query:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------
        V+ Y  I  PLTKLL K+ F W  EA  AF  LK  M TIP+L LPD++ P  IE DAS + LG V++                                
Subjt:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------

Query:  --------------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI
                      SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDEELQ+I
Subjt:  --------------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI

Query:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP
        V  L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYV++C  CQ+NK ++     +LQP
Subjt:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP

Query:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT
         PI +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST 
Subjt:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT

Query:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP
        FHP TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ YG+++T+N+ +E  LK  D  L  LK++L + 
Subjt:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP

Query:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME
        QN+MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E
Subjt:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME

Query:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
        +FE               ++  NEWL+K Q L +SEATWES   + Q+F
Subjt:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

KAA0055376.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-18637Show/hide
Query:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG
        E++EE  ++EE ++  +E + +++T+   + L ++     K TMKLKG + +KEVVVL+D G THNFIH  L   + L +   T +G  +G G   +GKG
Subjt:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG

Query:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE
          +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A +K++ +
Subjt:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE

Query:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------
         T K    +  DE     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                       
Subjt:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------

Query:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK
          GW             +S+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+PFL+
Subjt:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK

Query:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD
        R +LVFF DILVYS D+  HEKHL ++F VL+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFL L GYYRR V+ 
Subjt:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD

Query:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------
        Y  I  PLTKLL K+ F W  +A  AF  LK  M TIP+LALPD+S P  IE DAS + LG V++                                   
Subjt:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------

Query:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL
                   SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDE LQ+IV  
Subjt:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL

Query:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI
        L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYVE+C  CQ+NK ++  P  +LQP PI
Subjt:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI

Query:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP
         +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST FHP
Subjt:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP

Query:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ
         TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ YG+++ +N+ +E  LK  D  L  LK++L + QN+
Subjt:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ

Query:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE
        MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E+FE
Subjt:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE

Query:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
                       ++  NEWL+K Q L +S+ATWES   + Q+F
Subjt:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

KAA0061073.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.3e-18737Show/hide
Query:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG
        E++EE  ++EE ++  +E + +++T+   + L ++     K TMKLKG + +KEVVVL+D G THNFIH  L  ++ L +   T++G  +G G   +GKG
Subjt:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG

Query:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE
          +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A ++++ +
Subjt:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE

Query:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------
         T K    +  DE     +  LL++Y+ +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                       
Subjt:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------

Query:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK
          GW             +S+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+PFL+
Subjt:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK

Query:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD
        R +LVFFDDILVYS D+   EKHL ++F VL+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFLGL GYYRR V+ 
Subjt:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD

Query:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------
        Y  I  PLTKLL K+ F W  +A  AF  LK  M TIP+LALPD+S P  IE DAS + LG V++                                   
Subjt:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------

Query:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL
                   SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDE LQ+IV  
Subjt:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL

Query:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI
        L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYVE+C  CQ+NK ++  P  +LQP PI
Subjt:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI

Query:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP
         +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST FHP
Subjt:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP

Query:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ
         TDG     N+CLETY+RC  NE+P + D+++PW + WYNT FH S  TT F   + R PPPL+ YG+++T N+ +E  LK  D  L  LK++L + QN+
Subjt:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ

Query:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE
        MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E+FE
Subjt:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE

Query:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
                       ++  NEWL+K Q L +S+ATWES   + Q+F
Subjt:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

TYJ99303.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-18637Show/hide
Query:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG
        E++EE  ++EE ++  +E + +++T+   + L ++     K TMKLKG + +KEVVVL+D G THNFIH  L   + L +   T +G  +G G   +GKG
Subjt:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG

Query:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE
          +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A +K++ +
Subjt:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE

Query:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------
         T K    +  DE     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                       
Subjt:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------

Query:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK
          GW             +S+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+PFL+
Subjt:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK

Query:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD
        R +LVFF DILVYS D+  HEKHL ++F VL+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFL L GYYRR V+ 
Subjt:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD

Query:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------
        Y  I  PLTKLL K+ F W  +A  AF  LK  M TIP+LALPD+S P  IE DAS + LG V++                                   
Subjt:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------

Query:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL
                   SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDE LQ+IV  
Subjt:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL

Query:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI
        L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYVE+C  CQ+NK ++  P  +LQP PI
Subjt:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI

Query:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP
         +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST FHP
Subjt:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP

Query:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ
         TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ YG+++ +N+ +E  LK  D  L  LK++L + QN+
Subjt:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ

Query:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE
        MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E+FE
Subjt:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE

Query:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
                       ++  NEWL+K Q L +S+ATWES   + Q+F
Subjt:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

TYK28905.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-18737.25Show/hide
Query:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ
        N   E+EE    EE  E  +E + +++T ++ + L ++     K TMKLKG + +KE+VVL+D G THNFIHQ L   + L L + T +G  +G G   +
Subjt:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ

Query:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI
        G+G  +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM F+    Q++LKG                     D  F  E+A +++
Subjt:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI

Query:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------
        + E   K    ++ D+     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                    
Subjt:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------

Query:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP
             GW             VS+KFPIP+I+EL DEL  AA+  KLDL+S                                LTNAPTT QSLMNQVF+P
Subjt:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP

Query:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL
        FL+R +LVFFDDILVYS D+  HEKHL ++F  L+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFLGL+GYYRR 
Subjt:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL

Query:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------
        VQ Y  I   LTKLL K+ F W  EA  AF  LK  M TIP+LALPD++ P  IE DAS + LG V++                                
Subjt:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------

Query:  --------------SIRAKMETLSF-------------------------------------EIERVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI
                      SI +  + L F                                      + RV P  + + +T   ++ +  +++EV+KDEELQQI
Subjt:  --------------SIRAKMETLSF-------------------------------------EIERVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI

Query:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP
        V  L+ + +   KF+    +L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYV++C  CQ+NK ++  P  +LQP
Subjt:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP

Query:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT
         PI +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI +V                              GT+LKRST 
Subjt:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT

Query:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP
        FHP TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ Y +R+T+N+ +E  LK  D  L  LK++L + 
Subjt:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP

Query:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME
        QN++KKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E
Subjt:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME

Query:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
        +FE               ++  NEWL+K Q+LP+SEATWES   + Q+F
Subjt:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

TrEMBL top hitse value%identityAlignment
A0A5A7T8C0 Ty3/gypsy retrotransposon protein1.1e-18737.08Show/hide
Query:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ
        N   EDEE    EE  E  +E + +++T +  + L ++     K TMKLKG + +KE+V+L+D G THNFIHQ L   + L + + T +G  +G G   +
Subjt:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ

Query:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI
        GKG  +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A +++
Subjt:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI

Query:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------
        + E T K    ++ DE     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                    
Subjt:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------

Query:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP
             GW             VS+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+P
Subjt:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP

Query:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL
        FL+R +LVFFDDIL YS D+  H+KHL ++F VL+DN L+AN KKCVF+ SRIQYLGH     GVEA+ DKI +M +W RP + T LRGFLGL GYYRR 
Subjt:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL

Query:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------
        V+ Y  I  PLTKLL K+ F W  EA  AF  LK  M TIP+L LPD++ P  IE DAS + LG V++                                
Subjt:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------

Query:  --------------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI
                      SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDEELQ+I
Subjt:  --------------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI

Query:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP
        V  L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYV++C  CQ+NK ++     +LQP
Subjt:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP

Query:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT
         PI +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST 
Subjt:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT

Query:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP
        FHP TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ YG+++T+N+ +E  LK  D  L  LK++L + 
Subjt:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP

Query:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME
        QN+MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E
Subjt:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME

Query:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
        +FE               ++  NEWL+K Q L +SEATWES   + Q+F
Subjt:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

A0A5A7UM77 Ty3/gypsy retrotransposon protein1.3e-18637Show/hide
Query:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG
        E++EE  ++EE ++  +E + +++T+   + L ++     K TMKLKG + +KEVVVL+D G THNFIH  L   + L +   T +G  +G G   +GKG
Subjt:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG

Query:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE
          +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A +K++ +
Subjt:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE

Query:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------
         T K    +  DE     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                       
Subjt:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------

Query:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK
          GW             +S+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+PFL+
Subjt:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK

Query:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD
        R +LVFF DILVYS D+  HEKHL ++F VL+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFL L GYYRR V+ 
Subjt:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD

Query:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------
        Y  I  PLTKLL K+ F W  +A  AF  LK  M TIP+LALPD+S P  IE DAS + LG V++                                   
Subjt:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------

Query:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL
                   SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDE LQ+IV  
Subjt:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL

Query:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI
        L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYVE+C  CQ+NK ++  P  +LQP PI
Subjt:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI

Query:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP
         +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST FHP
Subjt:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP

Query:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ
         TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ YG+++ +N+ +E  LK  D  L  LK++L + QN+
Subjt:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ

Query:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE
        MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E+FE
Subjt:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE

Query:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
                       ++  NEWL+K Q L +S+ATWES   + Q+F
Subjt:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

A0A5A7UYM1 Ty3/gypsy retrotransposon protein1.1e-18737Show/hide
Query:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG
        E++EE  ++EE ++  +E + +++T+   + L ++     K TMKLKG + +KEVVVL+D G THNFIH  L  ++ L +   T++G  +G G   +GKG
Subjt:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG

Query:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE
          +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A ++++ +
Subjt:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE

Query:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------
         T K    +  DE     +  LL++Y+ +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                       
Subjt:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------

Query:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK
          GW             +S+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+PFL+
Subjt:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK

Query:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD
        R +LVFFDDILVYS D+   EKHL ++F VL+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFLGL GYYRR V+ 
Subjt:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD

Query:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------
        Y  I  PLTKLL K+ F W  +A  AF  LK  M TIP+LALPD+S P  IE DAS + LG V++                                   
Subjt:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------

Query:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL
                   SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDE LQ+IV  
Subjt:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL

Query:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI
        L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYVE+C  CQ+NK ++  P  +LQP PI
Subjt:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI

Query:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP
         +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST FHP
Subjt:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP

Query:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ
         TDG     N+CLETY+RC  NE+P + D+++PW + WYNT FH S  TT F   + R PPPL+ YG+++T N+ +E  LK  D  L  LK++L + QN+
Subjt:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ

Query:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE
        MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E+FE
Subjt:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE

Query:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
                       ++  NEWL+K Q L +S+ATWES   + Q+F
Subjt:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

A0A5D3BJ50 Ty3/gypsy retrotransposon protein1.3e-18637Show/hide
Query:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG
        E++EE  ++EE ++  +E + +++T+   + L ++     K TMKLKG + +KEVVVL+D G THNFIH  L   + L +   T +G  +G G   +GKG
Subjt:  EDEEEIIAEEEEVQ--IETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKG

Query:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE
          +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM+F+    Q++LKG                     D  F  E+A +K++ +
Subjt:  GLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKIDDE

Query:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------
         T K    +  DE     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                       
Subjt:  GTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI-----------------------

Query:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK
          GW             +S+KFPIPVI+EL DEL  AA+  KLDL+S                                LTNAP T QSLMNQVF+PFL+
Subjt:  -RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRPFLK

Query:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD
        R +LVFF DILVYS D+  HEKHL ++F VL+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFL L GYYRR V+ 
Subjt:  RFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQD

Query:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------
        Y  I  PLTKLL K+ F W  +A  AF  LK  M TIP+LALPD+S P  IE DAS + LG V++                                   
Subjt:  YRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND--------------------------------

Query:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL
                   SI +  + L F +E                                     RV P  + + +T   ++ +  + +EV+KDE LQ+IV  
Subjt:  -----------SIRAKMETLSFEIE-------------------------------------RVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKL

Query:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI
        L+ + +   KF+   G+L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYVE+C  CQ+NK ++  P  +LQP PI
Subjt:  LQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPI

Query:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP
         +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI ++                              GT+LKRST FHP
Subjt:  LEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTFHP

Query:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ
         TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ YG+++ +N+ +E  LK  D  L  LK++L + QN+
Subjt:  HTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQ

Query:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE
        MKKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E+FE
Subjt:  MKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLMEDFE

Query:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
                       ++  NEWL+K Q L +S+ATWES   + Q+F
Subjt:  ---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

A0A5D3DZZ7 Ty3/gypsy retrotransposon protein6.6e-18837.25Show/hide
Query:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ
        N   E+EE    EE  E  +E + +++T ++ + L ++     K TMKLKG + +KE+VVL+D G THNFIHQ L   + L L + T +G  +G G   +
Subjt:  NTSVEDEEEIIAEE-EEVQIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQ

Query:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI
        G+G  +       E+TI+ DFL ++LG VD +LGM WL T G M + WPS TM F+    Q++LKG                     D  F  E+A +++
Subjt:  GKGGLQVSHNYSPELTILDDFLLLDLGRVDVILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKG---------------------DLAFKAEFAEIKI

Query:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------
        + E   K    ++ D+     +  LL++Y  +F TPK LP  R++ + ++    Q PI   PYKYGH QK E+EKL+ E+                    
Subjt:  DDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPI---PYKYGHSQKNEMEKLIKEI--------------------

Query:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP
             GW             VS+KFPIP+I+EL DEL  AA+  KLDL+S                                LTNAPTT QSLMNQVF+P
Subjt:  ----RGW-----------VRVSNKFPIPVIKELRDELNEAAIIPKLDLRS-------------------------------RLTNAPTTLQSLMNQVFRP

Query:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL
        FL+R +LVFFDDILVYS D+  HEKHL ++F  L+DNQL+AN KKCVF+ S+IQYLGH     GVEA+ DKI +M NW RP + T LRGFLGL+GYYRR 
Subjt:  FLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRL

Query:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------
        VQ Y  I   LTKLL K+ F W  EA  AF  LK  M TIP+LALPD++ P  IE DAS + LG V++                                
Subjt:  VQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNGND-----------------------------

Query:  --------------SIRAKMETLSF-------------------------------------EIERVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI
                      SI +  + L F                                      + RV P  + + +T   ++ +  +++EV+KDEELQQI
Subjt:  --------------SIRAKMETLSF-------------------------------------EIERVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQI

Query:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP
        V  L+ + +   KF+    +L YK                      V GGHSG+LRTYK + G+LYW+ MK D+KKYV++C  CQ+NK ++  P  +LQP
Subjt:  VKLLQVDLDGKPKFQLILGQLQYK----------------------VTGGHSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQP

Query:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT
         PI +++ +D TMDF+EGLPKA G + I VVVDRL+KYA+F+TL H ++AKQVA  FI +V                              GT+LKRST 
Subjt:  SPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTT

Query:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP
        FHP TDG     N+CLETY+RC  NE+P + D+++PW + WYNTTFH S  TT F   + R PPPL+ Y +R+T+N+ +E  LK  D  L  LK++L + 
Subjt:  FHPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTT-FSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIP

Query:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME
        QN++KKFAD  RR    K+G + + K                                                  H    +  +G +QVVQ Q P L E
Subjt:  QNQMKKFADSNRRRFSWKLGRKSFSKFD------------------------------------------------HTGSGQAMIGKKQVVQSQPPTLME

Query:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF
        +FE               ++  NEWL+K Q+LP+SEATWES   + Q+F
Subjt:  DFE---------------DISMNEWLVKRQNLPKSEATWESSDFIKQKF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.3e-3622.77Show/hide
Query:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET
        N +P+P+I++L  ++  + I  KLDL+S                                ++ AP   Q  +N +     +  ++ + DDIL++S     
Subjt:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET

Query:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM
        H KH+  +   LK+  L  NQ KC F QS+++++G+    +G     + I  +  W +P+N   LR FLG + Y R+ +    ++  PL  LL KD R+ 
Subjt:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM

Query:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------
        WT   + A E +K+ +V+ P+L   DFS  +++E DAS   +G V++                       N S+  K                       
Subjt:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------

Query:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH
                                      ++  +FEI               R+V    P+ K                          YT    +L  
Subjt:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH

Query:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS
        +    K VE++ +L+  + +   D    P    +   +  +Y   G   H G       ++ +  WK ++  +++YV+ C  CQ NK+ +  P   LQP 
Subjt:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS

Query:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF
        P  E+ W+ L+MDF+  LP++ GY+++ VVVDR SK A  V  +   TA+Q A +F Q V                                V+K S  +
Subjt:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF

Query:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN
         P TDG     N+ +E  +RC  +  P      +   +  YN   H +   T      R  P L    E  + +D  ++  + + ++ + +K+HL     
Subjt:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN

Query:  QMKKFAD
        +MKK+ D
Subjt:  QMKKFAD

P0CT35 Transposon Tf2-2 polyprotein2.3e-3622.77Show/hide
Query:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET
        N +P+P+I++L  ++  + I  KLDL+S                                ++ AP   Q  +N +     +  ++ + DDIL++S     
Subjt:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET

Query:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM
        H KH+  +   LK+  L  NQ KC F QS+++++G+    +G     + I  +  W +P+N   LR FLG + Y R+ +    ++  PL  LL KD R+ 
Subjt:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM

Query:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------
        WT   + A E +K+ +V+ P+L   DFS  +++E DAS   +G V++                       N S+  K                       
Subjt:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------

Query:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH
                                      ++  +FEI               R+V    P+ K                          YT    +L  
Subjt:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH

Query:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS
        +    K VE++ +L+  + +   D    P    +   +  +Y   G   H G       ++ +  WK ++  +++YV+ C  CQ NK+ +  P   LQP 
Subjt:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS

Query:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF
        P  E+ W+ L+MDF+  LP++ GY+++ VVVDR SK A  V  +   TA+Q A +F Q V                                V+K S  +
Subjt:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF

Query:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN
         P TDG     N+ +E  +RC  +  P      +   +  YN   H +   T      R  P L    E  + +D  ++  + + ++ + +K+HL     
Subjt:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN

Query:  QMKKFAD
        +MKK+ D
Subjt:  QMKKFAD

P0CT36 Transposon Tf2-3 polyprotein2.3e-3622.77Show/hide
Query:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET
        N +P+P+I++L  ++  + I  KLDL+S                                ++ AP   Q  +N +     +  ++ + DDIL++S     
Subjt:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET

Query:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM
        H KH+  +   LK+  L  NQ KC F QS+++++G+    +G     + I  +  W +P+N   LR FLG + Y R+ +    ++  PL  LL KD R+ 
Subjt:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM

Query:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------
        WT   + A E +K+ +V+ P+L   DFS  +++E DAS   +G V++                       N S+  K                       
Subjt:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------

Query:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH
                                      ++  +FEI               R+V    P+ K                          YT    +L  
Subjt:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH

Query:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS
        +    K VE++ +L+  + +   D    P    +   +  +Y   G   H G       ++ +  WK ++  +++YV+ C  CQ NK+ +  P   LQP 
Subjt:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS

Query:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF
        P  E+ W+ L+MDF+  LP++ GY+++ VVVDR SK A  V  +   TA+Q A +F Q V                                V+K S  +
Subjt:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF

Query:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN
         P TDG     N+ +E  +RC  +  P      +   +  YN   H +   T      R  P L    E  + +D  ++  + + ++ + +K+HL     
Subjt:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN

Query:  QMKKFAD
        +MKK+ D
Subjt:  QMKKFAD

P0CT37 Transposon Tf2-4 polyprotein2.3e-3622.77Show/hide
Query:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET
        N +P+P+I++L  ++  + I  KLDL+S                                ++ AP   Q  +N +     +  ++ + DDIL++S     
Subjt:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET

Query:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM
        H KH+  +   LK+  L  NQ KC F QS+++++G+    +G     + I  +  W +P+N   LR FLG + Y R+ +    ++  PL  LL KD R+ 
Subjt:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM

Query:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------
        WT   + A E +K+ +V+ P+L   DFS  +++E DAS   +G V++                       N S+  K                       
Subjt:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------

Query:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH
                                      ++  +FEI               R+V    P+ K                          YT    +L  
Subjt:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH

Query:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS
        +    K VE++ +L+  + +   D    P    +   +  +Y   G   H G       ++ +  WK ++  +++YV+ C  CQ NK+ +  P   LQP 
Subjt:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS

Query:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF
        P  E+ W+ L+MDF+  LP++ GY+++ VVVDR SK A  V  +   TA+Q A +F Q V                                V+K S  +
Subjt:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF

Query:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN
         P TDG     N+ +E  +RC  +  P      +   +  YN   H +   T      R  P L    E  + +D  ++  + + ++ + +K+HL     
Subjt:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN

Query:  QMKKFAD
        +MKK+ D
Subjt:  QMKKFAD

P0CT41 Transposon Tf2-12 polyprotein2.3e-3622.77Show/hide
Query:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET
        N +P+P+I++L  ++  + I  KLDL+S                                ++ AP   Q  +N +     +  ++ + DDIL++S     
Subjt:  NKFPIPVIKELRDELNEAAIIPKLDLRSR-------------------------------LTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMET

Query:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM
        H KH+  +   LK+  L  NQ KC F QS+++++G+    +G     + I  +  W +P+N   LR FLG + Y R+ +    ++  PL  LL KD R+ 
Subjt:  HEKHLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGHW---EGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKD-RFM

Query:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------
        WT   + A E +K+ +V+ P+L   DFS  +++E DAS   +G V++                       N S+  K                       
Subjt:  WTYEASIAFEALKKDMVTIPLLALPDFSSPLIIEIDASRTDLGVVVNG----------------------NDSIRAK-----------------------

Query:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH
                                      ++  +FEI               R+V    P+ K                          YT    +L  
Subjt:  ------------------------------METLSFEI--------------ERVV----PMAK-------------------------AYTLTIPILLH

Query:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS
        +    K VE++ +L+  + +   D    P    +   +  +Y   G   H G       ++ +  WK ++  +++YV+ C  CQ NK+ +  P   LQP 
Subjt:  VEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQL--QYKVTGG--HSGYLRTYKHLIGKLYWKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPS

Query:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF
        P  E+ W+ L+MDF+  LP++ GY+++ VVVDR SK A  V  +   TA+Q A +F Q V                                V+K S  +
Subjt:  PILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEV------------------------------GTVLKRSTTF

Query:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN
         P TDG     N+ +E  +RC  +  P      +   +  YN   H +   T      R  P L    E  + +D  ++  + + ++ + +K+HL     
Subjt:  HPHTDG-----NRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQN

Query:  QMKKFAD
        +MKK+ D
Subjt:  QMKKFAD

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein2.5e-0926.92Show/hide
Query:  SDAMTNTSVEDEEEIIAEEEEV-----QIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGM
        ++ M+ T V+ + +++ +++ V     ++E ++  +   ME     V+       M+  G + + +VVV +D G T NFI  +L   + LP S      +
Subjt:  SDAMTNTSVEDEEEIIAEEEEV-----QIETEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGM

Query:  VVGYGVAIQGKGGLQVSHNYSPELTILDDFLLLDLGR--VDVILGMTWLCTMGFMGVRWPSSTMTFF---QWDSQVMLKGDLAFKAEFAEIKIDDEGTKK
        ++G    IQ  G       +  E+ I ++FLLLDL +  VDVILG  WL  +G   V W +   +F    QW         +   AE  E++      K 
Subjt:  VVGYGVAIQGKGGLQVSHNYSPELTILDDFLLLDLGR--VDVILGMTWLCTMGFMGVRWPSSTMTFF---QWDSQVMLKGDLAFKAEFAEIKIDDEGTKK

Query:  QSLLQRED
        +S  ++ED
Subjt:  QSLLQRED

AT3G30770.1 Eukaryotic aspartyl protease family protein8.7e-0726.43Show/hide
Query:  SVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKGGLQVSHNYSPELTILDDFLLLDLGR--VDVILG
        S   F     M+  G ++  +VVV++D G T+NFI  +L   + LP S      +++G    IQ  G     +    E+ I ++FLLLDL +  VDVILG
Subjt:  SVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKGGLQVSHNYSPELTILDDFLLLDLGR--VDVILG

Query:  MTWLCTMGFMGVRWPSSTMTFFQWDSQVML-KGDLAFKAEFAEIKIDDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEG
              +    + W +   +FF     V L   D   +    ++K+  E  +++     ED+ + KG  + +  + G+    K +P  +++    +  E 
Subjt:  MTWLCTMGFMGVRWPSSTMTFFQWDSQVML-KGDLAFKAEFAEIKIDDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEG

Query:  QGPIPYKY---GHSQKNEMEKLIKEIR
           I +K    G S+KN  E ++K  R
Subjt:  QGPIPYKY---GHSQKNEMEKLIKEIR

ATMG00860.1 DNA/RNA polymerases superfamily protein7.8e-3249.62Show/hide
Query:  HLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGH-----WEGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKDRFMWT
        HL ++  + + +Q +AN+KKC F Q +I YLGH      EGV A+  K+ AM  W  P+NTT LRGFLGL GYYRR V++Y KIV PLT+LL K+   WT
Subjt:  HLNIMFHVLKDNQLFANQKKCVFSQSRIQYLGH-----WEGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKDRFMWT

Query:  YEASIAFEALKKDMVTIPLLALPDFSSPLIIEI
          A++AF+ALK  + T+P+LALPD   P +  +
Subjt:  YEASIAFEALKKDMVTIPLLALPDFSSPLIIEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTATAGGGGACCCCCACTAGTTGACTCCCCGACCAAGTATTGTTGTACTCTTTGCAAACAAGCTTTCTTGCAATTGCCAGTCCTCAATGCTTTATTGCTACATCA
TACCCCGAGTTTGCCCGCGACCCACGGTTTTTGTACCGCTGACTTGCTCGCCGGGTTAGAGCAGTGCCGCACAATCGCCTGTCCCGTATCCGAAAGCAGCTGTCGAATAG
AAAACAGAGAGTGCAAAGGGAGAAGGGTCTTTACTTCTGATGCGATGACAAATACTTCCGTTGAAGACGAAGAGGAAATAATCGCGGAAGAGGAAGAAGTTCAAATTGAG
ACTGAGGCAATTAAAATCACGAATAACATGGAGCTAGCGTTACATTCAGTGCTGGGTTTTTTGCACAAAGCAACGATGAAGCTAAAAGGAAGGGTAGCCGAGAAGGAAGT
AGTTGTCCTAATGGATTGTGGAGATACTCATAATTTCATACATCAAAAGTTAGTAGACAAAGTGAATTTACCCTTGTCTGAAGCAACTAATTATGGTATGGTGGTAGGCT
ATGGAGTGGCAATTCAGGGCAAAGGGGGTTTACAAGTCAGTCACAATTACTCTCCAGAGCTGACGATCTTGGATGATTTCCTTCTGCTTGATTTAGGAAGGGTGGACGTG
ATCCTTGGAATGACGTGGTTGTGTACCATGGGATTCATGGGCGTTCGCTGGCCCTCGTCGACGATGACTTTTTTCCAATGGGACTCCCAAGTAATGCTAAAAGGTGACCT
CGCTTTTAAGGCAGAGTTTGCCGAGATAAAAATCGATGATGAAGGAACTAAGAAGCAGAGTCTGTTACAAAGGGAAGATGAACCTCTACACAAGGGAGTCCTTGAATTGC
TTGAAGAGTATCAAGGTTTGTTTGAGACACCCAAGAGCCTGCCGTCAACAAGGGAAGTTGGTAACCATATGATTCTTGCGGAAGGGCAAGGTCCCATACCCTATAAGTAT
GGCCATTCTCAGAAGAACGAGATGGAGAAATTAATCAAGGAGATAAGGGGGTGGGTAAGGGTTTCCAACAAATTTCCAATTCCGGTTATTAAGGAGCTACGAGATGAGCT
AAATGAGGCGGCCATAATTCCAAAGTTGGATCTTCGGTCAAGGCTAACAAATGCACCAACCACTTTGCAATCTCTCATGAACCAAGTGTTCAGACCATTCTTAAAGAGAT
TCATCTTAGTCTTCTTCGATGACATCTTAGTCTACAGCCTGGACATGGAAACTCATGAGAAGCATTTGAATATCATGTTTCACGTATTGAAGGACAATCAACTCTTTGCC
AATCAAAAGAAATGTGTGTTTAGCCAGTCTAGAATTCAATACTTGGGCCACTGGGAAGGAGTGGAGGCTGAGAATGACAAAATATGTGCTATGGCCAATTGGTCGCGCCC
AAGAAACACGACAGGATTAAGGGGATTCCTTGGCCTCATGGGCTACTACCGAAGACTTGTCCAAGATTATAGGAAAATTGTTGCTCCCCTCACGAAACTTCTGCACAAAG
ACAGGTTCATGTGGACGTACGAAGCATCAATAGCCTTTGAAGCCTTAAAGAAAGATATGGTTACAATTCCACTATTGGCTCTTCCAGATTTTTCATCTCCTCTTATTATA
GAGATTGATGCATCTAGAACGGATTTAGGGGTTGTAGTTAATGGCAACGATTCTATTAGGGCAAAAATGGAGACACTGTCCTTTGAGATAGAAAGGGTCGTGCCAATGGC
CAAAGCATACACTCTCACCATCCCTATCTTGTTGCATGTGGAAGAAATTCAGAAAGAAGTGGAAAAGGATGAAGAATTACAACAGATTGTGAAGCTGTTACAAGTTGACC
TAGATGGAAAACCAAAATTTCAGCTCATCCTAGGGCAATTACAATACAAAGTCACTGGCGGGCATTCGGGATATCTACGAACATATAAACACTTGATAGGGAAGTTGTAT
TGGAAACAAATGAAGGCTGATGTCAAGAAATATGTGGAAGAATGTATGGCGTGCCAACAGAATAAGACAGATTCTCCCACTCCCACCAGACTTCTGCAACCTTCACCTAT
TCTTGAGCAAGTATGGGATGATTTAACCATGGATTTTGTAGAAGGGTTACCGAAGGCAAGAGGATATGACTCGATTACGGTAGTGGTTGATCGACTAAGCAAGTATGCTC
ATTTCGTTACTTTGTCTCATTTGTATACTGCAAAGCAAGTGGCTGAGATTTTCATCCAGGAAGTTGGGACTGTGTTAAAAAGGAGTACGACTTTCCACCCTCACACGGAT
GGCAATCGTTGCTTGGAAACCTACGTAAGGTGTTCTTACAATGAGAAGCCAAAGCAACGGGACAAGTGGTTACCATGGACCAAACATTGGTATAACACCACATTTCACAT
CTCTATCAATACTACTTTTTCAATGTTTTTCAAAAGGAAACCTCCTCCTTTGATATTTTATGGTGAGAGGGAAACGAGTAATGATGCGATAGAGCAAACGCTGAAGGCAA
GTGATGAAATATTGAAAAGACTCAAGAAACACTTGTTGATTCCCCAAAACCAAATGAAGAAGTTTGCAGACTCAAATCGTAGGAGGTTCAGTTGGAAGTTGGGGAGGAAG
TCTTTCTCAAAATTCGACCATACAGGCAGTGGCCAAGCAATGATTGGGAAGAAACAGGTAGTTCAAAGCCAACCTCCAACTTTGATGGAGGATTTCGAGGATATCTCAAT
GAATGAGTGGTTGGTGAAACGGCAGAATCTTCCCAAGAGTGAAGCTACGTGGGAGTCCAGTGACTTTATAAAGCAGAAATTCCAGGAGAGAAGGATAGTGGAAGGAGAAT
GGGAAGCGGTGAGGCCGTTGGGACCACCTGGGGCCCATGAGGTGGGTTTAAGAAAGGGGTTTACTGTGAGGAGGGGGCATCTTGCTTTTGATGAATTTTTGAAGCTGAAG
GCTAAAGAAGGTGGAGAGAATTCCAGCTCTCTCTCGAATGTGCTGGGGATTGTTCTTATTTTGAGTCCTTTGTTTATTTTATTGCTTTCCTTGAGATTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTACTATAGGGGACCCCCACTAGTTGACTCCCCGACCAAGTATTGTTGTACTCTTTGCAAACAAGCTTTCTTGCAATTGCCAGTCCTCAATGCTTTATTGCTACATCA
TACCCCGAGTTTGCCCGCGACCCACGGTTTTTGTACCGCTGACTTGCTCGCCGGGTTAGAGCAGTGCCGCACAATCGCCTGTCCCGTATCCGAAAGCAGCTGTCGAATAG
AAAACAGAGAGTGCAAAGGGAGAAGGGTCTTTACTTCTGATGCGATGACAAATACTTCCGTTGAAGACGAAGAGGAAATAATCGCGGAAGAGGAAGAAGTTCAAATTGAG
ACTGAGGCAATTAAAATCACGAATAACATGGAGCTAGCGTTACATTCAGTGCTGGGTTTTTTGCACAAAGCAACGATGAAGCTAAAAGGAAGGGTAGCCGAGAAGGAAGT
AGTTGTCCTAATGGATTGTGGAGATACTCATAATTTCATACATCAAAAGTTAGTAGACAAAGTGAATTTACCCTTGTCTGAAGCAACTAATTATGGTATGGTGGTAGGCT
ATGGAGTGGCAATTCAGGGCAAAGGGGGTTTACAAGTCAGTCACAATTACTCTCCAGAGCTGACGATCTTGGATGATTTCCTTCTGCTTGATTTAGGAAGGGTGGACGTG
ATCCTTGGAATGACGTGGTTGTGTACCATGGGATTCATGGGCGTTCGCTGGCCCTCGTCGACGATGACTTTTTTCCAATGGGACTCCCAAGTAATGCTAAAAGGTGACCT
CGCTTTTAAGGCAGAGTTTGCCGAGATAAAAATCGATGATGAAGGAACTAAGAAGCAGAGTCTGTTACAAAGGGAAGATGAACCTCTACACAAGGGAGTCCTTGAATTGC
TTGAAGAGTATCAAGGTTTGTTTGAGACACCCAAGAGCCTGCCGTCAACAAGGGAAGTTGGTAACCATATGATTCTTGCGGAAGGGCAAGGTCCCATACCCTATAAGTAT
GGCCATTCTCAGAAGAACGAGATGGAGAAATTAATCAAGGAGATAAGGGGGTGGGTAAGGGTTTCCAACAAATTTCCAATTCCGGTTATTAAGGAGCTACGAGATGAGCT
AAATGAGGCGGCCATAATTCCAAAGTTGGATCTTCGGTCAAGGCTAACAAATGCACCAACCACTTTGCAATCTCTCATGAACCAAGTGTTCAGACCATTCTTAAAGAGAT
TCATCTTAGTCTTCTTCGATGACATCTTAGTCTACAGCCTGGACATGGAAACTCATGAGAAGCATTTGAATATCATGTTTCACGTATTGAAGGACAATCAACTCTTTGCC
AATCAAAAGAAATGTGTGTTTAGCCAGTCTAGAATTCAATACTTGGGCCACTGGGAAGGAGTGGAGGCTGAGAATGACAAAATATGTGCTATGGCCAATTGGTCGCGCCC
AAGAAACACGACAGGATTAAGGGGATTCCTTGGCCTCATGGGCTACTACCGAAGACTTGTCCAAGATTATAGGAAAATTGTTGCTCCCCTCACGAAACTTCTGCACAAAG
ACAGGTTCATGTGGACGTACGAAGCATCAATAGCCTTTGAAGCCTTAAAGAAAGATATGGTTACAATTCCACTATTGGCTCTTCCAGATTTTTCATCTCCTCTTATTATA
GAGATTGATGCATCTAGAACGGATTTAGGGGTTGTAGTTAATGGCAACGATTCTATTAGGGCAAAAATGGAGACACTGTCCTTTGAGATAGAAAGGGTCGTGCCAATGGC
CAAAGCATACACTCTCACCATCCCTATCTTGTTGCATGTGGAAGAAATTCAGAAAGAAGTGGAAAAGGATGAAGAATTACAACAGATTGTGAAGCTGTTACAAGTTGACC
TAGATGGAAAACCAAAATTTCAGCTCATCCTAGGGCAATTACAATACAAAGTCACTGGCGGGCATTCGGGATATCTACGAACATATAAACACTTGATAGGGAAGTTGTAT
TGGAAACAAATGAAGGCTGATGTCAAGAAATATGTGGAAGAATGTATGGCGTGCCAACAGAATAAGACAGATTCTCCCACTCCCACCAGACTTCTGCAACCTTCACCTAT
TCTTGAGCAAGTATGGGATGATTTAACCATGGATTTTGTAGAAGGGTTACCGAAGGCAAGAGGATATGACTCGATTACGGTAGTGGTTGATCGACTAAGCAAGTATGCTC
ATTTCGTTACTTTGTCTCATTTGTATACTGCAAAGCAAGTGGCTGAGATTTTCATCCAGGAAGTTGGGACTGTGTTAAAAAGGAGTACGACTTTCCACCCTCACACGGAT
GGCAATCGTTGCTTGGAAACCTACGTAAGGTGTTCTTACAATGAGAAGCCAAAGCAACGGGACAAGTGGTTACCATGGACCAAACATTGGTATAACACCACATTTCACAT
CTCTATCAATACTACTTTTTCAATGTTTTTCAAAAGGAAACCTCCTCCTTTGATATTTTATGGTGAGAGGGAAACGAGTAATGATGCGATAGAGCAAACGCTGAAGGCAA
GTGATGAAATATTGAAAAGACTCAAGAAACACTTGTTGATTCCCCAAAACCAAATGAAGAAGTTTGCAGACTCAAATCGTAGGAGGTTCAGTTGGAAGTTGGGGAGGAAG
TCTTTCTCAAAATTCGACCATACAGGCAGTGGCCAAGCAATGATTGGGAAGAAACAGGTAGTTCAAAGCCAACCTCCAACTTTGATGGAGGATTTCGAGGATATCTCAAT
GAATGAGTGGTTGGTGAAACGGCAGAATCTTCCCAAGAGTGAAGCTACGTGGGAGTCCAGTGACTTTATAAAGCAGAAATTCCAGGAGAGAAGGATAGTGGAAGGAGAAT
GGGAAGCGGTGAGGCCGTTGGGACCACCTGGGGCCCATGAGGTGGGTTTAAGAAAGGGGTTTACTGTGAGGAGGGGGCATCTTGCTTTTGATGAATTTTTGAAGCTGAAG
GCTAAAGAAGGTGGAGAGAATTCCAGCTCTCTCTCGAATGTGCTGGGGATTGTTCTTATTTTGAGTCCTTTGTTTATTTTATTGCTTTCCTTGAGATTGTAG
Protein sequenceShow/hide protein sequence
MYYRGPPLVDSPTKYCCTLCKQAFLQLPVLNALLLHHTPSLPATHGFCTADLLAGLEQCRTIACPVSESSCRIENRECKGRRVFTSDAMTNTSVEDEEEIIAEEEEVQIE
TEAIKITNNMELALHSVLGFLHKATMKLKGRVAEKEVVVLMDCGDTHNFIHQKLVDKVNLPLSEATNYGMVVGYGVAIQGKGGLQVSHNYSPELTILDDFLLLDLGRVDV
ILGMTWLCTMGFMGVRWPSSTMTFFQWDSQVMLKGDLAFKAEFAEIKIDDEGTKKQSLLQREDEPLHKGVLELLEEYQGLFETPKSLPSTREVGNHMILAEGQGPIPYKY
GHSQKNEMEKLIKEIRGWVRVSNKFPIPVIKELRDELNEAAIIPKLDLRSRLTNAPTTLQSLMNQVFRPFLKRFILVFFDDILVYSLDMETHEKHLNIMFHVLKDNQLFA
NQKKCVFSQSRIQYLGHWEGVEAENDKICAMANWSRPRNTTGLRGFLGLMGYYRRLVQDYRKIVAPLTKLLHKDRFMWTYEASIAFEALKKDMVTIPLLALPDFSSPLII
EIDASRTDLGVVVNGNDSIRAKMETLSFEIERVVPMAKAYTLTIPILLHVEEIQKEVEKDEELQQIVKLLQVDLDGKPKFQLILGQLQYKVTGGHSGYLRTYKHLIGKLY
WKQMKADVKKYVEECMACQQNKTDSPTPTRLLQPSPILEQVWDDLTMDFVEGLPKARGYDSITVVVDRLSKYAHFVTLSHLYTAKQVAEIFIQEVGTVLKRSTTFHPHTD
GNRCLETYVRCSYNEKPKQRDKWLPWTKHWYNTTFHISINTTFSMFFKRKPPPLIFYGERETSNDAIEQTLKASDEILKRLKKHLLIPQNQMKKFADSNRRRFSWKLGRK
SFSKFDHTGSGQAMIGKKQVVQSQPPTLMEDFEDISMNEWLVKRQNLPKSEATWESSDFIKQKFQERRIVEGEWEAVRPLGPPGAHEVGLRKGFTVRRGHLAFDEFLKLK
AKEGGENSSSLSNVLGIVLILSPLFILLLSLRL