; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G33150 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G33150
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:28025513..28029031
RNA-Seq ExpressionCSPI01G33150
SyntenyCSPI01G33150
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-27848.78Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N E+G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N + +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

TYK21035.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-27748.43Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N ++G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N   +E+ Q++  + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V  
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ------------------------EESDEEQRVKSDE---------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                                E++D +   ++ +               D+F  P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ------------------------EESDEEQRVKSDE---------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

TYK27058.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-27848.78Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N E+G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N + +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

TYK27963.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]9.3e-27848.69Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N ++G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N   +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-27848.78Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N E+G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N + +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

TrEMBL top hitse value%identityAlignment
A0A5D3BBH7 Ty3/gypsy retrotransposon protein9.0e-27948.78Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N E+G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N + +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

A0A5D3DU86 Ty3/gypsy retrotransposon protein9.0e-27948.78Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N E+G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N + +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

A0A5D3DWA9 Ty3/gypsy retrotransposon protein4.5e-27848.69Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N ++G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N   +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

A0A5D3DZK6 Ty3/gypsy retrotransposon protein9.0e-27948.78Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N E+G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N + +E+ Q+   + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V+ 
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                              E  D + +  S   DE              D+F+ P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ----------------------EESDEEQRVKS---DE--------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

A0A5D3E325 Ty3/gypsy retrotransposon protein7.7e-27848.43Show/hide
Query:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK
        + LKEM+LEMKKSM+R+ ++LR+  S K +EE GTSDGS+MK+K K EETD T E +   ID SKYKKLEM MFL ENP+SWVYRAEHFFEINNLP+A+K
Subjt:  MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKK

Query:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S
        V VAVVSFGQDEVDWYRWSHNR+KVES E +K                         G   D V+            A+S  R                S
Subjt:  VMVAVVSFGQDEVDWYRWSHNRKKVES-ECIK-------------------------GCVPDLVR------------ADSPSR----------------S

Query:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------
          PQ+     M                                    N ++G+ +K +          +   +K EPPVK LSDAEFRARLDK       
Subjt:  GKPQSTNSGRMR-----------------------------------NGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK-------

Query:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG
                                 ES EE   +E  N   +E+ Q++  + T IE +AI  LT+KGTMK++G +KG+EV+VLIDS ATHNFIH+++V  
Subjt:  -------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEG

Query:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------
          + + + T FG+T GDGT C+G G+C ++E++L+ + +V D L + +G +D++LGMQWLDTTGTMKIHWPSLTM F    N++  + D           
Subjt:  MGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRD-----------

Query:  ------------------------EESDEEQRVKSDE---------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT
                                E++D +   ++ +               D+F  P  LPPKR +DHRIL LPGQKPIN+RPYKYGH QKEEIEKLV 
Subjt:  ------------------------EESDEEQRVKSDE---------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT

Query:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---
        +MLQTGI RP+HSP+SSP+L+ +KKDGGWRFCVDYRKLN++T++DKFPIP+IEELLDELHGATVFSKLDLKSG  +Q+ + ++     IE+     H   
Subjt:  KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH---

Query:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK
                        F  L +   +  +  C            +D+TEHEKHLGMVFA +RDNQ +AN+KKCV  HSQI YLGH+IS  GVEA+++K+K
Subjt:  ------------GATVFSKLHSGPIRDTMSSC------------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIK

Query:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP
        +M                                                                         +LPF+IETDASG  L AVL QN HP
Subjt:  NM-------------------------------------------------------------------------NLPFIIETDASGIALRAVLPQNGHP

Query:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV
        IAFFSQKLS R Q KSIYEREL+AVVLSVQKWR+YLLGR FTI+S+Q+AL FLLEQREVQPQFQKWLTKLLGY FEILYQP LQNKAA+ALSR++  +E+
Subjt:  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEV

Query:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD
        +++ TTGIV+MEV+ KEV+ DE+L+ +I++L+ NP    K+   NG L+YK R+VLSK SS+IP+LLHTFHDSIL GHSGFLRTYKRM GEL+WKGMK D
Subjt:  RSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTD

Query:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        +KKYVEQCE+CQRNK EATKPAGVLQP+PIP+RILE+W+MDFIEGL
Subjt:  VKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

SwissProt top hitse value%identityAlignment
P0CT42 Transposon Tf2-7 polyprotein1.9e-3924.03Show/hide
Query:  LDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLV--LIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTR
        L KESL ++  T  S      L++  + +      K ++    +  M ++ E+   +  +  LID+ A  N I  + V    L     +   +  G    
Subjt:  LDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLV--LIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTR

Query:  CQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFS--------MGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLP
           R   K L + L  I+I  +FL ++  +    +      T     I   S   T S        + E  LP    E  D      + E L +  KGL 
Subjt:  CQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFS--------MGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLP

Query:  PKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGA
         + E+      LP      +R Y     + + +   + + L++GI R + +  + P++   KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+
Subjt:  PKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGA

Query:  TVFSKLDLKSG-KLNQVTVSDKFPIPV-----IEELLDELHGATV-------FSKLHSGPIRDTMSSC---------TDLTEHEKHLGMVFAVMRDNQFF
        T+F+KLDLKS   L +V   D+  +       + E L   +G ++       F     G ++++   C            +EH KH+  V   +++    
Subjt:  TVFSKLDLKSG-KLNQVTVSDKFPIPV-----IEELLDELHGATV-------FSKLHSGPIRDTMSSC---------TDLTEHEKHLGMVFAVMRDNQFF

Query:  ANKKKCVMPHSQIQYLGHLISSKG---VEANEEKIKNMNLP-----------------------------------------------------------
         N+ KC    SQ++++G+ IS KG    + N +K+     P                                                           
Subjt:  ANKKKCVMPHSQIQYLGHLISSKG---VEANEEKIKNMNLP-----------------------------------------------------------

Query:  ------------FIIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLG--RNFTIVSNQRALIFLL--EQR
                     ++ETDAS +A+ AVL Q       +P+ ++S K+S      S+ ++E++A++ S++ WR+YL      F I+++ R LI  +  E  
Subjt:  ------------FIIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLG--RNFTIVSNQRALIFLL--EQR

Query:  EVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWI
            +  +W   L  + FEI Y+P   N  A+ALSRI    E             V  +  T     +V+  E   D  L  ++    K  +E+   Q  
Subjt:  EVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWI

Query:  NGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFI
        +G L+  K +I+L   + L  T++  +H+     H G       +     WKG++  +++YV+ C  CQ NK    KP G LQPIP  ER  E  SMDFI
Subjt:  NGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFI

Query:  EGL
          L
Subjt:  EGL

P0CT43 Transposon Tf2-8 polyprotein1.9e-3924.03Show/hide
Query:  LDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLV--LIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTR
        L KESL ++  T  S      L++  + +      K ++    +  M ++ E+   +  +  LID+ A  N I  + V    L     +   +  G    
Subjt:  LDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLV--LIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTR

Query:  CQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFS--------MGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLP
           R   K L + L  I+I  +FL ++  +    +      T     I   S   T S        + E  LP    E  D      + E L +  KGL 
Subjt:  CQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFS--------MGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLP

Query:  PKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGA
         + E+      LP      +R Y     + + +   + + L++GI R + +  + P++   KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+
Subjt:  PKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGA

Query:  TVFSKLDLKSG-KLNQVTVSDKFPIPV-----IEELLDELHGATV-------FSKLHSGPIRDTMSSC---------TDLTEHEKHLGMVFAVMRDNQFF
        T+F+KLDLKS   L +V   D+  +       + E L   +G ++       F     G ++++   C            +EH KH+  V   +++    
Subjt:  TVFSKLDLKSG-KLNQVTVSDKFPIPV-----IEELLDELHGATV-------FSKLHSGPIRDTMSSC---------TDLTEHEKHLGMVFAVMRDNQFF

Query:  ANKKKCVMPHSQIQYLGHLISSKG---VEANEEKIKNMNLP-----------------------------------------------------------
         N+ KC    SQ++++G+ IS KG    + N +K+     P                                                           
Subjt:  ANKKKCVMPHSQIQYLGHLISSKG---VEANEEKIKNMNLP-----------------------------------------------------------

Query:  ------------FIIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLG--RNFTIVSNQRALIFLL--EQR
                     ++ETDAS +A+ AVL Q       +P+ ++S K+S      S+ ++E++A++ S++ WR+YL      F I+++ R LI  +  E  
Subjt:  ------------FIIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLG--RNFTIVSNQRALIFLL--EQR

Query:  EVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWI
            +  +W   L  + FEI Y+P   N  A+ALSRI    E             V  +  T     +V+  E   D  L  ++    K  +E+   Q  
Subjt:  EVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWI

Query:  NGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFI
        +G L+  K +I+L   + L  T++  +H+     H G       +     WKG++  +++YV+ C  CQ NK    KP G LQPIP  ER  E  SMDFI
Subjt:  NGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFI

Query:  EGL
          L
Subjt:  EGL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.2e-4324.13Show/hide
Query:  VDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFS
        V H I + PG +   ++PY      ++EI K+V K+L      P+ SP SSP+++  KKDG +R CVDYR LN+ T+SD FP+P I+ LL  +  A +F+
Subjt:  VDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFS

Query:  KLDLKSG------------KLNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIR----------DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKK
         LDL SG            K   VT S K+   V+   L  ++  + F++  +   R          D +       EH KHL  V   +++      KK
Subjt:  KLDLKSG------------KLNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIR----------DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKK

Query:  KCVMPHSQIQYLGHLI---------------------------------------------------------SSKGVEANEEKIKNM-----NLPFII-
        KC     + ++LG+ I                                                          S+  E  ++ I+ +     N P ++ 
Subjt:  KCVMPHSQIQYLGHLI---------------------------------------------------------SSKGVEANEEKIKNM-----NLPFII-

Query:  ---------ETDASGIALRAVLPQNGHP------IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKW
                  TDAS   + AVL +  +       + +FS+ L    +     E EL+ ++ ++  +RY L G++FT+ ++  +L+ L  + E   + Q+W
Subjt:  ---------ETDASGIALRAVLPQNGHP------IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKW

Query:  LTKLLGYYFEILYQPRLQNKAANALSR--------IEQPLEVRS---------MCTTGIVNMEVIEKEVKLDEDL---KRIIEELKKNPDESSKFQWING
        L  L  Y F + Y    +N  A+A+SR          +P++  S         +C+  +++M+ + +     ED+   +   ++L+ +      +   + 
Subjt:  LTKLLGYYFEILYQPRLQNKAANALSR--------IEQPLEVRS---------MCTTGIVNMEVIEKEVKLDEDL---KRIIEELKKNPDESSKFQWING

Query:  NLLYKKRIVLSKRSSLIPTLLHTFHD-SILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEG
         + Y+ R+V+  +      ++  +HD ++  GH G   T  ++    YW  ++  + +Y+  C  CQ  K    +  G+LQP+PI E    + SMDF+ G
Subjt:  NLLYKKRIVLSKRSSLIPTLLHTFHD-SILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEG

Query:  L
        L
Subjt:  L

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.4e-4224.13Show/hide
Query:  VDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFS
        V H I + PG +   ++PY      ++EI K+V K+L      P+ SP SSP+++  KKDG +R CVDYR LN+ T+SD FP+P I+ LL  +  A +F+
Subjt:  VDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFS

Query:  KLDLKSG------------KLNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIR----------DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKK
         LDL SG            K   VT S K+   V+   L  ++  + F++  +   R          D +       EH KHL  V   +++      KK
Subjt:  KLDLKSG------------KLNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIR----------DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKK

Query:  KCVMPHSQIQYLGHLI---------------------------------------------------------SSKGVEANEEKIKNM-----NLPFII-
        KC     + ++LG+ I                                                          S+  E  ++ I  +     N P ++ 
Subjt:  KCVMPHSQIQYLGHLI---------------------------------------------------------SSKGVEANEEKIKNM-----NLPFII-

Query:  ---------ETDASGIALRAVLPQNGHP------IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKW
                  TDAS   + AVL +  +       + +FS+ L    +     E EL+ ++ ++  +RY L G++FT+ ++  +L+ L  + E   + Q+W
Subjt:  ---------ETDASGIALRAVLPQNGHP------IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKW

Query:  LTKLLGYYFEILYQPRLQNKAANALSR--------IEQPLEVRS---------MCTTGIVNMEVIEKEVKLDEDL---KRIIEELKKNPDESSKFQWING
        L  L  Y F + Y    +N  A+A+SR          +P++  S         +C+  +++M+ + +     ED+   +   ++L+ +      +   + 
Subjt:  LTKLLGYYFEILYQPRLQNKAANALSR--------IEQPLEVRS---------MCTTGIVNMEVIEKEVKLDEDL---KRIIEELKKNPDESSKFQWING

Query:  NLLYKKRIVLSKRSSLIPTLLHTFHD-SILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEG
         + Y+ R+V+  +      ++  +HD ++  GH G   T  ++    YW  ++  + +Y+  C  CQ  K    +  G+LQP+PI E    + SMDF+ G
Subjt:  NLLYKKRIVLSKRSSLIPTLLHTFHD-SILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEG

Query:  L
        L
Subjt:  L

Q9UR07 Transposon Tf2-11 polyprotein5.8e-4125.72Show/hide
Query:  PINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSG-KLNQ
        PI   P   G  Q    E  + + L++GI R + +  + P++   KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+T+F+KLDLKS   L +
Subjt:  PINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSG-KLNQ

Query:  VTVSDKFPIPV-----IEELLDELHGATV-------FSKLHSGPIRDTMSSC---------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYL
        V   D+  +       + E L   +G ++       F     G ++++   C            +EH KH+  V   +++     N+ KC    SQ++++
Subjt:  VTVSDKFPIPV-----IEELLDELHGATV-------FSKLHSGPIRDTMSSC---------TDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYL

Query:  GHLISSKG---VEANEEKIKNMNLP-----------------------------------------------------------------------FIIE
        G+ IS KG    + N +K+     P                                                                        ++E
Subjt:  GHLISSKG---VEANEEKIKNMNLP-----------------------------------------------------------------------FIIE

Query:  TDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLG--RNFTIVSNQRALIFLL--EQREVQPQFQKWLTKLLGY
        TDAS +A+ AVL Q       +P+ ++S K+S      S+ ++E++A++ S++ WR+YL      F I+++ R LI  +  E      +  +W   L  +
Subjt:  TDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLG--RNFTIVSNQRALIFLL--EQREVQPQFQKWLTKLLGY

Query:  YFEILYQPRLQNKAANALSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKK-RIVLSKR
         FEI Y+P   N  A+ALSRI    E             V  +  T     +V+  E   D  L  ++    K  +E+   Q  +G L+  K +I+L   
Subjt:  YFEILYQPRLQNKAANALSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKK-RIVLSKR

Query:  SSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL
        + L  T++  +H+     H G       +     WKG++  +++YV+ C  CQ NK    KP G LQPIP  ER  E  SMDFI  L
Subjt:  SSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.5e-0726.09Show/hide
Query:  RARLDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGT
        +A+LD    +++G   E  E  LE     L +G +   + +  LT    M+  G I   +V+V IDS AT NFI  ++   + L         V  G   
Subjt:  RARLDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGT

Query:  RCQGRGVCKRLELKLKEITIVADFLAIEVG--NVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLPPKREV
          Q  G C  + L ++E+ I  +FL +++   +VD+ILG +WL   G   ++W +   +FS  +  +    + E  E+   K       E + +  +R  
Subjt:  RCQGRGVCKRLELKLKEITIVADFLAIEVG--NVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLPPKREV

Query:  DHRILLL
        D  +L++
Subjt:  DHRILLL

ATMG00850.1 DNA/RNA polymerases superfamily protein4.5e-0443.59Show/hide
Query:  QKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGW
        ++  ++  + +ML+  I +P+ SPYSSP+L+ +KKDGGW
Subjt:  QKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein9.7e-0748.08Show/hide
Query:  HLGMVFAVMRDNQFFANKKKCVMPHSQIQYLG--HLISSKGVEANEEKIKNM
        HLGMV  +   +QF+AN+KKC     QI YLG  H+IS +GV A+  K++ M
Subjt:  HLGMVFAVMRDNQFFANKKKCVMPHSQIQYLG--HLISSKGVEANEEKIKNM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCTGAAGGAGATGTTATTAGAGATGAAAAAATCGATGGAGAGAATGGTTGAGGATCTGAGAGAGAGCCATAGCAACAAAAGTAGAGAGGAGTTTGGGACCTCCGA
TGGATCGGTCATGAAACTGAAGGTGAAGGCCGAGGAAACCGATACTACCAATGAAGGGAACTTGACCATGATCGACCACAGCAAATACAAGAAACTGGAAATGTCGATGT
TCTTGAGAGAAAATCCGAAATCTTGGGTCTACAGAGCAGAGCATTTCTTTGAAATCAATAATTTACCCAAAGCAAAAAAGGTCATGGTGGCAGTAGTAAGCTTTGGACAA
GATGAGGTAGACTGGTACAGATGGTCACATAATAGGAAGAAGGTGGAATCAGAGTGTATTAAGGGATGCGTTCCTGACCTGGTTAGAGCCGACTCTCCAAGCCGAAGTGG
TAAGCCGCAATCCACAAACTCTGGAAGAATGCGGAATGGGGAGGAAGGGATCACAAAAAAGGAGAAGGAAGCACGGGCAAAGAAGAGTGAACCACCAGTGAAACATTTAT
CTGATGCAGAATTCAGAGCAAGACTGGACAAGGAAAGTCTAGAAGAGGAAGGCAGAACAGAGGAGTCTAATGAGGATGTGTTAGAATTGAAACAGATTAAATTGGAGAAG
GGAACCAAAATTGAGTTGAAAGCAATACATGGACTAACAAGTAAAGGAACCATGAAGATAAAGGGTGAAATCAAAGGGAGGGAAGTACTAGTACTGATTGATAGCAAGGC
CACCCACAACTTCATACACAACAAGATTGTTGAGGGAATGGGATTAGCACTAGAGAAGGGTACTCCATTCGGGGTCACTACTGGAGATGGTACGAGATGCCAAGGAAGAG
GGGTGTGTAAGAGATTAGAACTGAAACTAAAGGAGATCACAATTGTGGCGGATTTTTTAGCAATTGAAGTCGGGAATGTGGATCTCATCTTGGGAATGCAATGGCTGGAT
ACAACTGGAACTATGAAGATACATTGGCCATCCCTAACCATGACATTTTCGATGGGGGAAAACAGATTACCCTCAAGGAGGGATGAGGAATCTGATGAAGAACAAAGGGT
GAAGAGCGATGAGGATTTATTTGAGGAACCTAAGGGGTTACCACCTAAAAGAGAAGTTGACCATCGTATTTTGTTGTTGCCCGGACAGAAACCTATCAACATGAGACCCT
ACAAGTATGGTCACACCCAGAAGGAAGAAATTGAGAAGTTGGTAACCAAGATGCTCCAAACTGGGATAAACCGACCCAACCACAGCCCTTACTCTAGTCCCATTTTGATG
GCTAGAAAGAAAGATGGAGGGTGGAGGTTCTGTGTTGACTACAGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTATGATAGAGGAATTGTTAGATGA
GTTGCACGGGGCTACAGTCTTCTCAAAGCTGGATCTAAAATCAGGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTGTGATAGAGGAATTGTTAGATG
AGTTGCACGGGGCTACAGTCTTCTCAAAACTACATTCAGGACCCATTAGGGACACTATGAGTTCTTGCACGGACCTCACAGAGCATGAGAAGCATTTGGGCATGGTATTT
GCAGTAATGAGGGATAACCAGTTCTTTGCCAACAAAAAAAAATGTGTGATGCCTCACTCGCAGATCCAATACTTGGGACATTTAATATCTAGTAAGGGTGTAGAGGCAAA
TGAAGAGAAGATTAAGAACATGAACCTACCCTTCATCATTGAAACGGATGCTTCCGGGATTGCTCTAAGGGCAGTTCTACCTCAAAATGGCCACCCCATAGCCTTCTTTA
GTCAGAAACTTTCATTCAGACCTCAAACCAAATCCATATACGAGAGGGAATTGATAGCAGTAGTCCTTTCAGTACAAAAGTGGAGATATTACCTTTTGGGAAGGAATTTT
ACAATCGTCTCGAATCAGAGAGCCCTCATATTTCTTTTAGAACAAAGGGAAGTGCAACCCCAGTTCCAAAAGTGGTTGACTAAACTCCTAGGGTACTACTTCGAAATACT
TTACCAACCAAGGTTACAAAACAAAGCAGCAAATGCCCTCTCCCGAATAGAACAGCCACTAGAAGTGAGGAGTATGTGCACCACGGGTATTGTTAACATGGAAGTGATTG
AGAAGGAGGTCAAGTTAGATGAGGATCTCAAGAGAATCATCGAAGAATTAAAGAAAAATCCTGATGAGTCTAGTAAATTCCAATGGATTAATGGAAACCTACTGTATAAG
AAACGAATAGTTTTGTCCAAGAGATCCTCCTTGATTCCCACCCTGCTACATACGTTTCATGACTCAATTTTGAGAGGCCACTCCGGATTCTTAAGAACCTATAAAAGAAT
GTGTGGGGAACTTTATTGGAAGGGCATGAAAACGGATGTAAAAAAATATGTAGAGCAGTGTGAGGTATGCCAAAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTT
TGCAGCCAATTCCAATCCCCGAAAGAATTTTGGAGGAATGGTCCATGGACTTCATTGAAGGATTATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTCTGAAGGAGATGTTATTAGAGATGAAAAAATCGATGGAGAGAATGGTTGAGGATCTGAGAGAGAGCCATAGCAACAAAAGTAGAGAGGAGTTTGGGACCTCCGA
TGGATCGGTCATGAAACTGAAGGTGAAGGCCGAGGAAACCGATACTACCAATGAAGGGAACTTGACCATGATCGACCACAGCAAATACAAGAAACTGGAAATGTCGATGT
TCTTGAGAGAAAATCCGAAATCTTGGGTCTACAGAGCAGAGCATTTCTTTGAAATCAATAATTTACCCAAAGCAAAAAAGGTCATGGTGGCAGTAGTAAGCTTTGGACAA
GATGAGGTAGACTGGTACAGATGGTCACATAATAGGAAGAAGGTGGAATCAGAGTGTATTAAGGGATGCGTTCCTGACCTGGTTAGAGCCGACTCTCCAAGCCGAAGTGG
TAAGCCGCAATCCACAAACTCTGGAAGAATGCGGAATGGGGAGGAAGGGATCACAAAAAAGGAGAAGGAAGCACGGGCAAAGAAGAGTGAACCACCAGTGAAACATTTAT
CTGATGCAGAATTCAGAGCAAGACTGGACAAGGAAAGTCTAGAAGAGGAAGGCAGAACAGAGGAGTCTAATGAGGATGTGTTAGAATTGAAACAGATTAAATTGGAGAAG
GGAACCAAAATTGAGTTGAAAGCAATACATGGACTAACAAGTAAAGGAACCATGAAGATAAAGGGTGAAATCAAAGGGAGGGAAGTACTAGTACTGATTGATAGCAAGGC
CACCCACAACTTCATACACAACAAGATTGTTGAGGGAATGGGATTAGCACTAGAGAAGGGTACTCCATTCGGGGTCACTACTGGAGATGGTACGAGATGCCAAGGAAGAG
GGGTGTGTAAGAGATTAGAACTGAAACTAAAGGAGATCACAATTGTGGCGGATTTTTTAGCAATTGAAGTCGGGAATGTGGATCTCATCTTGGGAATGCAATGGCTGGAT
ACAACTGGAACTATGAAGATACATTGGCCATCCCTAACCATGACATTTTCGATGGGGGAAAACAGATTACCCTCAAGGAGGGATGAGGAATCTGATGAAGAACAAAGGGT
GAAGAGCGATGAGGATTTATTTGAGGAACCTAAGGGGTTACCACCTAAAAGAGAAGTTGACCATCGTATTTTGTTGTTGCCCGGACAGAAACCTATCAACATGAGACCCT
ACAAGTATGGTCACACCCAGAAGGAAGAAATTGAGAAGTTGGTAACCAAGATGCTCCAAACTGGGATAAACCGACCCAACCACAGCCCTTACTCTAGTCCCATTTTGATG
GCTAGAAAGAAAGATGGAGGGTGGAGGTTCTGTGTTGACTACAGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTATGATAGAGGAATTGTTAGATGA
GTTGCACGGGGCTACAGTCTTCTCAAAGCTGGATCTAAAATCAGGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTGTGATAGAGGAATTGTTAGATG
AGTTGCACGGGGCTACAGTCTTCTCAAAACTACATTCAGGACCCATTAGGGACACTATGAGTTCTTGCACGGACCTCACAGAGCATGAGAAGCATTTGGGCATGGTATTT
GCAGTAATGAGGGATAACCAGTTCTTTGCCAACAAAAAAAAATGTGTGATGCCTCACTCGCAGATCCAATACTTGGGACATTTAATATCTAGTAAGGGTGTAGAGGCAAA
TGAAGAGAAGATTAAGAACATGAACCTACCCTTCATCATTGAAACGGATGCTTCCGGGATTGCTCTAAGGGCAGTTCTACCTCAAAATGGCCACCCCATAGCCTTCTTTA
GTCAGAAACTTTCATTCAGACCTCAAACCAAATCCATATACGAGAGGGAATTGATAGCAGTAGTCCTTTCAGTACAAAAGTGGAGATATTACCTTTTGGGAAGGAATTTT
ACAATCGTCTCGAATCAGAGAGCCCTCATATTTCTTTTAGAACAAAGGGAAGTGCAACCCCAGTTCCAAAAGTGGTTGACTAAACTCCTAGGGTACTACTTCGAAATACT
TTACCAACCAAGGTTACAAAACAAAGCAGCAAATGCCCTCTCCCGAATAGAACAGCCACTAGAAGTGAGGAGTATGTGCACCACGGGTATTGTTAACATGGAAGTGATTG
AGAAGGAGGTCAAGTTAGATGAGGATCTCAAGAGAATCATCGAAGAATTAAAGAAAAATCCTGATGAGTCTAGTAAATTCCAATGGATTAATGGAAACCTACTGTATAAG
AAACGAATAGTTTTGTCCAAGAGATCCTCCTTGATTCCCACCCTGCTACATACGTTTCATGACTCAATTTTGAGAGGCCACTCCGGATTCTTAAGAACCTATAAAAGAAT
GTGTGGGGAACTTTATTGGAAGGGCATGAAAACGGATGTAAAAAAATATGTAGAGCAGTGTGAGGTATGCCAAAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTT
TGCAGCCAATTCCAATCCCCGAAAGAATTTTGGAGGAATGGTCCATGGACTTCATTGAAGGATTATGTTAG
Protein sequenceShow/hide protein sequence
MGLKEMLLEMKKSMERMVEDLRESHSNKSREEFGTSDGSVMKLKVKAEETDTTNEGNLTMIDHSKYKKLEMSMFLRENPKSWVYRAEHFFEINNLPKAKKVMVAVVSFGQ
DEVDWYRWSHNRKKVESECIKGCVPDLVRADSPSRSGKPQSTNSGRMRNGEEGITKKEKEARAKKSEPPVKHLSDAEFRARLDKESLEEEGRTEESNEDVLELKQIKLEK
GTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLD
TTGTMKIHWPSLTMTFSMGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILM
ARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIRDTMSSCTDLTEHEKHLGMVF
AVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIKNMNLPFIIETDASGIALRAVLPQNGHPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNF
TIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEVRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYK
KRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGLC