; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G20550 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G20550
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:18809754..18815325
RNA-Seq ExpressionCSPI04G20550
SyntenyCSPI04G20550
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033982.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.5e-19551.11Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT L L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

KAA0038753.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.5e-19551.11Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT L L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

KAA0055700.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.8e-19550.98Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT + L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

KAA0061635.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-19650.92Show/hide
Query:  KKGLELLEQKFMEMQT----KFKKIPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKSNE-KLTEQSKFKKFEIPVFNGTDPDSW
        ++ + LL +   EM      ++++  +++KYLEGI+R+ +   +  EGS S+ +  E  +  ++++ K+  K+ E   T++SKFKK E+P+F GTDPDSW
Subjt:  KKGLELLEQKFMEMQT----KFKKIPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKSNE-KLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLT-------------------------------IVL
        LFRADR+FKIH+ +DSEKL+VA+I FDGPA+DWYR Q+E EAFK W  LK+KML  F  IRDG+L                                +VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLT-------------------------------IVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRD---------------GSQRGWNDNI-------------------------------------
        EETFMNGL PWLK+EV   EP GLA MMK+ALKIE+R+                +QR  N  I                                     
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRD---------------GSQRGWNDNI-------------------------------------

Query:  --------------------ERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                            E+ YAG   K + +K  ELRML+V +NGEELEIIE++ +D E E+KP EV   ENLNIELSINSVVGL N GTMKVKGK+
Subjt:  --------------------ERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VVVLIDCGATHNFI+EKLV  L LP+K T  YGVILG GT+IKGKG                                 WL S+G TEVDW+ LI+
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF     ++TIKGDPSL K ++SLK M+K WD +DQ YL+ CRAL      +   D +   T E  + ++++KF DVF  P  LPPQRGIEHHI+LKQG 
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML SGII  STSPYSSPVLLVR+KDGSWRF VDYRALNNVT+PDKFPIPV+EELF+ELNGA++FSKID+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV
        RM+ ED+EKTTFRTHEGHYEF+VMPFGLTNAPSTFQALMN +F+P++RRFVLVFFDDIL+
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV

TYK08591.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.5e-19551.11Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT L L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

TrEMBL top hitse value%identityAlignment
A0A5A7SS61 Ty3/gypsy retrotransposon protein2.2e-19551.11Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT L L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

A0A5A7T725 Ty3/gypsy retrotransposon protein2.2e-19551.11Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT L L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

A0A5A7UKN8 Ty3/gypsy retrotransposon protein2.8e-19550.98Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT + L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

A0A5A7V707 Ty3/gypsy retrotransposon protein6.7e-19750.92Show/hide
Query:  KKGLELLEQKFMEMQT----KFKKIPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKSNE-KLTEQSKFKKFEIPVFNGTDPDSW
        ++ + LL +   EM      ++++  +++KYLEGI+R+ +   +  EGS S+ +  E  +  ++++ K+  K+ E   T++SKFKK E+P+F GTDPDSW
Subjt:  KKGLELLEQKFMEMQT----KFKKIPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKSNE-KLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLT-------------------------------IVL
        LFRADR+FKIH+ +DSEKL+VA+I FDGPA+DWYR Q+E EAFK W  LK+KML  F  IRDG+L                                +VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLT-------------------------------IVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRD---------------GSQRGWNDNI-------------------------------------
        EETFMNGL PWLK+EV   EP GLA MMK+ALKIE+R+                +QR  N  I                                     
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRD---------------GSQRGWNDNI-------------------------------------

Query:  --------------------ERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                            E+ YAG   K + +K  ELRML+V +NGEELEIIE++ +D E E+KP EV   ENLNIELSINSVVGL N GTMKVKGK+
Subjt:  --------------------ERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VVVLIDCGATHNFI+EKLV  L LP+K T  YGVILG GT+IKGKG                                 WL S+G TEVDW+ LI+
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF     ++TIKGDPSL K ++SLK M+K WD +DQ YL+ CRAL      +   D +   T E  + ++++KF DVF  P  LPPQRGIEHHI+LKQG 
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML SGII  STSPYSSPVLLVR+KDGSWRF VDYRALNNVT+PDKFPIPV+EELF+ELNGA++FSKID+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV
        RM+ ED+EKTTFRTHEGHYEF+VMPFGLTNAPSTFQALMN +F+P++RRFVLVFFDDIL+
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV

A0A5D3C9P5 Ty3/gypsy retrotransposon protein2.2e-19551.11Show/hide
Query:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW
        ++ L LL +   EM ++  K      +++KY+E I+R+ +   + +EGS S+   +E       +E K+ +K+  E+  ++SKFKK E+PVF+GTDPDSW
Subjt:  KKGLELLEQKFMEMQTKFKK----IPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKS-NEKLTEQSKFKKFEIPVFNGTDPDSW

Query:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL
        LFRADR+FKIH+ TDSE+L+VA+ISFDGPA+DWYR Q+E EAF  WD LK KML  F   R+G+L                               T+VL
Subjt:  LFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSL-------------------------------TIVL

Query:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---
        EETFMNGL PWLK+EV  LEP GLA MMKLALKIE+R+  +R                                              G  DN   G   
Subjt:  EETFMNGLYPWLKTEVSLLEPKGLAAMMKLALKIEHRDGSQR----------------------------------------------GWNDNIERG---

Query:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV
                               +AG   K + +K  ELRMLVV + GEELEI+E++ +D E E+KP EV   ENLNIELS+NSVVGLNN GTMKVKGKV
Subjt:  -----------------------YAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKGKV

Query:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM
          E+VV+LIDCGATHNFI+E LVT L L L+ T  YGVILG GT +KGKG                                 WL SLGVTEVDW++L++
Subjt:  KNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------------------------WLKSLGVTEVDWRKLIM

Query:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT
        TF H  K++ I+GDPSL+K RVSLK++MKTW  +DQ +L+ CR L      +E      K  AE  + +LL +F  VF WP TLPPQR IEHHI+LK GT
Subjt:  TFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIGCRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGT

Query:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI
        +P              EMERLVDEML+SGII PS SPYSSPVLLVR+KDGSWRF VDYRALNNVTIPDKFPIPVIEELF+EL GA++FSK+D+KA YHQI
Subjt:  NP--------------EMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQI

Query:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE
        RM  ED+EKT FRTHEGHYEFLVMPFGLTNAPSTFQALMN +F+PY+RRFVLVFFDDIL+ ++
Subjt:  RMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.1e-3445.45Show/hide
Query:  QGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGS-----WRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQED
        Q    E+E  + +ML  GII  S SPY+SP+ +V +K  +     +R  +DYR LN +T+ D+ PIP ++E+  +L   N F+ ID+   +HQI M+ E 
Subjt:  QGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGS-----WRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQED

Query:  VEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV
        V KT F T  GHYE+L MPFGL NAP+TFQ  MN I RP + +  LV+ DDI+V
Subjt:  VEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV

P20825 Retrovirus-related Pol polyprotein from transposon 2971.1e-3445.51Show/hide
Query:  LKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKD-----GSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQ
        L Q    E+E  V EML  G+I  S SPY+SP  +V +K        +R  +DYR LN +TIPD++PIP ++E+  +L     F+ ID+   +HQI M++
Subjt:  LKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKD-----GSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQ

Query:  EDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV
        E + KT F T  GHYE+L MPFGL NAP+TFQ  MN+I RP + +  LV+ DDI++
Subjt:  EDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.4e-3444.91Show/hide
Query:  HLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVE
        H+ +    E+ ++V ++L +  I PS SP SSPV+LV +KDG++R  VDYR LN  TI D FP+P I+ L + +  A +F+ +D+ + YHQI M  +D  
Subjt:  HLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVE

Query:  KTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAED----WKNTSSI
        KT F T  G YE+ VMPFGL NAPSTF   M   FR    RFV V+ DDIL+ +E     WK+  ++
Subjt:  KTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAED----WKNTSSI

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus8.3e-3536.96Show/hide
Query:  SSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWP------ETLPP---QRGIEHHIHLKQGTNP-----EMERLVDEMLASGIIWPSTSPYSSPVLLVR
        +S+ +  +   E     + +L SLL +F  +F  P      ET      +   +  I+ K    P     E+ER +DE+L  GII PS SPY+SP+ +V 
Subjt:  SSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWP------ETLPP---QRGIEHHIHLKQGTNP-----EMERLVDEMLASGIIWPSTSPYSSPVLLVR

Query:  RK-----DGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNS
        +K     +  +R  VD++ LN VTIPD +PIP I      L  A  F+ +D+ + +HQI M + D+ KT F T  G YEFL +PFGL NAP+ FQ +++ 
Subjt:  RK-----DGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNS

Query:  IFRPYMRRFVLVFFDDILVAAED----WKN
        I R ++ +   V+ DDI+V +ED    WKN
Subjt:  IFRPYMRRFVLVFFDDILVAAED----WKN

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.4e-3444.91Show/hide
Query:  HLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVE
        H+ +    E+ ++V ++L +  I PS SP SSPV+LV +KDG++R  VDYR LN  TI D FP+P I+ L + +  A +F+ +D+ + YHQI M  +D  
Subjt:  HLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVTIPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVE

Query:  KTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAED----WKNTSSI
        KT F T  G YE+ VMPFGL NAPSTF   M   FR    RFV V+ DDIL+ +E     WK+  ++
Subjt:  KTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAED----WKNTSSI

Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein2.5e-1029.67Show/hide
Query:  SKFKKFEIPVFNGTDPDSWLFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLTIVLEET
        S  ++ E+PVF+G+    W  + +RFF++  + DS+KL +  +S +G A+ W+  +     F+DW+  ++++L  F  ++  S   VL  T
Subjt:  SKFKKFEIPVFNGTDPDSWLFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLTIVLEET

AT3G29750.1 Eukaryotic aspartyl protease family protein1.7e-0625.27Show/hide
Query:  EETFMNGLYPWLKTEVSLLEPKGL-------AAMMKLALKIEHRDGSQRGWNDNIERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEV
        EE F+ GL P L+T V  L+P G+       A +M L L     D  ++               KG  N               ELE +E D Y      
Subjt:  EETFMNGLYPWLKTEVSLLEPKGL-------AAMMKLALKIEHRDGSQRGWNDNIERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEV

Query:  KPMEVGADENLNIELSINSVVGLNNSGTMKVKGKVKNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------
          +  G ++          V+ L  +  M+  G + +  VVV ID GAT NFI  +L  +L LP   T    V+LG    I+  G               
Subjt:  KPMEVGADENLNIELSINSVVGLNNSGTMKVKGKVKNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG---------------

Query:  --------------------WLKSLGVTEVDWRKLIMTFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQ
                            WL  LG T V+W+    +F H  + IT+  +   ++ +V+ K  MK+ +NE +
Subjt:  --------------------WLKSLGVTEVDWRKLIMTFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQ

AT3G30770.1 Eukaryotic aspartyl protease family protein3.5e-0443.86Show/hide
Query:  MKVKGKVKNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG
        M+  G +    VVV+ID GAT+NFIS++L   L LP  TT    V+LG    I+  G
Subjt:  MKVKGKVKNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKG

AT3G44713.1 unknown protein1.2e-0429.27Show/hide
Query:  PVFNGTDPD--SWLFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLTIVL
        P FNG   +  SW+   + FF   +FTD EK+++A    +G A  W+  + +   F+ W+ L++ ++  F   +D  L  +L
Subjt:  PVFNGTDPD--SWLFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLTIVL

ATMG00850.1 DNA/RNA polymerases superfamily protein1.1e-0553.33Show/hide
Query:  IHLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSW
        IH+ + T   ++  + EML + II PS SPYSSPVLLV++KDG W
Subjt:  IHLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTATTCAGACTAAATCGTCAGCGGCAAGTGATCAACCATCTCTGGCAACCAACGATCACTTCCGGCGACTTCCGGCGACTTCCGGACTCGGGGCAATAGCTGCTGG
ATTCAGAAACCACAACTCGGTAAAATCGAAAATGTTGAAAGGAAACGGAGGTACTTGGTTGGGGAATGAGAAGATTGGATCAAACGTTTGGAATGCTACAGCAAAGGAAT
GCAATTCGTCGAAGTACGTGATGGTGAATGAAACCGATACTGGCGAGCATTTGAGATTGGATAAGGGAATCTCTAAGGTTTATTTATTCCGTTCCTCGTTTATTTCTCTC
CGTTTCTCTTTTCCTCTCCACATCGTTTTCTATCTTTCTCGGCTTTTTTTTCTCTCTTTCTCTTCTCCGGCGCGTCATCACCATTATCGGCTTTCGTCTTCTCTCTCCCT
CTTCGCCCGCTGTCTTCTCTCTTCTCCGGCTAAAGTTACAAAGTATGTGTTTCTCTATTCCTATATTTCTCTCTTCTCCGGCTCTGTCTCTCTATTTATCTCTTCCCCGA
TCTCCATCTTCTCCGATTCTCTTTCTCTTAACTTTTCTTCTCCGATTCTCTGCCTTCTCTCTTCTTCGGCTAAAGGTAGAGTACTGTACTTTGGTGACGGAAATTTGATG
AAATTTAGTTCTCCTTTTACTTACTTTGAGCCCTTAGTTGTCTGTTATCTGCCATCTCCTTTCTGGGTTTTGATGCTAGCCCTAAAACCTGGGCATTCAGAAAAGATGGC
GAAAAAAGAAAAAAAAGGTCTGGAACTTCTTGAGCAGAAATTTATGGAAATGCAAACAAAGTTCAAAAAAATACCGATGGTTGTGAAATATCTTGAAGGTATTATTCGAG
AGAAGGCTTTGACATCGAAGGAAGACGAAGGATCTGTGAGTAGAGAGAAGAAAAGCGAAAATATATTAGTTGAACTCAACGATGAATTGAAGTTGCACCTGAAGAGCAAT
GAAAAATTGACTGAGCAGAGTAAGTTTAAGAAATTTGAAATACCGGTATTCAATGGCACGGATCCGGATTCTTGGTTATTCCGAGCGGATCGATTCTTCAAAATACACAG
TTTTACTGATTCAGAGAAGTTGTCTGTGGCAATTATCAGTTTTGATGGACCGGCTATAGATTGGTATCGTTTTCAGGATGAATGTGAGGCTTTTAAGGATTGGGATGGTC
TAAAAGAGAAGATGTTAACTTGTTTTCCAACAATAAGAGATGGATCTCTTACAATTGTATTAGAAGAAACTTTTATGAATGGGCTTTACCCATGGCTGAAAACGGAAGTG
TCTCTTTTGGAACCTAAGGGCTTGGCTGCAATGATGAAATTGGCATTAAAAATAGAACACAGAGATGGTTCGCAAAGAGGCTGGAACGATAACATTGAAAGGGGTTATGC
CGGGGGAGAACTGAAGGGAAGGGCCAACAAAAAGATTGAATTGAGAATGTTGGTTGTGCACGACAATGGGGAAGAATTAGAAATCATTGAAGATGACCCATATGATGAAG
AATTGGAGGTTAAACCAATGGAGGTAGGAGCTGATGAAAATCTCAACATTGAGTTATCTATCAACTCAGTGGTGGGATTAAACAACTCTGGAACCATGAAAGTGAAGGGA
AAAGTGAAGAATGAGGATGTTGTAGTGTTAATTGACTGTGGGGCTACCCACAATTTCATATCTGAGAAATTGGTGACTGCATTAAATCTTCCTTTGAAAACAACTACAAC
TTATGGAGTGATTCTTGGGCCAGGGACGACTATTAAAGGAAAAGGCTGGTTGAAATCTTTAGGTGTCACTGAAGTGGACTGGAGGAAGTTAATTATGACTTTTCAGCATT
ATTGGAAGAGGATTACAATCAAAGGTGATCCTAGCTTAATGAAGATAAGGGTTAGTTTGAAACATATGATGAAGACTTGGGATAATGAAGATCAGAGTTATTTAATAGGG
TGTCGAGCATTACACAGTAGTTTAACTATAGAAGAGATGTATGATGAGGAGACAAAACCCACAGCCGAAAATTTACTGCCCTCTCTGTTAACTAAGTTCGATGATGTGTT
CAGTTGGCCCGAGACGCTACCACCGCAACGTGGGATTGAACATCATATCCATCTCAAACAAGGGACGAATCCAGAAATGGAACGTTTGGTGGATGAGATGCTGGCTTCGG
GAATTATTTGGCCAAGTACCAGTCCATATTCTAGTCCCGTGTTGTTGGTAAGAAGGAAAGATGGGAGTTGGCGTTTTTATGTCGATTATAGAGCGCTGAACAATGTAACA
ATCCCAGATAAATTTCCTATTCCAGTTATTGAAGAGCTCTTTAATGAACTTAATGGAGCGAATATGTTTTCAAAGATTGATATTAAAGCGAGATATCATCAAATTCGGAT
GAATCAAGAAGATGTGGAGAAGACAACTTTTAGAACTCATGAAGGGCATTATGAGTTCTTAGTTATGCCATTTGGGCTCACCAATGCCCCTTCAACTTTCCAAGCATTAA
TGAATTCAATCTTCAGGCCGTATATGAGGAGGTTTGTGTTGGTTTTCTTTGATGACATATTAGTTGCAGCAGAGGATTGGAAGAACACTTCCAGCATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTATTCAGACTAAATCGTCAGCGGCAAGTGATCAACCATCTCTGGCAACCAACGATCACTTCCGGCGACTTCCGGCGACTTCCGGACTCGGGGCAATAGCTGCTGG
ATTCAGAAACCACAACTCGGTAAAATCGAAAATGTTGAAAGGAAACGGAGGTACTTGGTTGGGGAATGAGAAGATTGGATCAAACGTTTGGAATGCTACAGCAAAGGAAT
GCAATTCGTCGAAGTACGTGATGGTGAATGAAACCGATACTGGCGAGCATTTGAGATTGGATAAGGGAATCTCTAAGGTTTATTTATTCCGTTCCTCGTTTATTTCTCTC
CGTTTCTCTTTTCCTCTCCACATCGTTTTCTATCTTTCTCGGCTTTTTTTTCTCTCTTTCTCTTCTCCGGCGCGTCATCACCATTATCGGCTTTCGTCTTCTCTCTCCCT
CTTCGCCCGCTGTCTTCTCTCTTCTCCGGCTAAAGTTACAAAGTATGTGTTTCTCTATTCCTATATTTCTCTCTTCTCCGGCTCTGTCTCTCTATTTATCTCTTCCCCGA
TCTCCATCTTCTCCGATTCTCTTTCTCTTAACTTTTCTTCTCCGATTCTCTGCCTTCTCTCTTCTTCGGCTAAAGGTAGAGTACTGTACTTTGGTGACGGAAATTTGATG
AAATTTAGTTCTCCTTTTACTTACTTTGAGCCCTTAGTTGTCTGTTATCTGCCATCTCCTTTCTGGGTTTTGATGCTAGCCCTAAAACCTGGGCATTCAGAAAAGATGGC
GAAAAAAGAAAAAAAAGGTCTGGAACTTCTTGAGCAGAAATTTATGGAAATGCAAACAAAGTTCAAAAAAATACCGATGGTTGTGAAATATCTTGAAGGTATTATTCGAG
AGAAGGCTTTGACATCGAAGGAAGACGAAGGATCTGTGAGTAGAGAGAAGAAAAGCGAAAATATATTAGTTGAACTCAACGATGAATTGAAGTTGCACCTGAAGAGCAAT
GAAAAATTGACTGAGCAGAGTAAGTTTAAGAAATTTGAAATACCGGTATTCAATGGCACGGATCCGGATTCTTGGTTATTCCGAGCGGATCGATTCTTCAAAATACACAG
TTTTACTGATTCAGAGAAGTTGTCTGTGGCAATTATCAGTTTTGATGGACCGGCTATAGATTGGTATCGTTTTCAGGATGAATGTGAGGCTTTTAAGGATTGGGATGGTC
TAAAAGAGAAGATGTTAACTTGTTTTCCAACAATAAGAGATGGATCTCTTACAATTGTATTAGAAGAAACTTTTATGAATGGGCTTTACCCATGGCTGAAAACGGAAGTG
TCTCTTTTGGAACCTAAGGGCTTGGCTGCAATGATGAAATTGGCATTAAAAATAGAACACAGAGATGGTTCGCAAAGAGGCTGGAACGATAACATTGAAAGGGGTTATGC
CGGGGGAGAACTGAAGGGAAGGGCCAACAAAAAGATTGAATTGAGAATGTTGGTTGTGCACGACAATGGGGAAGAATTAGAAATCATTGAAGATGACCCATATGATGAAG
AATTGGAGGTTAAACCAATGGAGGTAGGAGCTGATGAAAATCTCAACATTGAGTTATCTATCAACTCAGTGGTGGGATTAAACAACTCTGGAACCATGAAAGTGAAGGGA
AAAGTGAAGAATGAGGATGTTGTAGTGTTAATTGACTGTGGGGCTACCCACAATTTCATATCTGAGAAATTGGTGACTGCATTAAATCTTCCTTTGAAAACAACTACAAC
TTATGGAGTGATTCTTGGGCCAGGGACGACTATTAAAGGAAAAGGCTGGTTGAAATCTTTAGGTGTCACTGAAGTGGACTGGAGGAAGTTAATTATGACTTTTCAGCATT
ATTGGAAGAGGATTACAATCAAAGGTGATCCTAGCTTAATGAAGATAAGGGTTAGTTTGAAACATATGATGAAGACTTGGGATAATGAAGATCAGAGTTATTTAATAGGG
TGTCGAGCATTACACAGTAGTTTAACTATAGAAGAGATGTATGATGAGGAGACAAAACCCACAGCCGAAAATTTACTGCCCTCTCTGTTAACTAAGTTCGATGATGTGTT
CAGTTGGCCCGAGACGCTACCACCGCAACGTGGGATTGAACATCATATCCATCTCAAACAAGGGACGAATCCAGAAATGGAACGTTTGGTGGATGAGATGCTGGCTTCGG
GAATTATTTGGCCAAGTACCAGTCCATATTCTAGTCCCGTGTTGTTGGTAAGAAGGAAAGATGGGAGTTGGCGTTTTTATGTCGATTATAGAGCGCTGAACAATGTAACA
ATCCCAGATAAATTTCCTATTCCAGTTATTGAAGAGCTCTTTAATGAACTTAATGGAGCGAATATGTTTTCAAAGATTGATATTAAAGCGAGATATCATCAAATTCGGAT
GAATCAAGAAGATGTGGAGAAGACAACTTTTAGAACTCATGAAGGGCATTATGAGTTCTTAGTTATGCCATTTGGGCTCACCAATGCCCCTTCAACTTTCCAAGCATTAA
TGAATTCAATCTTCAGGCCGTATATGAGGAGGTTTGTGTTGGTTTTCTTTGATGACATATTAGTTGCAGCAGAGGATTGGAAGAACACTTCCAGCATTTAG
Protein sequenceShow/hide protein sequence
MTIQTKSSAASDQPSLATNDHFRRLPATSGLGAIAAGFRNHNSVKSKMLKGNGGTWLGNEKIGSNVWNATAKECNSSKYVMVNETDTGEHLRLDKGISKVYLFRSSFISL
RFSFPLHIVFYLSRLFFLSFSSPARHHHYRLSSSLSLFARCLLSSPAKVTKYVFLYSYISLFSGSVSLFISSPISIFSDSLSLNFSSPILCLLSSSAKGRVLYFGDGNLM
KFSSPFTYFEPLVVCYLPSPFWVLMLALKPGHSEKMAKKEKKGLELLEQKFMEMQTKFKKIPMVVKYLEGIIREKALTSKEDEGSVSREKKSENILVELNDELKLHLKSN
EKLTEQSKFKKFEIPVFNGTDPDSWLFRADRFFKIHSFTDSEKLSVAIISFDGPAIDWYRFQDECEAFKDWDGLKEKMLTCFPTIRDGSLTIVLEETFMNGLYPWLKTEV
SLLEPKGLAAMMKLALKIEHRDGSQRGWNDNIERGYAGGELKGRANKKIELRMLVVHDNGEELEIIEDDPYDEELEVKPMEVGADENLNIELSINSVVGLNNSGTMKVKG
KVKNEDVVVLIDCGATHNFISEKLVTALNLPLKTTTTYGVILGPGTTIKGKGWLKSLGVTEVDWRKLIMTFQHYWKRITIKGDPSLMKIRVSLKHMMKTWDNEDQSYLIG
CRALHSSLTIEEMYDEETKPTAENLLPSLLTKFDDVFSWPETLPPQRGIEHHIHLKQGTNPEMERLVDEMLASGIIWPSTSPYSSPVLLVRRKDGSWRFYVDYRALNNVT
IPDKFPIPVIEELFNELNGANMFSKIDIKARYHQIRMNQEDVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNSIFRPYMRRFVLVFFDDILVAAEDWKNTSSI