CuGenDBv2

CsGy4G011085 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy4G011085
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Gy14Chr4:10339228..10342135
RNA-Seq Expression: CsGy4G011085
Synteny: CsGy4G011085
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR021109 - Aspartic peptidase domain superfamily
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
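
The genome location field above follows a `sequence:start..end` pattern. A minimal Python sketch of splitting it into parts is given here; the names `GenomeLocation` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'Gy14Chr4:10339228..10342135'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(seqid, start, end)


loc = parse_location("Gy14Chr4:10339228..10342135")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the locus spans 2908 bases.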


Homology
GenBank top hits (e-value, % identity, alignment)
XP_031737605.1 uncharacterized protein LOC116402475 [Cucumis sativus]  e-value: 4.35e-262  % identity: 94.34
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQVTIPLK+DYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKY+PGHRCKGREKRELMLLILNEEEDH+RE +TEDEASE+IELN LELN D PIELRLI
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        TGVTSKGTMKLKGHVNG+EVVILIDSGATNNFISQVLVDELQLSIDPGTRFGV IGNG +CEG GICKR KVKLKELTIVADFLAVELG VDLVLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        DSTGTMKVHWPSLTMTFW KGRRIILKGD SLTKSECSLRTLEKTWQSGDQGFLLEFQNYEV+YEGE ETEAELKGKEEGLPMVQRLLEQYAD+FRLPTG
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPI
        LPP+RAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPI
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPI

XP_031742100.1 uncharacterized protein LOC116404055 [Cucumis sativus]  e-value: 1.11e-270  % identity: 91.78
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQVTIPLKNDYQKKD PIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        TGVTSKGTMKLKGH                            LSIDPGTRFGVTIGN NQCEGSGICKR KVKLKELTIVADFLAVELGTVDLVLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGE ETEAELKG+EEGLPMVQRLLEQYADVFRLPTG
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVK KDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GAT FSKLDLKSGYHQIRMREEDVEK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

XP_031745472.1 uncharacterized protein K02A2.6-like [Cucumis sativus]  e-value: 1.30e-294  % identity: 97.42
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKR KVKLKELTIVADFLAVELGTVDLVLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        DSTGTMKVHWPSLTMTFW KGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEF+NYE    GE ETEAELKGKEEGLPMVQRLLEQYADVFRLP G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPV EELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GAT FSKLDLKSGYHQIRMREEDVEK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

XP_031745528.1 uncharacterized protein LOC116405915 [Cucumis sativus]  e-value: 2.05e-215  % identity: 69.95
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        +KQVTIP+K +YQK +PP+KRLSD EFRARLDKGLCF+CNE+Y+PGHRCK ++KRELML I+NEEE  + E+ TE+   EV+ELN L L     IEL+ I
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
         G+TSKGTMK+KG + G+EV+ILIDSGAT+NFI   +V+E+ L ++  T FGVTIG+G +C+G G+C R ++KLKE+TIVADFLA+ELG+VD++LGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        ++TGTMK+HWPSLTMTF M  ++ ILKGDPSL ++ECSL+T+EKTW+  DQGFLLE QNYE   +GEL+    +KG EE  PM+Q LL+QY D+F  P G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPP+R  DHRIL V  QKPINVRPYKYGH QKEEIEKL+ EMLQ G+IRPS SPYSSPVLLV+KKDGGWRFCVDYRKLNQVT++DKFPIPVIEELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GATVFSKLDLKSGYHQIRM+EEDVEK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

XP_031745726.1 uncharacterized protein K02A2.6-like [Cucumis sativus]  e-value: 4.69e-295  % identity: 97.42
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKR KVKLKELTIVADFLAVELGTVDLVLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        DSTGTMKVHWPSLTMTFW KGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEF+NYE    GE ETEAELKGKEEGLPMVQRLLEQYADVFRLP G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPV EELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GAT FSKLDLKSGYHQIRMREEDVEK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S4E096 uncharacterized protein LOC103495179  e-value: 9.01e-215  % identity: 67.37
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQ+TIP+K +Y+K +PP+KRLSD EFR RLD+GLCFRCN+ Y+PGHRCK +EKRELM  ILNEEE+ + E   +++    +EL +LE+     IEL+ +
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        TG+TSKGTMKLKG V  +++V+LIDS AT NFI Q L +EL++ ++  T F VTIG+G +C+G G C+R ++KLKE+TI+ADFLAVELGTVD +LGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        D TGTM++HWPSLTMTFW +GR+I+LKGDPSL K+ECSL+T+EKTW+  D+GFLLE+ N+E+  +   E   E++G E  +PM++ LL QYAD+F  P G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPP+R ID+RILT+ +Q+PIN RPYKYGHVQKEEIEKLV EMLQ GVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ T++DKFPIPVIEELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GA+VFSKLDLKSGYHQIRM+EED+EK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

A0A5A7UYM1 Ty3/gypsy retrotransposon protein  e-value: 1.21e-204  % identity: 69.48
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQ+TIP+K +++K +PP+KRLSD EFRARLD+GLCFRCN+KY+PGHRCK +EKRELM  I+NEEE+ +  +  E+   E +EL  LEL  +  IEL+ +
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        T ++SKGTMKLKG +  +EVV+LIDSGAT+NFI   L  EL+L +DP T FG TIGNG +C G GIC+R +VKL E+TI+ADFLAVELG+VD VLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        D+TGTMK++WPSLTM+FW  GR+I+LKGDPSL ++ECSLRTLEKTWQ  DQGFLLE+ N EV  +   +T+ + +G E  +PM++ LL+QY D+F  P G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPP+R IDHRILT+ DQKPINVRPYKYGH+QK EIEKLV EMLQ GVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ T++DKFPIPVIEELLDEL+
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GA VFSKLDLKSGYHQIRM+EEDVEK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

A0A5D3CUL0 Reverse transcriptase domain-containing protein  e-value: 2.30e-208  % identity: 68.54
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQ+TIP+K +++K +PP+KRLSD EFRARLD+ LCFRCN+KY+PGHRCK +EKRELM  I+NEEE+++  +  E+    ++EL  LEL  D  IEL+ +
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        T  +SKGTMKLKG +  +E+VILIDSGAT+NFI Q L  +L+L ++  T+FG TIGNG +C+G GIC+R +VKL+E+TI+ADFLAVELG+VD VLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        D+ GTMK+HWPSLTM+FW +GR+IILKGDP L K+ECSLRTLEKTWQ  DQGFLLE+ N EV  E   +T+ + KG E  +PM++ LL+QY D+F  P G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPP+R IDHRILT+ +Q+PINVRPYKYGHVQK EIE LV EMLQ G+IRPSRSPYSS VLLVKKKDGGWRFCVDYRKLNQ TV+DKFPIPVIEELLDEL+
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GA VFSKLDLKSGYHQIRM+EED+EK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

A0A5D3CW02 Uncharacterized protein  e-value: 5.99e-212  % identity: 69.25
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQ+TIP+K  ++K +PP+KRLSD +FRARLD+ LCFRCN+KY PGHRCK +EKRELM  I+NEEE+ +     ++     +EL +LEL  D  IEL  +
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        T +TSKGTMKLKG V  +E+V+LIDSGAT+NFI Q L +ELQ+ ++  T+FG TIGNG +C+G G+C+R ++KLKE+TI+ADFLAVELGTVD VLGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        D+TGTM++HWPSLTM FW +GR+I+LKGDPSL K+ECSL+TLEKTWQ  DQGFLLE+ N E++ E + ET+ E+KG E  +PM++ LL+QYAD+F  P G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPP+R IDHRILT+ DQ+PINVRPYKYGHVQKEEIE LV EMLQ G+IRPS SPYSSPVLLVK+KDGGWRFCVDYRKLNQ TV+DKFPIPVIEELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GA VFSKLDLKSGYHQIRM+EED+EK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

A0A5D3E328 Reverse transcriptase  e-value: 1.15e-212  % identity: 67.37
Query:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI
        MKQ+TIP+K +Y+K +PP+KRLSD EFR RLD+GLCFRCN+ Y+PGHRCK +EKRELM  ILNEEE+ + E   +++    +EL +LE+     IEL+ +
Subjt:  MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI

Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL
        TG+TSKGTMKLKG V  +++V+LIDS AT NFI Q L +EL++ ++  T F VTIG+G +C+G G C+R ++KLKE+TI+ADFLAVELGTVD +LGMQWL
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWL

Query:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG
        D TGTM++HWPSLTMTFW +GR+I+LKGDPSL K+ECSL+T+EKTW+  D+GFLLE+ N+E+  +   E   E++G E  +PM++ LL QYAD+F  P G
Subjt:  DSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTG

Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH
        LPP+R ID+RILT+ +Q+PIN RPYKYGHVQKEEIEKLV EMLQ GVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ T++DKFPIPVIEELLDELH
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELH

Query:  GATVFSKLDLKSGYHQIRMREEDVEK
        GA+VFSKLDLKSGYHQIRM+EED+EK
Subjt:  GATVFSKLDLKSGYHQIRMREEDVEK

SwissProt top hits (e-value, % identity, alignment)
P10394 Retrovirus-related Pol polyprotein from transposon 412  e-value: 6.9e-22  % identity: 36.81
Query:  VQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDG------GWRFCVDYRK
        ++ +  +Y D+F L +       +  + L + D +P+  + Y+  H Q EEI+  V ++++  ++ PS S Y+SP+LLV KK         WR  +DYR+
Subjt:  VQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDG------GWRFCVDYRK

Query:  LNQVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMRE
        +N+  +ADKFP+P I+++LD+L  A  FS LDL SG+HQI + E
Subjt:  LNQVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMRE

P20825 Retrovirus-related Pol polyprotein from transposon 297  e-value: 9.9e-21  % identity: 28.51
Query:  DLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFL--LEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLE
        D+++G + L +  ++ +++ + T+T + +  ++I     S       ++   ++  S DQ  +  L+F  + +++  + ET  +LKG          LL 
Subjt:  DLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFL--LEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLE

Query:  QYADV-FRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGG-----WRFCVDYRKLNQVTV
        ++ ++ ++    L     I H +L      PI  + Y      + E+E  V EML  G+IR S SPY+SP  +V KK        +R  +DYRKLN++T+
Subjt:  QYADV-FRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGG-----WRFCVDYRKLNQVTV

Query:  ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKIQLSAR
         D++PIP ++E+L +L     F+ +DL  G+HQI M EE + K   S +
Subjt:  ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKIQLSAR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein  e-value: 1.5e-24  % identity: 42.66
Query:  LLEQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ
        L ++Y ++ R    LPPR A      + H I      +   ++PY      ++EI K+V ++L    I PS+SP SSPV+LV KKDG +R CVDYR LN+
Subjt:  LLEQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ

Query:  VTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED
         T++D FP+P I+ LL  +  A +F+ LDL SGYHQI M  +D
Subjt:  VTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus  e-value: 1.1e-22  % identity: 38.96
Query:  MVQRLLEQYADVFRLP-TGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKK-----DGGWRFCVDYR
        ++  LL ++  +F  P +G+    A+   I T   Q PI  + Y Y    + E+E+ + E+LQ G+IRPS SPY+SP+ +V KK     +  +R  VD++
Subjt:  MVQRLLEQYADVFRLP-TGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKK-----DGGWRFCVDYR

Query:  KLNQVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKIQLS
        +LN VT+ D +PIP I   L  L  A  F+ LDL SG+HQI M+E D+ K   S
Subjt:  KLNQVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKIQLS

Q99315 Transposon Ty3-G Gag-Pol polyprotein  e-value: 1.5e-24  % identity: 42.66
Query:  LLEQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ
        L ++Y ++ R    LPPR A      + H I      +   ++PY      ++EI K+V ++L    I PS+SP SSPV+LV KKDG +R CVDYR LN+
Subjt:  LLEQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ

Query:  VTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED
         T++D FP+P I+ LL  +  A +F+ LDL SGYHQI M  +D
Subjt:  VTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED

Arabidopsis top hits (e-value, % identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein  e-value: 9.6e-11  % identity: 30.33
Query:  RLITGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELG--TVDLVL
        +L+  +T    M+  G +   +VV+ IDSGAT+NFI   L   L+L      +  V +G     +  G C   ++ ++E+ I  +FL ++L    VD++L
Subjt:  RLITGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELG--TVDLVL

Query:  GMQWLDSTGTMKVHWPSLTMTF
        G +WL   G   V+W +   +F
Subjt:  GMQWLDSTGTMKVHWPSLTMTF

AT3G30770.1 Eukaryotic aspartyl protease family protein  e-value: 1.1e-06  % identity: 33.67
Query:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVEL--GTVDLVLG
        T  T    M+  G ++  +VV++IDSGATNNFIS  L   L+L      +  V +G     +  G C    + ++E+ I  +FL ++L    VD++LG
Subjt:  TGVTSKGTMKLKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVEL--GTVDLVLG

ATMG00850.1 DNA/RNA polymerases superfamily protein  e-value: 8.4e-07  % identity: 57.5
Query:  VQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGW
        +++  ++  + EML+A +I+PS SPYSSPVLLV+KKDGGW
Subjt:  VQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGW
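
The % identity column can be related to the alignment blocks through the middle line of each block. The Python sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and uses the ATMG00850.1 alignment above as input; the function name `percent_identity` is illustrative and not part of the database.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumption: the midline follows the BLAST pairwise convention, where
    letters mark identities, '+' marks positives, and spaces mark
    mismatches or gaps. The string must cover the full alignment length,
    including any trailing spaces.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the ATMG00850.1 alignment, copied from the record above
# (40 columns, aligned under the 40-residue query segment).
QUERY = "VQKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGW"
MIDLINE = "+++  ++  + EML+A +I+PS SPYSSPVLLV+KKDGGW"

assert len(QUERY) == len(MIDLINE)
print(percent_identity(MIDLINE))
```

For this hit the midline holds 23 letters over 40 columns, which agrees with the 57.5 reported in the hit line. Longer alignments split over several blocks would need their midlines concatenated first.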


Sequences
CDS sequence
ATGAAGCAGGTAACTATTCCATTGAAAAATGATTACCAGAAAAAAGATCCTCCAATTAAACGGCTGTCGGATACGGAATTTAGAGCAAGGCTAGATAAGGGGCTTTGTTT
CAGATGTAATGAAAAGTACGCCCCAGGCCATCGATGTAAAGGGAGAGAAAAAAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGAAGACCACAAAAGGGAAGAGGATA
CAGAGGATGAGGCAAGCGAAGTAATAGAACTGAATCATCTGGAATTGAATATGGATAACCCTATTGAATTGAGGTTGATCACGGGAGTTACATCAAAGGGGACGATGAAA
TTAAAAGGACATGTGAACGGAAGGGAAGTAGTTATTCTAATTGACAGCGGGGCGACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAGTTGAGCATCGATCC
GGGAACTCGTTTTGGAGTTACCATTGGGAATGGCAACCAATGTGAAGGAAGTGGGATTTGCAAGAGGGCGAAGGTGAAGTTAAAAGAGTTAACAATCGTAGCAGATTTCC
TAGCGGTAGAGTTAGGAACGGTAGACTTGGTGCTTGGGATGCAATGGCTAGATTCGACAGGAACCATGAAGGTTCACTGGCCATCCCTAACCATGACGTTTTGGATGAAG
GGTAGAAGAATAATCCTAAAAGGTGACCCTTCTCTAACGAAGTCAGAATGTTCATTGAGAACCTTAGAGAAAACGTGGCAATCCGGGGACCAAGGATTCCTCTTGGAATT
CCAAAACTATGAAGTAAACTATGAAGGAGAATTGGAAACAGAAGCTGAATTGAAGGGAAAGGAAGAAGGATTACCCATGGTTCAGCGATTGCTCGAGCAATATGCAGATG
TCTTTAGGTTGCCCACGGGTTTACCGCCAAGGAGAGCCATAGACCATCGCATTCTGACCGTGGCCGATCAGAAACCAATTAATGTAAGACCATATAAGTATGGCCATGTA
CAAAAGGAAGAGATTGAAAAATTGGTGTTAGAAATGTTACAAGCTGGGGTGATTCGTCCAAGCCGCAGCCCATATTCGAGCCCGGTCCTCTTAGTGAAGAAAAAAGATGG
AGGGTGGAGATTTTGTGTAGATTACAGGAAACTCAATCAAGTAACGGTGGCTGATAAATTTCCAATTCCCGTGATCGAAGAACTCTTAGATGAACTTCATGGTGCGACAG
TTTTCTCAAAGTTAGACCTTAAATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAATACAGCTTTCCGCACGCATGAAGGACATTATGAGTTCTTGG
TGA
mRNA sequence
TATAGTTGAGCCCAAAAGGAATGAGAACAGCAGCAAATTTCAAGTTAAAAATGAGAAGACAGAAAGTAAAAAGACAGAGTTTGTGATGAAGCAGGTAACTATTCCATTGA
AAAATGATTACCAGAAAAAAGATCCTCCAATTAAACGGCTGTCGGATACGGAATTTAGAGCAAGGCTAGATAAGGGGCTTTGTTTCAGATGTAATGAAAAGTACGCCCCA
GGCCATCGATGTAAAGGGAGAGAAAAAAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGAAGACCACAAAAGGGAAGAGGATACAGAGGATGAGGCAAGCGAAGTAAT
AGAACTGAATCATCTGGAATTGAATATGGATAACCCTATTGAATTGAGGTTGATCACGGGAGTTACATCAAAGGGGACGATGAAATTAAAAGGACATGTGAACGGAAGGG
AAGTAGTTATTCTAATTGACAGCGGGGCGACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAGTTGAGCATCGATCCGGGAACTCGTTTTGGAGTTACCATT
GGGAATGGCAACCAATGTGAAGGAAGTGGGATTTGCAAGAGGGCGAAGGTGAAGTTAAAAGAGTTAACAATCGTAGCAGATTTCCTAGCGGTAGAGTTAGGAACGGTAGA
CTTGGTGCTTGGGATGCAATGGCTAGATTCGACAGGAACCATGAAGGTTCACTGGCCATCCCTAACCATGACGTTTTGGATGAAGGGTAGAAGAATAATCCTAAAAGGTG
ACCCTTCTCTAACGAAGTCAGAATGTTCATTGAGAACCTTAGAGAAAACGTGGCAATCCGGGGACCAAGGATTCCTCTTGGAATTCCAAAACTATGAAGTAAACTATGAA
GGAGAATTGGAAACAGAAGCTGAATTGAAGGGAAAGGAAGAAGGATTACCCATGGTTCAGCGATTGCTCGAGCAATATGCAGATGTCTTTAGGTTGCCCACGGGTTTACC
GCCAAGGAGAGCCATAGACCATCGCATTCTGACCGTGGCCGATCAGAAACCAATTAATGTAAGACCATATAAGTATGGCCATGTACAAAAGGAAGAGATTGAAAAATTGG
TGTTAGAAATGTTACAAGCTGGGGTGATTCGTCCAAGCCGCAGCCCATATTCGAGCCCGGTCCTCTTAGTGAAGAAAAAAGATGGAGGGTGGAGATTTTGTGTAGATTAC
AGGAAACTCAATCAAGTAACGGTGGCTGATAAATTTCCAATTCCCGTGATCGAAGAACTCTTAGATGAACTTCATGGTGCGACAGTTTTCTCAAAGTTAGACCTTAAATC
TGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAATACAGCTTTCCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGCCTTACGAATGC
TCCTGCCACCTTCCAATCTCTCATGAACGATGTGTTCAAACCATTCCTTCGAAGGTGTGTCCTGGTTTTTTTTTATGACATTCTAGTTTATAGTGTGGACATAGATGAGC
ACATGAAACATTTAGGAATGGTTTTTGCTATCTTGAGGGACCATGAATTGTTTGCAAATAGGTCTAAATGTGTCATTGCTCATTCCCAAGTTCAATATTTGGGTCATCTG
ATTTCCAGCAGAGGAGTGGAGGCTGATGAGGACAAGATTCGCAGTATGGTAAATTGGCCACGGCCGAAAGATATAACTGGGCTGAGGGGATTCCTTGGACTGACTGGGTA
TTATAGAAGATTTGTGAAAAGCTATGGAGAAATAGTTGCACCCTTAACCAAATTACTTCAGAAAAATGCATTCCATTGGAATGAGGAAGCTACAATAGCGTTTGACCAGC
TGAAGCTAGCAATGACAACCTTACCGGTATTAGCATTGTCGGATTGGTCTCAGCCCTTCACAATCGAAACTGATGCTTCAGGAGTAGGTTTAGGTGCAGTTTTATCACAG
AATGGTCATCCCATCGCATTCTTCAGCCAAAAACTGTCCCCAAGAGCCCAGGGTAAGTCGATCTATGAAAGGGAATTGATGGCGGTTGTCCTTTCGGTGCAAAAATGGAG
ACATTATCTCCTGGGCAGAAAGTTCACAATTGTTTCAGATCAGAAGGCTCTGAAATTTCTGTTAGAACAGAGGGAAGTTCAGCCTCAATTCCAAAAGTGGCTCACAAAAC
TTTTGGGGTATGACTTTGAGATCTTATACCAGCCGGGTCTGCAGAATAAGGTGGCGGATGCTCTCTCAAGGAAGGACCATTCGGTGGAGTTAAATACAATGACAACCACA
GGCATAGTTGATATAGAGATAATAGCAAAGAAGTTGAAATGGATCAAGAACTTCAGAAAATTATTGCCGAACTTAAGGGAGAGGTGGATCAAGGTGGGAAATACCAGTGG
AACAATGGCAGGCTGCTATATAAAGGAAGGATGGTGCTGCCGCGAAATTCCTCCCTCATTCCGAGTCTTCTACACACGTTCTATGATTCCATATTAGGAGGGCACTCGGG
ATTCTTGAGAACCTATAAAAGAATGAGTGGGGAACTATTTTGGAAAGGAATGAAGGCTGATATCAAGAGATATGTAGAAGAATGTGACACTTGTCAACGTAATAAGTTCG
AAGCTACAAAGCCTGCGGGAGTTCTGCAACCCATTCCTATCCCCGACAAGATATTGGAGGATTGGACCATGGATTTTATTGAAGGGCTGCCAATAGCAGGAGGTTACAAC
GTGATTATGGTAGTCGTCGACCGCCTAAGTAAGTACTCCTACTTCTTGCCTCTGAAACACCCGTACACAGCCAAGCAAGTGGCTTCAATTTTTTTGGAAAAAGTGGTCAG
CAAACATGGAATACCCAAGTCCATTATTACCGACCGTGATAAGATCTT
Protein sequence
MKQVTIPLKNDYQKKDPPIKRLSDTEFRARLDKGLCFRCNEKYAPGHRCKGREKRELMLLILNEEEDHKREEDTEDEASEVIELNHLELNMDNPIELRLITGVTSKGTMK
LKGHVNGREVVILIDSGATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRAKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMK
GRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGELETEAELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHV
QKEEIEKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKIQLSARMKDIMSSW
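
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following Python sketch assumes the standard genetic code (NCBI translation table 1), which this record does not state; `translate`, `CDS_START` and `CDS_END` are illustrative names, and the two fragments are the first 24 and last 15 bases of the CDS as listed.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Whitespace and line breaks are removed first. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


CDS_START = "ATGAAGCAGGTAACTATTCCATTG"  # first 8 codons of the CDS
CDS_END = "ATGAGTTCTTGGTGA"             # last 5 codons, including the stop

print(translate(CDS_START))  # start of the protein sequence
print(translate(CDS_END))    # end of the protein sequence plus stop
```

The two fragments translate to `MKQVTIPL` and `MSSW*`, matching the first and last residues of the protein sequence. Passing the full CDS text, line breaks included, should reproduce the whole protein followed by `*`.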