; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G08960 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G08960
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr7:6794619..6801887
RNA-Seq ExpressionCSPI07G08960
SyntenyCSPI07G08960
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.0e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+K+D SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY NL KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-26254.41Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T     + +L+KF DVF+    LPP+R IEHHI LK+G DP+NVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+ KL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein1.5e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+K+D SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY NL KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

A0A5A7U2S1 Ty3/gypsy retrotransposon protein1.2e-26254.41Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T     + +L+KF DVF+    LPP+R IEHHI LK+G DP+NVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+ KL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

A0A5A7U6J3 Ty3/gypsy retrotransposon protein1.9e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

A0A5D3CXB1 Ty3/gypsy retrotransposon protein1.9e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

A0A5D3DI73 Ty3/gypsy retrotransposon protein1.9e-26354.64Show/hide
Query:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---
        ++KGKG+C  VE+ +G  K+T+SFLPLELGGV+ ILGMQWLHS+GT E DWKNL                                      G L E   
Subjt:  AVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNL--------------------------------------GRLAE---

Query:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL
           EG+ED EV      T    ++ +L+KF DVF+    LPP+R IEHHI LK+G DPVNVRPY Y HQQKEEMERLVDEML +GIIRP TSPYSSPVLL
Subjt:  ---EGLEDEEVPA----TVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLL

Query:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK
        V+KKD SWRFC+DYR LNNVT+PDKFPI V++ELF+ELNGA++FSKIDLK GYHQIRMHP  +EKT FRTHEGHYEF+VMPFGLTNAPSTFQALMN +FK
Subjt:  VKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFK

Query:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------
        P++ RFV VFFDD+L++SKG+EEHGQHLEVVLEILRE+ELY N  KC FAK  I YLGH+IS+KG E                                 
Subjt:  PYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTE---------------------------------

Query:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI
        FVQ+YG I APLTQLL+ G ++WTEE++ AF++L+ AMMTLPVL +PDFN  FE+E+DAS + VG VLVQ K+P+AYFS  L+ RDRA+P+YERELMAV+
Subjt:  FVQHYGFIVAPLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVI

Query:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK
          VQRWRPYLL RKF VKTDQ+SLKFLL+QRVIQ QYQ+W+AKL GYSFEV+YKP LENKAADALS + PT +L  +TAP L+D+ VI+KEV  D  L +
Subjt:  LPVQRWRPYLLERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSK

Query:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL
        I+  ++  E+G E   ++  QG+L++K + VLSK STLIPTI+HTYHDS                                        N S ALSP  L
Subjt:  IMEELQAMEEGTE-GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDS----------------------------------------NNSLALSPTRL

Query:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR
        LL L+ P+ IWSDISMDFIEGLP+S                               A +FVKE+VRLHGFPKSIVSDRDK+F+S+FW EMFK+AGT+LNR
Subjt:  LLSLDSPNNIWSDISMDFIEGLPRS----------------------------NDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNR

Query:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE
        S++YHPQ+DG+TE                                      S+G+TPFQA+YGR+PPPL+ YG+ +TPN+TL++
Subjt:  STAYHPQSDGKTE-------------------------------------RSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.0e-6834.07Show/hide
Query:  LLSKFNDV-------FDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDS-----WRFCID
        LL K++D+         +  +     N +H++PL           Y YP   ++E+E  + +ML  GIIR   SPY+SP+ +V KK D+     +R  ID
Subjt:  LLSKFNDV-------FDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDS-----WRFCID

Query:  YRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDD
        YR+LN +T+ D+ PI  +DE+  +L   N F+ IDL  G+HQI M P  V KT F T  GHYE+L MPFGL NAP+TFQ  MN I +P +++   V+ DD
Subjt:  YRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDD

Query:  VLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKG---------------------------------TEFVQHYGFIVAPLT
        ++V S  ++EH Q L +V E L +  L + L KC F K    +LGH ++  G                                  +F+ ++  I  P+T
Subjt:  VLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKG---------------------------------TEFVQHYGFIVAPLT

Query:  QLLRKGGFQWT--EESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLL
        + L+K     T   E   AF++L+  +   P+L +PDF   F + TDAS   +G VL Q   P++Y S TL   +      E+EL+A++   + +R YLL
Subjt:  QLLRKGGFQWT--EESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLL

Query:  ERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALS
         R F + +D + L +L + +    +  +W  KL+ + F++ Y    EN  ADALS
Subjt:  ERKFIVKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALS

P20825 Retrovirus-related Pol polyprotein from transposon 2975.9e-6834.08Show/hide
Query:  LLSKFNDV-FDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDS-----WRFCIDYRELNN
        LL+KF ++ +   E+L     I+H +       P+  + Y      + E+E  V EML  G+IR   SPY+SP  +V KK D+     +R  IDYR+LN 
Subjt:  LLSKFNDV-FDWLEELPPKRNIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDS-----WRFCIDYRELNN

Query:  VTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSK
        +TIPD++PI  +DE+  +L     F+ IDL  G+HQI M    + KT F T  GHYE+L MPFGL NAP+TFQ  MN+I +P +++   V+ DD+++ S 
Subjt:  VTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSK

Query:  GVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKG---------------------------------TEFVQHYGFIVAPLTQLLRKG
         + EH   +++V   L +  L + L KC F K    +LGH ++  G                                  +F+ +Y  I  P+T  L+K 
Subjt:  GVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKG---------------------------------TEFVQHYGFIVAPLTQLLRKG

Query:  GFQWTE--ESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIV
            T+  E  EAF++L+  ++  P+L LPDF   F + TDAS   +G VL Q   PI++ S TL   +      E+EL+A++   + +R YLL R+F++
Subjt:  GFQWTE--ESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIV

Query:  KTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALS
         +D + L++L   +    + ++W  +L+ Y F++ Y    EN  ADALS
Subjt:  KTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.5e-7429.2Show/hide
Query:  ELPPKR------NIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSWRFCIDYRELNNVTIPDKFPILVID
        +LPP+        ++H I +K G     ++PYH   + ++E+ ++V ++L    I P  SP SSPV+LV KKD ++R C+DYR LN  TI D FP+  ID
Subjt:  ELPPKR------NIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSWRFCIDYRELNNVTIPDKFPILVID

Query:  ELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVL
         L + +  A +F+ +DL  GYHQI M P+   KT F T  G YE+ VMPFGL NAPSTF   M   F+    RFV+V+ DD+L+ S+  EEH +HL+ VL
Subjt:  ELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVL

Query:  EILRENELYVNLAKCCFAKGIIGYLGHFISEKGTEFVQH--------------------YGFI------------VAPLTQLLRKGGFQWTEESKEAFQQ
        E L+   L V   KC FA     +LG+ I  +    +QH                     G I            +A   QL      QWTE+  +A ++
Subjt:  EILRENELYVNLAKCCFAKGIIGYLGHFISEKGTEFVQH--------------------YGFI------------VAPLTQLLRKGGFQWTEESKEAFQQ

Query:  LQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRP------IAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIVKTDQKSLKFL
        L+ A+   PVLV  +  A + + TDAS+  +G VL +          + YFS +L    +  P  E EL+ +I  +  +R  L  + F ++TD  SL  L
Subjt:  LQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRP------IAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIVKTDQKSLKFL

Query:  LQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSI-------------TAPTLVDILVIKKEVEADEKLSKIMEELQAMEEGTE-
          +     + Q+W+  L  Y F + Y    +N  ADA+S    T+   +              + P    +L+  KE+       + M   ++ ++  E 
Subjt:  LQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSI-------------TAPTLVDILVIKKEVEADEKLSKIMEELQAMEEGTE-

Query:  -----GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDSN----------NSLALSPT-------------------------------RLLLSLDSPN
               +S+   M+ Y+D+ V+         ++  YHD                +SP                                 LL  L    
Subjt:  -----GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDSN----------NSLALSPT-------------------------------RLLLSLDSPN

Query:  NIWSDISMDFIEGLPRSNDSANIFV-----------------------------KEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNRSTAYHPQ
          W DISMDF+ GLP ++++ N+ +                             + I   HGFP++I SDRD    ++ +QE+ K  G +   S+A HPQ
Subjt:  NIWSDISMDFIEGLPRSNDSANIFV-----------------------------KEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNRSTAYHPQ

Query:  SDGKTERSV
        +DG++ER++
Subjt:  SDGKTERSV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus5.5e-6631.83Show/hide
Query:  EVPATVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGV-----DPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKK----
        E P     +++ LL +F  +F+     PP   +     +K  +     DP+  + Y YP   + E+ER +DE+L+ GIIRP  SPY+SP+ +V KK    
Subjt:  EVPATVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGV-----DPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKK----

Query:  -DDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYM
         +  +R  +D++ LN VTIPD +PI  I+     L  A  F+ +DL  G+HQI M    + KT F T  G YEFL +PFGL NAP+ FQ +++ I + ++
Subjt:  -DDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYM

Query:  SRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKG---------------------------------TEFVQ
         +   V+ DD++V S+  + H ++L +VL  L +  L VNL K  F    + +LG+ ++  G                                  +F+Q
Subjt:  SRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKG---------------------------------TEFVQ

Query:  HYGFIVAPLTQLLR------------KGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQ----AKRPIAYFSHTLALRDR
         Y  +  PLT L R            K      E + ++F  L++ + +  +L  P F   F + TDAS + +G VL Q      RPIAY S +L   + 
Subjt:  HYGFIVAPLTQLLR------------KGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQ----AKRPIAYFSHTLALRDR

Query:  AKPIYERELMAVILPVQRWRPYLLERKFI-VKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVN
             E+E++A+I  +   R YL     I V TD + L F L  R    + ++W A++  Y+ E++YKP   N  ADALS +PP +N
Subjt:  AKPIYERELMAVILPVQRWRPYLLERKFI-VKTDQKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVN

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.5e-7529.2Show/hide
Query:  ELPPKR------NIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSWRFCIDYRELNNVTIPDKFPILVID
        +LPP+        ++H I +K G     ++PYH   + ++E+ ++V ++L    I P  SP SSPV+LV KKD ++R C+DYR LN  TI D FP+  ID
Subjt:  ELPPKR------NIEHHIPLKKGVDPVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSWRFCIDYRELNNVTIPDKFPILVID

Query:  ELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVL
         L + +  A +F+ +DL  GYHQI M P+   KT F T  G YE+ VMPFGL NAPSTF   M   F+    RFV+V+ DD+L+ S+  EEH +HL+ VL
Subjt:  ELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTTFRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVL

Query:  EILRENELYVNLAKCCFAKGIIGYLGHFISEKGTEFVQH--------------------YGFI------------VAPLTQLLRKGGFQWTEESKEAFQQ
        E L+   L V   KC FA     +LG+ I  +    +QH                     G I            +A   QL      QWTE+  +A  +
Subjt:  EILRENELYVNLAKCCFAKGIIGYLGHFISEKGTEFVQH--------------------YGFI------------VAPLTQLLRKGGFQWTEESKEAFQQ

Query:  LQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRP------IAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIVKTDQKSLKFL
        L++A+   PVLV  +  A + + TDAS+  +G VL +          + YFS +L    +  P  E EL+ +I  +  +R  L  + F ++TD  SL  L
Subjt:  LQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRP------IAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIVKTDQKSLKFL

Query:  LQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSI-------------TAPTLVDILVIKKEVEADEKLSKIMEELQAMEEGTE-
          +     + Q+W+  L  Y F + Y    +N  ADA+S    T+   +              + P    +L+  KE+       + M   ++ ++  E 
Subjt:  LQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSI-------------TAPTLVDILVIKKEVEADEKLSKIMEELQAMEEGTE-

Query:  -----GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDSN----------NSLALSPT-------------------------------RLLLSLDSPN
               +S+   M+ Y+D+ V+         ++  YHD                +SP                                 LL  L    
Subjt:  -----GKFSIRQGMLRYKDKFVLSKTSTLIPTILHTYHDSN----------NSLALSPT-------------------------------RLLLSLDSPN

Query:  NIWSDISMDFIEGLPRSNDSANIFV-----------------------------KEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNRSTAYHPQ
          W DISMDF+ GLP ++++ N+ +                             + I   HGFP++I SDRD    ++ +QE+ K  G +   S+A HPQ
Subjt:  NIWSDISMDFIEGLPRSNDSANIFV-----------------------------KEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNRSTAYHPQ

Query:  SDGKTERSV
        +DG++ER++
Subjt:  SDGKTERSV

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein5.6e-0551.28Show/hide
Query:  QKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSW
        ++  ++  + EML+  II+P  SPYSSPVLLV+KKD  W
Subjt:  QKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSW

ATMG00860.1 DNA/RNA polymerases superfamily protein8.6e-1435.66Show/hide
Query:  HLEVVLEILRENELYVNLAKCCFAKGIIGYLG--HFISEKGT---------------------------------EFVQHYGFIVAPLTQLLRKGGFQWT
        HL +VL+I  +++ Y N  KC F +  I YLG  H IS +G                                   FV++YG IV PLT+LL+K   +WT
Subjt:  HLEVVLEILRENELYVNLAKCCFAKGIIGYLG--HFISEKGT---------------------------------EFVQHYGFIVAPLTQLLRKGGFQWT

Query:  EESKEAFQQLQNAMMTLPVLVLPDFNATF
        E +  AF+ L+ A+ TLPVL LPD    F
Subjt:  EESKEAFQQLQNAMMTLPVLVLPDFNATF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTTGCGGTCAAAGGAAAAGGAATTTGCAGTAATGTTGAAGTCACGGTAGGAGAGTGGAAAATTACAAACAGCTTTTTACCCTTGGAGTTAGGAGGGGTTAATGTAATTTT
GGGTATGCAGTGGTTGCACTCATTGGGCACGATTGAATTTGATTGGAAAAATTTAGGAAGGCTAGCTGAGGAAGGCTTAGAGGATGAAGAAGTGCCAGCCACTGTTGACA
ACATGATCTCTCCTCTGTTGAGCAAATTTAATGATGTTTTTGATTGGCTGGAAGAGCTGCCTCCGAAGAGGAACATTGAACACCATATACCCCTTAAGAAGGGGGTGGAT
CCAGTTAATGTCAGACCTTATCATTACCCACATCAGCAAAAGGAGGAAATGGAAAGACTAGTTGATGAAATGTTGAAGACAGGCATCATAAGACCGGGTACCAGTCCGTA
TTCCAGCCCAGTCCTTCTCGTGAAAAAGAAAGACGATAGCTGGCGATTTTGTATAGATTATCGGGAGTTGAACAATGTCACTATCCCAGATAAGTTTCCAATTCTGGTCA
TAGATGAGTTATTCAATGAGCTGAACGGGGCCAACATGTTTTCGAAGATTGATTTAAAGGTGGGTTATCACCAGATCAGGATGCATCCGAGAGGTGTAGAGAAGACGACC
TTTCGAACACATGAGGGACACTATGAGTTCTTGGTCATGCCGTTTGGGCTAACCAATGCCCCCTCTACGTTCCAAGCATTAATGAACCACATCTTTAAGCCTTATATGAG
TAGATTCGTATCCGTGTTTTTCGATGACGTTTTGGTGCACAGCAAGGGAGTAGAAGAACATGGACAACATCTGGAAGTAGTCTTGGAGATCCTTCGGGAGAATGAATTGT
ATGTGAATTTAGCCAAATGTTGCTTTGCCAAAGGTATAATAGGTTATTTGGGGCATTTCATATCCGAAAAGGGAACTGAATTCGTTCAGCACTATGGCTTCATTGTCGCA
CCATTGACTCAACTACTTAGGAAAGGGGGATTCCAGTGGACTGAGGAGTCTAAGGAGGCATTCCAACAACTACAGAATGCCATGATGACCCTGCCCGTTTTGGTGTTGCC
TGATTTCAATGCCACCTTTGAAATGGAAACAGATGCATCGAGATATAGGGTAGGAGTCGTACTCGTCCAAGCCAAGCGCCCCATAGCATACTTTAGCCATACGTTAGCCT
TAAGAGATAGGGCCAAGCCTATATATGAAAGGGAATTAATGGCAGTAATCTTGCCTGTGCAACGATGGCGACCATATCTGTTGGAAAGAAAATTTATTGTGAAGACCGAT
CAGAAATCCTTGAAGTTCTTATTACAACAAAGAGTGATACAGCTTCAGTATCAGAAATGGGTAGCGAAGCTAACGGGATACTCATTTGAGGTGGTCTACAAACCCAACTT
GGAGAATAAAGCAGCAGACGCCTTGTCTTGGATGCCTCCTACAGTTAACCTCGGTAGCATAACAGCTCCTACATTGGTTGACATTCTGGTGATTAAAAAGGAAGTCGAAG
CGGATGAAAAGTTGAGCAAAATAATGGAAGAACTACAAGCAATGGAAGAAGGAACAGAGGGTAAATTCTCCATCCGACAAGGTATGTTAAGATATAAGGATAAGTTTGTG
CTGTCCAAAACATCTACATTGATACCCACAATCCTACATACTTACCATGATTCGAACAACTCCCTAGCACTATCCCCCACCCGACTGCTGCTGTCCCTGGACAGTCCAAA
CAATATTTGGAGCGATATATCTATGGACTTTATCGAGGGCTTACCAAGATCGAATGATTCAGCAAATATATTTGTTAAGGAGATTGTCCGACTTCATGGCTTTCCCAAAT
CGATAGTGTCCGACCGAGATAAGGTATTCTTGAGCAACTTTTGGCAAGAAATGTTCAAAATGGCAGGCACCCGATTAAATCGAAGCACCGCTTATCACCCTCAGTCGGAT
GGGAAAACTGAGAGATCCGTAGGGGTCACTCCATTTCAAGCTATCTATGGACGTATGCCACCACCCTTGATGTTTTATGGTGACCGGGACACTCCAAACACTACCTTAAA
CGAATAG
mRNA sequenceShow/hide mRNA sequence
GGGTTGCGGTCAAAGGAAAAGGAATTTGCAGTAATGTTGAAGTCACGGTAGGAGAGTGGAAAATTACAAACAGCTTTTTACCCTTGGAGTTAGGAGGGGTTAATGTAATT
TTGGGTATGCAGTGGTTGCACTCATTGGGCACGATTGAATTTGATTGGAAAAATTTAGGAAGGCTAGCTGAGGAAGGCTTAGAGGATGAAGAAGTGCCAGCCACTGTTGA
CAACATGATCTCTCCTCTGTTGAGCAAATTTAATGATGTTTTTGATTGGCTGGAAGAGCTGCCTCCGAAGAGGAACATTGAACACCATATACCCCTTAAGAAGGGGGTGG
ATCCAGTTAATGTCAGACCTTATCATTACCCACATCAGCAAAAGGAGGAAATGGAAAGACTAGTTGATGAAATGTTGAAGACAGGCATCATAAGACCGGGTACCAGTCCG
TATTCCAGCCCAGTCCTTCTCGTGAAAAAGAAAGACGATAGCTGGCGATTTTGTATAGATTATCGGGAGTTGAACAATGTCACTATCCCAGATAAGTTTCCAATTCTGGT
CATAGATGAGTTATTCAATGAGCTGAACGGGGCCAACATGTTTTCGAAGATTGATTTAAAGGTGGGTTATCACCAGATCAGGATGCATCCGAGAGGTGTAGAGAAGACGA
CCTTTCGAACACATGAGGGACACTATGAGTTCTTGGTCATGCCGTTTGGGCTAACCAATGCCCCCTCTACGTTCCAAGCATTAATGAACCACATCTTTAAGCCTTATATG
AGTAGATTCGTATCCGTGTTTTTCGATGACGTTTTGGTGCACAGCAAGGGAGTAGAAGAACATGGACAACATCTGGAAGTAGTCTTGGAGATCCTTCGGGAGAATGAATT
GTATGTGAATTTAGCCAAATGTTGCTTTGCCAAAGGTATAATAGGTTATTTGGGGCATTTCATATCCGAAAAGGGAACTGAATTCGTTCAGCACTATGGCTTCATTGTCG
CACCATTGACTCAACTACTTAGGAAAGGGGGATTCCAGTGGACTGAGGAGTCTAAGGAGGCATTCCAACAACTACAGAATGCCATGATGACCCTGCCCGTTTTGGTGTTG
CCTGATTTCAATGCCACCTTTGAAATGGAAACAGATGCATCGAGATATAGGGTAGGAGTCGTACTCGTCCAAGCCAAGCGCCCCATAGCATACTTTAGCCATACGTTAGC
CTTAAGAGATAGGGCCAAGCCTATATATGAAAGGGAATTAATGGCAGTAATCTTGCCTGTGCAACGATGGCGACCATATCTGTTGGAAAGAAAATTTATTGTGAAGACCG
ATCAGAAATCCTTGAAGTTCTTATTACAACAAAGAGTGATACAGCTTCAGTATCAGAAATGGGTAGCGAAGCTAACGGGATACTCATTTGAGGTGGTCTACAAACCCAAC
TTGGAGAATAAAGCAGCAGACGCCTTGTCTTGGATGCCTCCTACAGTTAACCTCGGTAGCATAACAGCTCCTACATTGGTTGACATTCTGGTGATTAAAAAGGAAGTCGA
AGCGGATGAAAAGTTGAGCAAAATAATGGAAGAACTACAAGCAATGGAAGAAGGAACAGAGGGTAAATTCTCCATCCGACAAGGTATGTTAAGATATAAGGATAAGTTTG
TGCTGTCCAAAACATCTACATTGATACCCACAATCCTACATACTTACCATGATTCGAACAACTCCCTAGCACTATCCCCCACCCGACTGCTGCTGTCCCTGGACAGTCCA
AACAATATTTGGAGCGATATATCTATGGACTTTATCGAGGGCTTACCAAGATCGAATGATTCAGCAAATATATTTGTTAAGGAGATTGTCCGACTTCATGGCTTTCCCAA
ATCGATAGTGTCCGACCGAGATAAGGTATTCTTGAGCAACTTTTGGCAAGAAATGTTCAAAATGGCAGGCACCCGATTAAATCGAAGCACCGCTTATCACCCTCAGTCGG
ATGGGAAAACTGAGAGATCCGTAGGGGTCACTCCATTTCAAGCTATCTATGGACGTATGCCACCACCCTTGATGTTTTATGGTGACCGGGACACTCCAAACACTACCTTA
AACGAATAG
Protein sequenceShow/hide protein sequence
VAVKGKGICSNVEVTVGEWKITNSFLPLELGGVNVILGMQWLHSLGTIEFDWKNLGRLAEEGLEDEEVPATVDNMISPLLSKFNDVFDWLEELPPKRNIEHHIPLKKGVD
PVNVRPYHYPHQQKEEMERLVDEMLKTGIIRPGTSPYSSPVLLVKKKDDSWRFCIDYRELNNVTIPDKFPILVIDELFNELNGANMFSKIDLKVGYHQIRMHPRGVEKTT
FRTHEGHYEFLVMPFGLTNAPSTFQALMNHIFKPYMSRFVSVFFDDVLVHSKGVEEHGQHLEVVLEILRENELYVNLAKCCFAKGIIGYLGHFISEKGTEFVQHYGFIVA
PLTQLLRKGGFQWTEESKEAFQQLQNAMMTLPVLVLPDFNATFEMETDASRYRVGVVLVQAKRPIAYFSHTLALRDRAKPIYERELMAVILPVQRWRPYLLERKFIVKTD
QKSLKFLLQQRVIQLQYQKWVAKLTGYSFEVVYKPNLENKAADALSWMPPTVNLGSITAPTLVDILVIKKEVEADEKLSKIMEELQAMEEGTEGKFSIRQGMLRYKDKFV
LSKTSTLIPTILHTYHDSNNSLALSPTRLLLSLDSPNNIWSDISMDFIEGLPRSNDSANIFVKEIVRLHGFPKSIVSDRDKVFLSNFWQEMFKMAGTRLNRSTAYHPQSD
GKTERSVGVTPFQAIYGRMPPPLMFYGDRDTPNTTLNE