CuGenDBv2

Pay0014058 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0014058
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr07:14379326..14383371
RNA-Seq Expression: Pay0014058
Synteny: Pay0014058
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0016740 - transferase activity (molecular function)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR005162 - Retrotransposon gag domain
  IPR043502 - DNA/RNA polymerase superfamily
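
The genome location in the record above uses a `seqid:start..end` layout. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. The helper names (`Location`, `parse_location`) are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings such as "chr07:14379326..14383371".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("chr07:14379326..14383371")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for Pay0014058 works out to 14383371 - 14379326 + 1 = 4046 bp.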


Homology

GenBank top hits

KAA0048442.1 pol protein [Cucumis melo var. makuwa] (e value: 2.2e-229; % identity: 68.47)
Query:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN
        ARRGGG+GGRGAGR Q E  P   A +P AP                              QA+P QAQ V PP P EAQP PVQLS +AKHLRDFRKYN
Subjt:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN

Query:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT
        PKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG DV+KITWEQFK++F AKFFSANVK+ K QEFLNLEQGDMT
Subjt:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT

Query:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI
        VEQYDAEFDMLSRFA D+++DE ARTEK +RGLR DLQGIVRALRP THA+ LR+ALDLSL ERA  SKA GRGSALGQK KVE+QPD+TPQR LRSGG+
Subjt:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI

Query:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------
        FQRHRR+LAAAGRTL+ELP C +CGRVHGGRCLAGSGVCFRCRQ GHTAD CP K  ETTP QP AS+QGRVFATTRQEA+RAG VVT            
Subjt:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------

Query:  -------------------------------------------------------------------------GIVCIPKVISAMKAGKLLSQGTWSILA
                                                                                 G+VCIPKVISAMKA KLLSQGTW ILA
Subjt:  -------------------------------------------------------------------------GIVCIPKVISAMKAGKLLSQGTWSILA

Query:  SVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSM
        SV+D RE EVSLSS+PVVREYPDVFPDELPGLP PRE+DFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW AP+LFVK KDGSM
Subjt:  SVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSM

Query:  RLCIDYRELNKVTVKNRYPLPMIDDLFD
        RLCIDYRELNKVTVKNRYPLP IDDLFD
Subjt:  RLCIDYRELNKVTVKNRYPLPMIDDLFD

KAA0053272.1 gag protease polyprotein [Cucumis melo var. makuwa] (e value: 6.7e-226; % identity: 64.71)
Query:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN
        ARRGGG+GGRGAGR Q E  P   A +P AP                              QA+P QAQ V PP P EAQP PVQLS +AKHLRDFRKYN
Subjt:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN

Query:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT
        PKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG DVSKITWEQFK++F AKFFSANVK+ K QEFLNLEQGDMT
Subjt:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT

Query:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI
        VEQYDAEFDMLSRFA D+++DE ARTEK +RGLR DLQGIVRALRP THA+ LR+ALDLSL ERA  SKA GRGSALGQK KVE+QPD+ PQR LRSGG+
Subjt:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI

Query:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------
        FQRHR++LAAAGRTL+ELP C +CGRVHGGRCLAGSGVCFRCRQPGHTAD CP K  ETTP QP A++QGRVFATTRQEA+RAG VVT            
Subjt:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE
                G+VCIPKVISAMKA KLLSQGTW ILASV+D RE EVSLSS+PVVREYPDVFPDELPGLPPPRE+DFAIELE  TAPISRAPYRMAPAELKE
Subjt:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE

Query:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD
         KVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP I+DLFD
Subjt:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD

KAA0053574.1 pol protein [Cucumis melo var. makuwa] (e value: 2.7e-227; % identity: 78.88)
Query:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD
        QA+  QAQ V PP PM +QP PVQLS +AKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG D
Subjt:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD

Query:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL
        VSKITWEQFKD+F AKFFSANVK+ K QEFLNLEQGDMTVEQYDAEFDMLSRFA DV++DE ARTEK IR LR DLQGIVRALRP THA+ LR+ALDLSL
Subjt:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL

Query:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP
        +ERA  SKA+GRG ALGQK KVE+QPD+ PQR LRSGG+FQRHRR+LAAAGRTL+ LP C +CGRVHGGRCLAGSGVCFRCRQPGHTADACP K IETTP
Subjt:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP

Query:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL
        HQP AS+QG  FATTRQEA+RAG VVT                                  GIVC+PKVISAMKA KLLSQGTW +LA+V+D RE EVSL
Subjt:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL

Query:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV
        SS+PVVREYPDVFP+EL GLPPPREIDFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNK+
Subjt:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV

Query:  TVKNRYPLPMIDDLFD
        TVKN YPLP IDDLFD
Subjt:  TVKNRYPLPMIDDLFD

KAA0066101.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 8.8e-226; % identity: 64.71)
Query:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN
        ARRGGG+GGRGAGR QSEEQP V   +P AP                              QA+P QAQ V PP P+EAQP PVQLS +AKHLRDFRKYN
Subjt:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN

Query:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT
        PKTFDGSMDNPT AQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRG+AWWETAERMLG DVSKITWEQFK++F AKFFS NVK+ K QEFLNLEQGDMT
Subjt:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT

Query:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI
        VEQYDAEFDMLSRFA DV++DE ARTEK +RGLR DLQGIVRALRP THA+ LR+ALDLSLHERA  SKA+ RG ALGQK KVE+QPD+ PQ+ LR GG+
Subjt:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI

Query:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------
        FQRHRR+LAAAGRTL+ELP C +CGRVHGGRCLAGSGVCFRCRQ GHTADACP K  ETTPHQP AS+QGRVF TTRQEA+RAG VVT            
Subjt:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE
                GIVCIPKVISAMKA KLL+QGTW ILASV+DTRE EVSLSS+PVVREYPDVFPDEL GLPPPREI FAIELE  TAPISRAPYRMAPAELKE
Subjt:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE

Query:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD
         KVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDDLFD
Subjt:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD

TYK19164.1 pol protein [Cucumis melo var. makuwa] (e value: 2.7e-227; % identity: 78.88)
Query:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD
        QA+  QAQ V PP PM +QP PVQLS +AKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG D
Subjt:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD

Query:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL
        VSKITWEQFKD+F AKFFSANVK+ K QEFLNLEQGDMTVEQYDAEFDMLSRFA DV++DE ARTEK IR LR DLQGIVRALRP THA+ LR+ALDLSL
Subjt:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL

Query:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP
        +ERA  SKA+GRG ALGQK KVE+QPD+ PQR LRSGG+FQRHRR+LAAAGRTL+ LP C +CGRVHGGRCLAGSGVCFRCRQPGHTADACP K IETTP
Subjt:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP

Query:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL
        HQP AS+QG  FATTRQEA+RAG VVT                                  GIVC+PKVISAMKA KLLSQGTW +LA+V+D RE EVSL
Subjt:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL

Query:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV
        SS+PVVREYPDVFP+EL GLPPPREIDFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNK+
Subjt:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV

Query:  TVKNRYPLPMIDDLFD
        TVKN YPLP IDDLFD
Subjt:  TVKNRYPLPMIDDLFD

TrEMBL top hits

A0A5A7TY28 Reverse transcriptase (e value: 1.1e-229; % identity: 68.47)
Query:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN
        ARRGGG+GGRGAGR Q E  P   A +P AP                              QA+P QAQ V PP P EAQP PVQLS +AKHLRDFRKYN
Subjt:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN

Query:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT
        PKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG DV+KITWEQFK++F AKFFSANVK+ K QEFLNLEQGDMT
Subjt:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT

Query:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI
        VEQYDAEFDMLSRFA D+++DE ARTEK +RGLR DLQGIVRALRP THA+ LR+ALDLSL ERA  SKA GRGSALGQK KVE+QPD+TPQR LRSGG+
Subjt:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI

Query:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------
        FQRHRR+LAAAGRTL+ELP C +CGRVHGGRCLAGSGVCFRCRQ GHTAD CP K  ETTP QP AS+QGRVFATTRQEA+RAG VVT            
Subjt:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------

Query:  -------------------------------------------------------------------------GIVCIPKVISAMKAGKLLSQGTWSILA
                                                                                 G+VCIPKVISAMKA KLLSQGTW ILA
Subjt:  -------------------------------------------------------------------------GIVCIPKVISAMKAGKLLSQGTWSILA

Query:  SVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSM
        SV+D RE EVSLSS+PVVREYPDVFPDELPGLP PRE+DFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW AP+LFVK KDGSM
Subjt:  SVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSM

Query:  RLCIDYRELNKVTVKNRYPLPMIDDLFD
        RLCIDYRELNKVTVKNRYPLP IDDLFD
Subjt:  RLCIDYRELNKVTVKNRYPLPMIDDLFD

A0A5A7UC03 Gag protease polyprotein (e value: 3.3e-226; % identity: 64.71)
Query:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN
        ARRGGG+GGRGAGR Q E  P   A +P AP                              QA+P QAQ V PP P EAQP PVQLS +AKHLRDFRKYN
Subjt:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN

Query:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT
        PKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG DVSKITWEQFK++F AKFFSANVK+ K QEFLNLEQGDMT
Subjt:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT

Query:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI
        VEQYDAEFDMLSRFA D+++DE ARTEK +RGLR DLQGIVRALRP THA+ LR+ALDLSL ERA  SKA GRGSALGQK KVE+QPD+ PQR LRSGG+
Subjt:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI

Query:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------
        FQRHR++LAAAGRTL+ELP C +CGRVHGGRCLAGSGVCFRCRQPGHTAD CP K  ETTP QP A++QGRVFATTRQEA+RAG VVT            
Subjt:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE
                G+VCIPKVISAMKA KLLSQGTW ILASV+D RE EVSLSS+PVVREYPDVFPDELPGLPPPRE+DFAIELE  TAPISRAPYRMAPAELKE
Subjt:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE

Query:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD
         KVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP I+DLFD
Subjt:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD

A0A5A7UED4 Pol protein (e value: 1.3e-227; % identity: 78.88)
Query:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD
        QA+  QAQ V PP PM +QP PVQLS +AKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG D
Subjt:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD

Query:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL
        VSKITWEQFKD+F AKFFSANVK+ K QEFLNLEQGDMTVEQYDAEFDMLSRFA DV++DE ARTEK IR LR DLQGIVRALRP THA+ LR+ALDLSL
Subjt:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL

Query:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP
        +ERA  SKA+GRG ALGQK KVE+QPD+ PQR LRSGG+FQRHRR+LAAAGRTL+ LP C +CGRVHGGRCLAGSGVCFRCRQPGHTADACP K IETTP
Subjt:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP

Query:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL
        HQP AS+QG  FATTRQEA+RAG VVT                                  GIVC+PKVISAMKA KLLSQGTW +LA+V+D RE EVSL
Subjt:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL

Query:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV
        SS+PVVREYPDVFP+EL GLPPPREIDFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNK+
Subjt:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV

Query:  TVKNRYPLPMIDDLFD
        TVKN YPLP IDDLFD
Subjt:  TVKNRYPLPMIDDLFD

A0A5A7VK60 Reverse transcriptase (e value: 4.3e-226; % identity: 64.71)
Query:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN
        ARRGGG+GGRGAGR QSEEQP V   +P AP                              QA+P QAQ V PP P+EAQP PVQLS +AKHLRDFRKYN
Subjt:  ARRGGGKGGRGAGRTQSEEQPTVQAANPPAP-----------------------------TQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYN

Query:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT
        PKTFDGSMDNPT AQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRG+AWWETAERMLG DVSKITWEQFK++F AKFFS NVK+ K QEFLNLEQGDMT
Subjt:  PKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMT

Query:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI
        VEQYDAEFDMLSRFA DV++DE ARTEK +RGLR DLQGIVRALRP THA+ LR+ALDLSLHERA  SKA+ RG ALGQK KVE+QPD+ PQ+ LR GG+
Subjt:  VEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGI

Query:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------
        FQRHRR+LAAAGRTL+ELP C +CGRVHGGRCLAGSGVCFRCRQ GHTADACP K  ETTPHQP AS+QGRVF TTRQEA+RAG VVT            
Subjt:  FQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTPHQPPASKQGRVFATTRQEAKRAGNVVT------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE
                GIVCIPKVISAMKA KLL+QGTW ILASV+DTRE EVSLSS+PVVREYPDVFPDEL GLPPPREI FAIELE  TAPISRAPYRMAPAELKE
Subjt:  --------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKE

Query:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD
         KVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDDLFD
Subjt:  LKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD

A0A5D3D6H0 Pol protein (e value: 1.3e-227; % identity: 78.88)
Query:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD
        QA+  QAQ V PP PM +QP PVQLS +AKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIE IFRY+KCP+DQKVQC VFFLEDRGTAWWETAERMLG D
Subjt:  QASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRGTAWWETAERMLGRD

Query:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL
        VSKITWEQFKD+F AKFFSANVK+ K QEFLNLEQGDMTVEQYDAEFDMLSRFA DV++DE ARTEK IR LR DLQGIVRALRP THA+ LR+ALDLSL
Subjt:  VSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHANDLRLALDLSL

Query:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP
        +ERA  SKA+GRG ALGQK KVE+QPD+ PQR LRSGG+FQRHRR+LAAAGRTL+ LP C +CGRVHGGRCLAGSGVCFRCRQPGHTADACP K IETTP
Subjt:  HERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLIETTP

Query:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL
        HQP AS+QG  FATTRQEA+RAG VVT                                  GIVC+PKVISAMKA KLLSQGTW +LA+V+D RE EVSL
Subjt:  HQPPASKQGRVFATTRQEAKRAGNVVT----------------------------------GIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSL

Query:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV
        SS+PVVREYPDVFP+EL GLPPPREIDFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW AP+LFVKKKDGSMRLCIDYRELNK+
Subjt:  SSKPVVREYPDVFPDELPGLPPPREIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKV

Query:  TVKNRYPLPMIDDLFD
        TVKN YPLP IDDLFD
Subjt:  TVKNRYPLPMIDDLFD

SwissProt top hits

P0CT41 Transposon Tf2-12 polyprotein (e value: 5.7e-10; % identity: 30.36)
Query:  VVREYPDVFPD-ELPGLPPP-REIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTV
        + +E+ D+  +     LP P + ++F +EL  +   +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK   
Subjt:  VVREYPDVFPD-ELPGLPPP-REIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPMIDDL
         N YPLP+I+ L
Subjt:  KNRYPLPMIDDL

P10394 Retrovirus-related Pol polyprotein from transposon 412 (e value: 1.2e-10; % identity: 30)
Query:  REYPDVFPDELPGLPPPREIDFAIELE--------------SDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDG------
        + +P++F  +L  +       FA+E E               D  P+    YR   ++++E++ Q+Q+L+    + PSVS + +P+L V KK        
Subjt:  REYPDVFPDELPGLPPPREIDFAIELE--------------SDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDG------

Query:  SMRLCIDYRELNKVTVKNRYPLPMIDDLFD
          RL IDYR++NK  + +++PLP IDD+ D
Subjt:  SMRLCIDYRELNKVTVKNRYPLPMIDDLFD

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 1.1e-13; % identity: 38.05)
Query:  REYPDVFPDELPGLPPPREI-DFAIELESDTAPISR----APYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVT
        ++Y ++  ++LP  P P +I +  ++ + +  P +R     PY +     +E+   +Q+LLD  FI PS SP  +P++ V KKDG+ RLC+DYR LNK T
Subjt:  REYPDVFPDELPGLPPPREI-DFAIELESDTAPISR----APYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVT

Query:  VKNRYPLPMIDDL
        + + +PLP ID+L
Subjt:  VKNRYPLPMIDDL

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 1.1e-13; % identity: 38.05)
Query:  REYPDVFPDELPGLPPPREI-DFAIELESDTAPISR----APYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVT
        ++Y ++  ++LP  P P +I +  ++ + +  P +R     PY +     +E+   +Q+LLD  FI PS SP  +P++ V KKDG+ RLC+DYR LNK T
Subjt:  REYPDVFPDELPGLPPPREI-DFAIELESDTAPISR----APYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVT

Query:  VKNRYPLPMIDDL
        + + +PLP ID+L
Subjt:  VKNRYPLPMIDDL

Q9UR07 Transposon Tf2-11 polyprotein (e value: 5.7e-10; % identity: 30.36)
Query:  VVREYPDVFPD-ELPGLPPP-REIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTV
        + +E+ D+  +     LP P + ++F +EL  +   +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK   
Subjt:  VVREYPDVFPD-ELPGLPPP-REIDFAIELESDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPMIDDL
         N YPLP+I+ L
Subjt:  KNRYPLPMIDDL

Arabidopsis top hits
No hits found
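
The alignment blocks in this Homology section follow the BLAST-style three-line layout: a `Query:` line, an unlabeled midline, and a subject line. In that layout the midline repeats the residue letter where the two sequences are identical, shows `+` for a conservative substitution, and is blank elsewhere. The sketch below counts those marks for a single block. The function name `midline_stats` is hypothetical, and the per-block figure it returns will not necessarily match the page's overall % identity, which is computed over the full alignment by the database's own pipeline.

```python
from typing import Dict


def midline_stats(query: str, midline: str) -> Dict[str, float]:
    """Summarise one alignment block from its query row and midline.

    Both strings must be given WITHOUT the 'Query:  ' prefix, so that
    column i of the midline sits under column i of the query.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    # Trailing blanks are often stripped from scraped midlines; restore them.
    midline = midline.ljust(len(query))

    identical = sum(1 for mark in midline if mark.isalpha())
    conservative = sum(1 for mark in midline if mark == "+")
    columns = len(query)
    return {
        "columns": columns,
        "identical": identical,
        "conservative": conservative,
        "percent_identity": 100.0 * identical / columns if columns else 0.0,
    }


if __name__ == "__main__":
    # Final block of the first GenBank hit (KAA0048442.1).
    query = "RLCIDYRELNKVTVKNRYPLPMIDDLFD"
    midline = "RLCIDYRELNKVTVKNRYPLP IDDLFD"
    print(midline_stats(query, midline))
```

Counting columns rather than residues means gap columns (`-`) lower the percentage, which is one common convention but not the only one.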

Sequences

CDS sequence
ATGAATTTTTTATTTTTATTAGACATTAAGAAAATATCTAATGGTGGTGGTGCTTGGGAGTTGAAAGAAAAGGAGAAAACCTTAACCCTATTTCCGCTGCCACCA
CCCCCCTCTTGCAAACTGCCGCCGCCGCCATCAATTCAAGCCGATCGACTTCATCAGGCGCGTTGCCTCTGTCGTTTGAAGTTGTGCCGCCGCCGTAGCCGAACC
GTGCACAAGCGTCAGCCGTCACGTGAAGAACATCACGCACGACGACGCTCCATCAGCCGTCGCCTCCATCTTCAGCCATTTACGGCACGTCCAGTCGTCTGCCTT
GCAGTAGCACCATCGTCGCCGTCTTCGTCGTTGGTCTTCTTCATCAATGAACTGTCACCAGGCGTAACCCGAGTCGACCCTGCAACAGAATCCCGAGTCGTACGA
GCCGCGTGTGTGAGTTGCGCCTCTAGCCGCTCGCAATCCCTAGCTGAGTGGACCAGCCAGTCCCTTCCTCCAGCCGAGCCATCAGTCCTTTATACCACCCAGCCG
AGCCGTCCAGTTATTGTTGAGCCGCCTAGCCTATTTTTGACCTTCTTACCTGTTTTGGTAAGTTGTGCACGTCGAGGGGGTGGTAAGGGAGGTAGAGGAGCTGGA
CGTACCCAATCGGAGGAGCAACCTACTGTACAGGCAGCCAACCCTCCCGCACCTACCCAGGCCTCTCCTGCTCAGGCCCAGATCGTCCCTCCTCCTGTCCCTATG
GAAGCTCAGCCCGCGCCGGTTCAACTGTCAGGGAAGGCCAAACATCTGAGAGACTTTAGGAAGTACAATCCTAAAACATTTGACGGATCCATGGATAACCCCACC
AAGGCCCAGATGTGGTTGACTTCTATAGAGAAGATCTTCAGGTACATAAAATGCCCTGATGACCAGAAGGTTCAGTGCACAGTTTTCTTCTTGGAGGATAGAGGC
ACTGCATGGTGGGAGACTGCTGAAAGGATGCTGGGTAGAGATGTCAGCAAGATAACCTGGGAGCAGTTCAAGGACAGCTTCAATGCAAAGTTCTTTTCTGCCAAC
GTGAAGTACGTGAAGCAGCAAGAGTTCTTGAACTTGGAGCAAGGCGACATGACGGTGGAGCAGTATGACGCTGAGTTTGACATGCTATCCCGCTTTGCCGCTGAT
GTTTTGAAGGATGAGGAGGCCAGGACCGAGAAGTTAATCAGAGGTCTTAGACAAGACCTCCAGGGTATTGTTCGAGCCCTCAGGCCAACAACTCATGCTAATGAT
TTACGCCTGGCACTAGACTTGAGTCTGCATGAGAGAGCTTATCCGTCCAAGGCTATCGGCAGGGGGTCAGCCCTTGGTCAGAAGAGCAAGGTTGAGTCGCAGCCT
GACATGACACCGCAGCGAAATCTAAGGTCAGGAGGTATCTTCCAACGGCATCGTCGGAAGCTTGCAGCAGCCGGGAGAACCTTGAAAGAGCTACCCGTTTGTCCT
AGCTGTGGGAGAGTTCATGGAGGTCGTTGCTTGGCCGGGAGTGGAGTTTGCTTCAGGTGCAGGCAACCAGGGCATACCGCTGATGCTTGTCCTTGGAAACTCATT
GAGACTACCCCACACCAGCCTCCCGCTTCCAAGCAGGGAAGAGTTTTTGCCACTACTCGTCAGGAAGCCAAACGAGCTGGTAATGTGGTGACAGGAATCGTATGT
ATACCCAAGGTCATCTCAGCTATGAAGGCTGGTAAACTACTTAGCCAGGGTACCTGGAGCATCTTGGCAAGCGTATTAGATACCAGAGAACTAGAAGTTTCCTTG
TCCTCCAAACCAGTGGTAAGGGAGTACCCTGATGTATTCCCTGACGAGCTTCCAGGACTTCCACCTCCTAGGGAGATAGACTTCGCCATCGAGTTAGAGTCAGAC
ACTGCTCCTATTTCGAGGGCCCCTTACAGAATGGCTCCAGCTGAGCTAAAGGAGCTGAAGGTGCAATTGCAGGAGTTGCTGGACAAGGGTTTCATTCGACCCAGT
GTGTCACCTTGGAGAGCACCAATGTTGTTCGTGAAGAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTATAGAGAGCTGAACAAGGTGACAGTCAAGAACCGC
TATCCCTTGCCCATGATTGATGATTTGTTCGATTAG
mRNA sequence
ATGAATTTTTTATTTTTATTAGACATTAAGAAAATATCTAATGGTGGTGGTGCTTGGGAGTTGAAAGAAAAGGAGAAAACCTTAACCCTATTTCCGCTGCCACCA
CCCCCCTCTTGCAAACTGCCGCCGCCGCCATCAATTCAAGCCGATCGACTTCATCAGGCGCGTTGCCTCTGTCGTTTGAAGTTGTGCCGCCGCCGTAGCCGAACC
GTGCACAAGCGTCAGCCGTCACGTGAAGAACATCACGCACGACGACGCTCCATCAGCCGTCGCCTCCATCTTCAGCCATTTACGGCACGTCCAGTCGTCTGCCTT
GCAGTAGCACCATCGTCGCCGTCTTCGTCGTTGGTCTTCTTCATCAATGAACTGTCACCAGGCGTAACCCGAGTCGACCCTGCAACAGAATCCCGAGTCGTACGA
GCCGCGTGTGTGAGTTGCGCCTCTAGCCGCTCGCAATCCCTAGCTGAGTGGACCAGCCAGTCCCTTCCTCCAGCCGAGCCATCAGTCCTTTATACCACCCAGCCG
AGCCGTCCAGTTATTGTTGAGCCGCCTAGCCTATTTTTGACCTTCTTACCTGTTTTGGTAAGTTGTGCACGTCGAGGGGGTGGTAAGGGAGGTAGAGGAGCTGGA
CGTACCCAATCGGAGGAGCAACCTACTGTACAGGCAGCCAACCCTCCCGCACCTACCCAGGCCTCTCCTGCTCAGGCCCAGATCGTCCCTCCTCCTGTCCCTATG
GAAGCTCAGCCCGCGCCGGTTCAACTGTCAGGGAAGGCCAAACATCTGAGAGACTTTAGGAAGTACAATCCTAAAACATTTGACGGATCCATGGATAACCCCACC
AAGGCCCAGATGTGGTTGACTTCTATAGAGAAGATCTTCAGGTACATAAAATGCCCTGATGACCAGAAGGTTCAGTGCACAGTTTTCTTCTTGGAGGATAGAGGC
ACTGCATGGTGGGAGACTGCTGAAAGGATGCTGGGTAGAGATGTCAGCAAGATAACCTGGGAGCAGTTCAAGGACAGCTTCAATGCAAAGTTCTTTTCTGCCAAC
GTGAAGTACGTGAAGCAGCAAGAGTTCTTGAACTTGGAGCAAGGCGACATGACGGTGGAGCAGTATGACGCTGAGTTTGACATGCTATCCCGCTTTGCCGCTGAT
GTTTTGAAGGATGAGGAGGCCAGGACCGAGAAGTTAATCAGAGGTCTTAGACAAGACCTCCAGGGTATTGTTCGAGCCCTCAGGCCAACAACTCATGCTAATGAT
TTACGCCTGGCACTAGACTTGAGTCTGCATGAGAGAGCTTATCCGTCCAAGGCTATCGGCAGGGGGTCAGCCCTTGGTCAGAAGAGCAAGGTTGAGTCGCAGCCT
GACATGACACCGCAGCGAAATCTAAGGTCAGGAGGTATCTTCCAACGGCATCGTCGGAAGCTTGCAGCAGCCGGGAGAACCTTGAAAGAGCTACCCGTTTGTCCT
AGCTGTGGGAGAGTTCATGGAGGTCGTTGCTTGGCCGGGAGTGGAGTTTGCTTCAGGTGCAGGCAACCAGGGCATACCGCTGATGCTTGTCCTTGGAAACTCATT
GAGACTACCCCACACCAGCCTCCCGCTTCCAAGCAGGGAAGAGTTTTTGCCACTACTCGTCAGGAAGCCAAACGAGCTGGTAATGTGGTGACAGGAATCGTATGT
ATACCCAAGGTCATCTCAGCTATGAAGGCTGGTAAACTACTTAGCCAGGGTACCTGGAGCATCTTGGCAAGCGTATTAGATACCAGAGAACTAGAAGTTTCCTTG
TCCTCCAAACCAGTGGTAAGGGAGTACCCTGATGTATTCCCTGACGAGCTTCCAGGACTTCCACCTCCTAGGGAGATAGACTTCGCCATCGAGTTAGAGTCAGAC
ACTGCTCCTATTTCGAGGGCCCCTTACAGAATGGCTCCAGCTGAGCTAAAGGAGCTGAAGGTGCAATTGCAGGAGTTGCTGGACAAGGGTTTCATTCGACCCAGT
GTGTCACCTTGGAGAGCACCAATGTTGTTCGTGAAGAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTATAGAGAGCTGAACAAGGTGACAGTCAAGAACCGC
TATCCCTTGCCCATGATTGATGATTTGTTCGATTAG
Protein sequence
MNFLFLLDIKKISNGGGAWELKEKEKTLTLFPLPPPPSCKLPPPPSIQADRLHQARCLCRLKLCRRRSRTVHKRQPSREEHHARRRSISRRLHLQPFTARPVVCL
AVAPSSPSSSLVFFINELSPGVTRVDPATESRVVRAACVSCASSRSQSLAEWTSQSLPPAEPSVLYTTQPSRPVIVEPPSLFLTFLPVLVSCARRGGGKGGRGAG
RTQSEEQPTVQAANPPAPTQASPAQAQIVPPPVPMEAQPAPVQLSGKAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIEKIFRYIKCPDDQKVQCTVFFLEDRG
TAWWETAERMLGRDVSKITWEQFKDSFNAKFFSANVKYVKQQEFLNLEQGDMTVEQYDAEFDMLSRFAADVLKDEEARTEKLIRGLRQDLQGIVRALRPTTHAND
LRLALDLSLHERAYPSKAIGRGSALGQKSKVESQPDMTPQRNLRSGGIFQRHRRKLAAAGRTLKELPVCPSCGRVHGGRCLAGSGVCFRCRQPGHTADACPWKLI
ETTPHQPPASKQGRVFATTRQEAKRAGNVVTGIVCIPKVISAMKAGKLLSQGTWSILASVLDTRELEVSLSSKPVVREYPDVFPDELPGLPPPREIDFAIELESD
TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWRAPMLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPMIDDLFD
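
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the `X` placeholder for unrecognised codons are choices made here, not something the page specifies.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1.

    Stop codons are emitted as '*'; codons containing characters other
    than A/C/G/T are emitted as 'X'. Trailing bases that do not fill a
    complete codon are ignored.
    """
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First eight codons of the CDS listed above.
    first_codons = "ATG AAT TTT TTA TTT TTA TTA GAC"
    print(translate(first_codons))
```

To check the whole record, paste the full CDS into `translate` and compare the result, minus the trailing `*`, with the protein sequence joined into a single line.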