CuGenDBv2

Pay0017237 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0017237
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: SWIM-type domain-containing protein
Genome location: chr09:17762979..17768114
RNA-Seq Expression: Pay0017237
Synteny: Pay0017237
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant
                  IPR018289 - MULE transposase domain
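The genome location field follows a chrom:start..end pattern. A minimal sketch of splitting it into parts is given here; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr09:17762979..17768114")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 5136 bp; with half-open coordinates the figure would be one less.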


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026169.1 uncharacterized protein E6C27_scaffold19G001250 [Cucumis melo var. makuwa] | e value: 1.0e-214 | % identity: 74.86
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        M LRPEDLVILEPGNH DSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STLANRDEFVDT CLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHY+IVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL                             VTMSVLMEMIKQQYGYTV
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KYRRVWQAKRKALVAVFGDWDKSYNEL YWLS VVHYNPGT+VDW FLPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGENASSWS                                                     NFNNKYKSKQLKD VFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTHNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHS
        RKFIRNMKEIKQLNPECLEFFEDIDLQKWT +     NG                             A+ISEAVDRGEVYTEY MKKLKRWET ASAHS
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTHNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHS

Query:  VTSIDRETQTFEVHTGMSMISPYKGQHTQ
        VTSIDRETQTFEVHTGMSMISPYKGQHTQ
Subjt:  VTSIDRETQTFEVHTGMSMISPYKGQHTQ

KAA0044308.1 uncharacterized protein E6C27_scaffold46G00290 [Cucumis melo var. makuwa] | e value: 4.7e-244 | % identity: 80.66
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STL NRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHYEIVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL                             VT+SVLMEMIKQQYGYTV
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KYRRVWQAKRKALVAVFGDWDKSYNELPYWLS VVHYNPGTRVDW FLPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGENASSWS                                                     NFNNKYKSKQLKDLVFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK
        RKFIRNMKEIKQLNPECLEFFEDIDLQKWT              NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MK
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK

Query:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
        KLKRWET ASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
Subjt:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV

KAA0054037.1 uncharacterized protein E6C27_scaffold318G001000 [Cucumis melo var. makuwa] | e value: 1.3e-206 | % identity: 74.71
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDL+ILEP NHGDSDIDVE+DELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STLANRDE VDTSCLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC----LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYN
        QHYEIVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHS      VT+SVLMEMIKQQY                             
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC----LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYN

Query:  ELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWS--
                      GTRVDW FLPSDV GTTIFGR+FWAFGPAIEG KYCRPLIQIDGTHLYGKYKGKMLT LSIDANGHIFPLAFAIVEGENASSWS  
Subjt:  ELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWS--

Query:  --------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDID
                                                           NFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDID
Subjt:  --------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDID

Query:  LQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFE
        LQKWT              NA ECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MKKLKRWET ASAHSVTSIDRETQTFE
Subjt:  LQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFE

Query:  VHTGMSMISPYKGQHTQV
        VHTGMSMISPYKGQHTQV
Subjt:  VHTGMSMISPYKGQHTQV

TYK02543.1 uncharacterized protein E5676_scaffold201G00230 [Cucumis melo var. makuwa] | e value: 1.0e-243 | % identity: 80.48
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STL NRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHYEIVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL                             VT+SVLMEMIKQQYGYTV
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KYRRVWQAKRKALVAVFGDWDKSYNELPYWLS VVHYNPGTRVDW FLPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGENASSWS                                                     NFNNKYKSKQLKDLVFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK
        RKFIRNMKEIKQLNPECLEFFEDIDLQKWT              NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MK
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK

Query:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
        KLK+WET ASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
Subjt:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV

XP_031737534.1 uncharacterized protein LOC116402427 [Cucumis sativus] | e value: 1.5e-194 | % identity: 65.01
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDLVILEPGN GDSDIDVE+DELF                                     +R+  VDTSCLIQKGM+FD KEDL LAVK+YCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHYEIVVVESNQ++ ++RCKQW+NGCNWRLRG++RKSHGLFEI++L+GEHSCL                             +T+SVLME+IKQQYGY V
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KY +VWQAK+KAL+ VFGDW+KSYNELPYWLS VVHYNPGTRVDW FLPSDVPGTTIFGR+FW+FGPAIEGFK+CRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGEN SSWS                                                     NFN KYKSKQLKDLVFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK
        RKFI+ MKE+++LNPECLEFFEDID++KWT              NAAECMNGVFKGARMLP+TSLVRLTFYRTILYFERRRAEISEA+DRG++YT+Y ++
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK

Query:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
        KLK+WE  ASAHSVTSIDRETQTFEVHTGMSM SPYKGQHTQV
Subjt:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
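In the BLAST-style alignments above, the unlabeled row between Query and Subjt is the midline: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A rough sketch of deriving a percent identity from one midline row follows. The helper name `midline_identity` is hypothetical, and the sketch is not claimed to reproduce the % identity values reported on this page, which are presumably computed over the full alignment.

```python
def midline_identity(midline, length=None):
    """Percent of alignment columns whose midline character is a letter.

    `length` is the number of alignment columns; pass it explicitly when
    trailing spaces of the midline may have been stripped.
    """
    columns = len(midline) if length is None else length
    if columns == 0:
        raise ValueError("empty alignment row")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / columns

# First eight columns of the first GenBank alignment:
# Query "MSLRPEDL" against midline "M LRPEDL" (one mismatch).
print(midline_identity("M LRPEDL"))
```

Passing the query row's length as `length` guards against midline rows whose trailing whitespace was lost when the page was copied.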

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BS22 uncharacterized protein LOC103492881 | e value: 9.1e-169 | % identity: 65.73
Query:  EDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVTQHYEI
        EDLVILEPGNHG+SDIDVE+DELFG                                STLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVTQHYEI
Subjt:  EDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVTQHYEI

Query:  VVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTVKYRRV
        VVVESNQNI SVRCKQWSN CNWRLRGSRRKSHGLFEISRLEGEHSCL                             VT+SVLMEMIKQQYGY VKYRRV
Subjt:  VVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTVKYRRV

Query:  WQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGH
        WQAKRKALV VFGDWDKSYN+LPYWLS VVHYNPGTRVDW  LPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDG HLYGKYKGKMLTALSIDANGH
Subjt:  WQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGH

Query:  IFPLAFAIVEGENASSWS----------------------------------------------C------NFNNKYKSKQLKDLVFRAGNQHQRRKFIR
        IFPLAFAIVEGENASSWS                                              C      NFNNKYKSKQLKDLVFRA           
Subjt:  IFPLAFAIVEGENASSWS----------------------------------------------C------NFNNKYKSKQLKDLVFRAGNQHQRRKFIR

Query:  NMKEIKQLNPECLEFFEDIDLQKWTHN-------------AAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKR
                         DIDLQKWT +             AAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MKKLK+
Subjt:  NMKEIKQLNPECLEFFEDIDLQKWTHN-------------AAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKR

A0A5A7SLM1 Uncharacterized protein | e value: 4.9e-215 | % identity: 74.86
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        M LRPEDLVILEPGNH DSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STLANRDEFVDT CLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHY+IVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL                             VTMSVLMEMIKQQYGYTV
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KYRRVWQAKRKALVAVFGDWDKSYNEL YWLS VVHYNPGT+VDW FLPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGENASSWS                                                     NFNNKYKSKQLKD VFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTHNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHS
        RKFIRNMKEIKQLNPECLEFFEDIDLQKWT +     NG                             A+ISEAVDRGEVYTEY MKKLKRWET ASAHS
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTHNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHS

Query:  VTSIDRETQTFEVHTGMSMISPYKGQHTQ
        VTSIDRETQTFEVHTGMSMISPYKGQHTQ
Subjt:  VTSIDRETQTFEVHTGMSMISPYKGQHTQ

A0A5A7TN56 SWIM-type domain-containing protein | e value: 2.3e-244 | % identity: 80.66
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STL NRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHYEIVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL                             VT+SVLMEMIKQQYGYTV
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KYRRVWQAKRKALVAVFGDWDKSYNELPYWLS VVHYNPGTRVDW FLPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGENASSWS                                                     NFNNKYKSKQLKDLVFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK
        RKFIRNMKEIKQLNPECLEFFEDIDLQKWT              NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MK
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK

Query:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
        KLKRWET ASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
Subjt:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV

A0A5A7UHL3 SWIM-type domain-containing protein | e value: 6.5e-207 | % identity: 74.71
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDL+ILEP NHGDSDIDVE+DELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STLANRDE VDTSCLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC----LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYN
        QHYEIVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHS      VT+SVLMEMIKQQY                             
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC----LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYN

Query:  ELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWS--
                      GTRVDW FLPSDV GTTIFGR+FWAFGPAIEG KYCRPLIQIDGTHLYGKYKGKMLT LSIDANGHIFPLAFAIVEGENASSWS  
Subjt:  ELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWS--

Query:  --------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDID
                                                           NFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDID
Subjt:  --------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDID

Query:  LQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFE
        LQKWT              NA ECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MKKLKRWET ASAHSVTSIDRETQTFE
Subjt:  LQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFE

Query:  VHTGMSMISPYKGQHTQV
        VHTGMSMISPYKGQHTQV
Subjt:  VHTGMSMISPYKGQHTQV

A0A5D3BS92 SWIM-type domain-containing protein | e value: 5.1e-244 | % identity: 80.48
Query:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
        MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQ STL NRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT
Subjt:  MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVT

Query:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV
        QHYEIVVVESNQNI SVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL                             VT+SVLMEMIKQQYGYTV
Subjt:  QHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCL-----------------------------VTMSVLMEMIKQQYGYTV

Query:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
        KYRRVWQAKRKALVAVFGDWDKSYNELPYWLS VVHYNPGTRVDW FLPSDVPGTTIFGR+FWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI
Subjt:  KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSI

Query:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR
        DANGHIFPLAFAIVEGENASSWS                                                     NFNNKYKSKQLKDLVFRAGNQHQR
Subjt:  DANGHIFPLAFAIVEGENASSWS----------------------------------------------------CNFNNKYKSKQLKDLVFRAGNQHQR

Query:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK
        RKFIRNMKEIKQLNPECLEFFEDIDLQKWT              NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEY MK
Subjt:  RKFIRNMKEIKQLNPECLEFFEDIDLQKWTH-------------NAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMK

Query:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
        KLK+WET ASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV
Subjt:  KLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase | e value: 7.7e-35 | % identity: 25.84
Query:  GMLFDCKEDLQLAVKKYCVTQHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC-------------------------LVT
        G+ F    +++ AV    + +  + ++ E+ +++  V C++W   C W +  SRR+  GLFEI+   G H C                          ++
Subjt:  GMLFDCKEDLQLAVKKYCVTQHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC-------------------------LVT

Query:  MSVLMEMIKQQYGYTV-------KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIF--LPSDVPGTTIFGRIFWAFGPAIEGFKYC
         + L +  ++++G+ +           V  AK KA+   FGDWD+S+  +P  +S V+H + G  VDW +  L  D P    F  +FWAF  +I+GF++C
Subjt:  MSVLMEMIKQQYGYTV-------KYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIF--LPSDVPGTTIFGRIFWAFGPAIEGFKYC

Query:  RPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWS---CNFNNKYKSKQ--------------------------------------
        RPLI +D  +L GKYK K++ A + DA    FPLAFA+ +  +  SW         K   +Q                                      
Subjt:  RPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWS---CNFNNKYKSKQ--------------------------------------

Query:  -------------LKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDIDLQKW--THN----------AAECMNGVFKGARMLPMTSLVRLTFYRTI
                     +  LV  AG+  Q+ +F   MKEIK+ NPE  ++ +     +W   H+            E +  V K  R + M   V L F +  
Subjt:  -------------LKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDIDLQKW--THN----------AAECMNGVFKGARMLPMTSLVRLTFYRTI

Query:  LYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQVYGHTTDS
          F         ++  G+VYTE+VM+KL+ +ET +    +T    E   ++V      ++P K   T++ G + DS
Subjt:  LYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFEVHTGMSMISPYKGQHTQVYGHTTDS

AT1G64255.1 MuDR family transposase | e value: 3.8e-34 | % identity: 25.94
Query:  IQKGMLFDCKEDLQLAVKKYCVTQHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC-------------------------
        ++ G+ F   ++L+ AV    +    + VV E+ ++     C +W   C W L  +R K HGL EI +  G H+C                         
Subjt:  IQKGMLFDCKEDLQLAVKKYCVTQHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC-------------------------

Query:  LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQI
          T+S L +  K++ GY ++   V  AK KA+  VFGDWD+S+ + P  +S +   N G  VDW +     P    F  +FWAF  +IEGF++CRPLI +
Subjt:  LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQI

Query:  DGTHLYGKYKGKMLTALSIDANGHIFPLAFAI--------------------------------------VEGENASSWS--------------CNFNNK
        D  +L  +Y+ K++ A  +DA    FPLAFA+                                      V  E+ S W                 F+  
Subjt:  DGTHLYGKYKGKMLTALSIDANGHIFPLAFAI--------------------------------------VEGENASSWS--------------CNFNNK

Query:  YKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDIDLQKW---------------THNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFER
        + S  L   + RAG+  Q+ +F+  M +IK+ NPE  ++ +     +W                  A   +   F+ A  + +T  V L F      F++
Subjt:  YKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDIDLQKW---------------THNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFER

Query:  RRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFEVHTGM
          +    +++ G+VYTE VM KL+ + T    +S      +   F+V T +
Subjt:  RRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFEVHTGM

AT1G64260.1 MuDR family transposase | e value: 6.1e-40 | % identity: 25.22
Query:  FVDTSCLIQKGMLFDCKEDLQLAVKKYCVTQHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC------------------
        ++D    +  G+ F  +++L+ AV  +C+ +    +V E+ + + +  C +W   C W LR +R + HGL EI++  G H+C                  
Subjt:  FVDTSCLIQKGMLFDCKEDLQLAVKKYCVTQHYEIVVVESNQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSC------------------

Query:  -------LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKY
                ++++ L +  K++ GY ++  ++   K + +  VFGD D+S+  +P  +S   H + G  VDW +     P    F  +FW+F  +IEGF++
Subjt:  -------LVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLPSDVPGTTIFGRIFWAFGPAIEGFKY

Query:  CRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSW----------------------------------------------------
        CRPLI +D   L GKY+ K++ A  +DA    FPLAFA+ +  +  SW                                                    
Subjt:  CRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSW----------------------------------------------------

Query:  SCNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDIDLQKW--THNAA----------ECMNGVFKGAR--MLPMTSLVRLTFYRT
           F   ++   L+ LV +AG+ +Q+ +F   M +IK+ NPE  ++ + I   KW   H++           E +  V +G     + MT  V L F   
Subjt:  SCNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIKQLNPECLEFFEDIDLQKW--THNAA----------ECMNGVFKGAR--MLPMTSLVRLTFYRT

Query:  ILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRET
           F++  + I  +++RG VYTE  M KL+ + T +  + +T ++R++
Subjt:  ILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRET


Sequences
CDS sequence
ATGTCGTTGCGACCTGAAGATTTAGTTATTCTTGAACCTGGTAATCACGGAGATAGTGATATAGATGTAGAAATTGACGAACTATTTGGTAATGAAGAAAATATGGAAGA
AGAGAATGAAAGGATCCCTTCTGAGATATTTACACAGATAGATTGGGATATTACGAATTCTGTTTGTGAACAAAGGTCTACGTTGGCAAATAGGGATGAATTTGTAGACA
CATCATGTTTGATTCAGAAGGGAATGTTATTTGACTGCAAGGAAGATCTTCAATTAGCCGTAAAAAAGTACTGTGTCACCCAACACTACGAAATTGTTGTAGTCGAATCC
AACCAAAATATTTTGTCTGTTCGATGCAAACAATGGAGTAATGGTTGCAACTGGAGGTTACGTGGAAGTAGGCGTAAAAGCCACGGATTGTTTGAGATCAGTCGACTGGA
AGGAGAGCACTCATGTCTTGTTACTATGTCGGTGCTCATGGAAATGATAAAACAACAGTATGGTTACACGGTTAAATACAGACGGGTGTGGCAAGCGAAGAGGAAAGCTT
TGGTTGCTGTTTTCGGTGATTGGGACAAATCGTACAATGAGCTCCCGTACTGGTTGAGTGTCGTTGTACATTATAATCCAGGAACTCGAGTTGATTGGATTTTTCTTCCA
TCTGATGTACCTGGGACAACCATATTTGGACGCATTTTCTGGGCATTTGGTCCTGCAATAGAAGGGTTCAAATATTGTAGGCCATTAATTCAAATCGACGGAACCCATTT
GTATGGAAAGTATAAAGGCAAAATGTTAACTGCCCTATCTATCGATGCAAATGGTCATATATTTCCTCTTGCATTTGCTATTGTGGAAGGGGAAAACGCGTCCAGTTGGT
CATGCAACTTCAATAATAAATACAAATCGAAGCAACTAAAAGATTTGGTGTTTAGGGCAGGTAATCAACACCAAAGGCGCAAATTTATAAGAAACATGAAAGAAATAAAA
CAACTGAACCCAGAGTGTCTTGAATTCTTTGAAGATATTGATCTACAAAAATGGACACACAACGCAGCTGAATGTATGAATGGGGTATTTAAAGGAGCTCGTATGTTACC
CATGACATCTTTGGTTAGATTAACATTTTATCGCACAATTCTGTATTTTGAACGCCGAAGAGCTGAGATAAGTGAAGCAGTTGACCGTGGCGAAGTTTATACAGAATATG
TGATGAAAAAGCTAAAAAGATGGGAAACATGGGCTTCTGCACATTCAGTGACATCCATTGATAGAGAAACCCAGACGTTTGAGGTTCACACGGGTATGAGTATGATTTCT
CCATATAAAGGCCAGCACACACAGGTATATGGGCATACGACAGATTCTCCATTATATCAATTTGTGGGGCAAGGTATTTAG
mRNA sequence
ATGTCGTTGCGACCTGAAGATTTAGTTATTCTTGAACCTGGTAATCACGGAGATAGTGATATAGATGTAGAAATTGACGAACTATTTGGTAATGAAGAAAATATGGAAGA
AGAGAATGAAAGGATCCCTTCTGAGATATTTACACAGATAGATTGGGATATTACGAATTCTGTTTGTGAACAAAGGTCTACGTTGGCAAATAGGGATGAATTTGTAGACA
CATCATGTTTGATTCAGAAGGGAATGTTATTTGACTGCAAGGAAGATCTTCAATTAGCCGTAAAAAAGTACTGTGTCACCCAACACTACGAAATTGTTGTAGTCGAATCC
AACCAAAATATTTTGTCTGTTCGATGCAAACAATGGAGTAATGGTTGCAACTGGAGGTTACGTGGAAGTAGGCGTAAAAGCCACGGATTGTTTGAGATCAGTCGACTGGA
AGGAGAGCACTCATGTCTTGTTACTATGTCGGTGCTCATGGAAATGATAAAACAACAGTATGGTTACACGGTTAAATACAGACGGGTGTGGCAAGCGAAGAGGAAAGCTT
TGGTTGCTGTTTTCGGTGATTGGGACAAATCGTACAATGAGCTCCCGTACTGGTTGAGTGTCGTTGTACATTATAATCCAGGAACTCGAGTTGATTGGATTTTTCTTCCA
TCTGATGTACCTGGGACAACCATATTTGGACGCATTTTCTGGGCATTTGGTCCTGCAATAGAAGGGTTCAAATATTGTAGGCCATTAATTCAAATCGACGGAACCCATTT
GTATGGAAAGTATAAAGGCAAAATGTTAACTGCCCTATCTATCGATGCAAATGGTCATATATTTCCTCTTGCATTTGCTATTGTGGAAGGGGAAAACGCGTCCAGTTGGT
CATGCAACTTCAATAATAAATACAAATCGAAGCAACTAAAAGATTTGGTGTTTAGGGCAGGTAATCAACACCAAAGGCGCAAATTTATAAGAAACATGAAAGAAATAAAA
CAACTGAACCCAGAGTGTCTTGAATTCTTTGAAGATATTGATCTACAAAAATGGACACACAACGCAGCTGAATGTATGAATGGGGTATTTAAAGGAGCTCGTATGTTACC
CATGACATCTTTGGTTAGATTAACATTTTATCGCACAATTCTGTATTTTGAACGCCGAAGAGCTGAGATAAGTGAAGCAGTTGACCGTGGCGAAGTTTATACAGAATATG
TGATGAAAAAGCTAAAAAGATGGGAAACATGGGCTTCTGCACATTCAGTGACATCCATTGATAGAGAAACCCAGACGTTTGAGGTTCACACGGGTATGAGTATGATTTCT
CCATATAAAGGCCAGCACACACAGGTATATGGGCATACGACAGATTCTCCATTATATCAATTTGTGGGGCAAGGTATTTAG
Protein sequence
MSLRPEDLVILEPGNHGDSDIDVEIDELFGNEENMEEENERIPSEIFTQIDWDITNSVCEQRSTLANRDEFVDTSCLIQKGMLFDCKEDLQLAVKKYCVTQHYEIVVVES
NQNILSVRCKQWSNGCNWRLRGSRRKSHGLFEISRLEGEHSCLVTMSVLMEMIKQQYGYTVKYRRVWQAKRKALVAVFGDWDKSYNELPYWLSVVVHYNPGTRVDWIFLP
SDVPGTTIFGRIFWAFGPAIEGFKYCRPLIQIDGTHLYGKYKGKMLTALSIDANGHIFPLAFAIVEGENASSWSCNFNNKYKSKQLKDLVFRAGNQHQRRKFIRNMKEIK
QLNPECLEFFEDIDLQKWTHNAAECMNGVFKGARMLPMTSLVRLTFYRTILYFERRRAEISEAVDRGEVYTEYVMKKLKRWETWASAHSVTSIDRETQTFEVHTGMSMIS
PYKGQHTQVYGHTTDSPLYQFVGQGI
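
The protein sequence above should correspond to a frame-0 translation of the CDS. A minimal sketch of that check is given here, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGTCGTTGCGACCTGAAGAT"))
```

Joining the CDS lines into one string and passing it to `translate` gives a sequence that can be compared against the listed protein. The fragment shown yields MSLRPED, matching the start of the protein sequence.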