CuGenDBv2

Moc04g12410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g12410
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:9465160..9477869
RNA-Seq Expression: Moc04g12410
Synteny: Moc04g12410
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR005162 - Retrotransposon gag domain
  IPR021109 - Aspartic peptidase domain superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
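The genome location string can be split into its parts programmatically. The sketch below is only an illustration: the names `Locus` and `parse_locus` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a location written as 'chr4:9465160..9477869'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(chrom, start, end)


locus = parse_locus("chr4:9465160..9477869")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 12710 bp on chr4.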


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]; e value 7.7e-252; identity 93.54%
Query:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK
        D  ESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYRRLP  SISTYSQLRREFLA FSSRHYDK
Subjt:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK

Query:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD
        K ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ+LLRTKTGRPERKI RGRSGKD
Subjt:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD

Query:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE
           ADPKSKD+G FSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGME+LLKRPEKLRGAP+RRSKDKYCRFHREHGHNTSD WELKRQ+E
Subjt:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE

Query:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV
        +LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFG PSGGQSG KRKELARAARREVCII+EQRPTCPITFD ADLEEVHLPHNDALV
Subjt:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV

Query:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFV
        IAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESV PEG IDLPVT GQ++TQVTQMAEFV
Subjt:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]; e value 6.7e-248; identity 80.48%
Query:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK
        D  ESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW                              
Subjt:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK

Query:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD
                               FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQ+LLRTKTGRPER IDRGRSGKD
Subjt:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD

Query:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE
          KAD KSKD+G FSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGME+LLKRPEKLRGAP+RR+KDKYCRFHREH HNTSD WELKRQ+E
Subjt:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE

Query:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV
        DLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFG PSGGQSGHKRKELARAARREVCII+EQRPTCPITFDSADLEEVHLPHNDALV
Subjt:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV

Query:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSF
        IAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV PEGCIDLPVT G ++TQVTQMAEFVVIDGRSAYNAIFGRPIIHSF
Subjt:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSF

Query:  RAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLL
        RA+PSTLHQVLK STPNG+G VRGEQ ASRECYASALKGSSVCALE   S DG LEF+A+LPRREF APTEELELVPLL
Subjt:  RAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]; e value 1.6e-220; identity 87.89%
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQ+LLRTK G       +GRSGKD    DPKSKD+G FS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGME+LLKRPEKLRGAP+RRSKDKYCRFHREHGHNTSD WELK Q+EDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFG PSGGQSGHKRK+LARAARREVCII+EQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV PEGCIDLPVT GQ++T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLK STPNG+GTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKG

Query:  SSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLLSPEKQPDLIEIGAL
        +SVCALE   S DG LEFEADLP REF AP EELELVPLLS EKQ   +++G L
Subjt:  SSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLLSPEKQPDLIEIGAL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]; e value 1.7e-243; identity 79.93%
Query:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK
        D  E  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFSSRHYD+
Subjt:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK

Query:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD
        K  THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQ+LLRTKTGRPE+ ID+GR+GKD
Subjt:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD

Query:  TGKADPKSKDEGP-FSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQV
         GKAD KS+D+GP  SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GME+LLKRPEKLRG P++R+ DKYCRFHR+HGHNTS+ WELKRQ+
Subjt:  TGKADPKSKDEGP-FSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQV

Query:  EDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDAL
        EDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCII+EQRPT  I F+ ADLE VHLPHNDAL
Subjt:  EDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDAL

Query:  VIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHS
        VIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES++ EGCIDLPV+  Q+ TQVTQMAEFVVIDGRSAYNAIFGRPIIHS
Subjt:  VIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHS

Query:  FRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGD
        FRAVPSTLHQVLK ST NG+GTVRGE   SRECYAS  K SSVCALE Q   D
Subjt:  FRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]; e value 7.7e-220; identity 75%
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDKKIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLP RSISTYSQLR+EF++QFSS HYD+K ATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDKKIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQ+LLRTKTGRPE++ID+ +  ++  KAD KS+D+G  SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGME+LLKRPEKLRG  ++R+K+KYCRFHR+HGHNT+ CWELKRQ+EDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFG P+GGQSG+KRKELAR ARREVCII+E +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT GQ+ TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLK STPN +G VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVC

Query:  ALEGQASGDGPLEFEADLP---RREFFAPTEELELVPLLSPEKQ
        ALE Q +     E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALEGQASGDGPLEFEADLP---RREFFAPTEELELVPLLSPEKQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813; e value 3.7e-252; identity 93.54%
Query:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK
        D  ESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYRRLP  SISTYSQLRREFLA FSSRHYDK
Subjt:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK

Query:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD
        K ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ+LLRTKTGRPERKI RGRSGKD
Subjt:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD

Query:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE
           ADPKSKD+G FSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGME+LLKRPEKLRGAP+RRSKDKYCRFHREHGHNTSD WELKRQ+E
Subjt:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE

Query:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV
        +LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFG PSGGQSG KRKELARAARREVCII+EQRPTCPITFD ADLEEVHLPHNDALV
Subjt:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV

Query:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFV
        IAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESV PEG IDLPVT GQ++TQVTQMAEFV
Subjt:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823; e value 3.3e-248; identity 80.48%
Query:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK
        D  ESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW                              
Subjt:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK

Query:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD
                               FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQ+LLRTKTGRPER IDRGRSGKD
Subjt:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD

Query:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE
          KAD KSKD+G FSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGME+LLKRPEKLRGAP+RR+KDKYCRFHREH HNTSD WELKRQ+E
Subjt:  TGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVE

Query:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV
        DLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFG PSGGQSGHKRKELARAARREVCII+EQRPTCPITFDSADLEEVHLPHNDALV
Subjt:  DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALV

Query:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSF
        IAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV PEGCIDLPVT G ++TQVTQMAEFVVIDGRSAYNAIFGRPIIHSF
Subjt:  IAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSF

Query:  RAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLL
        RA+PSTLHQVLK STPNG+G VRGEQ ASRECYASALKGSSVCALE   S DG LEF+A+LPRREF APTEELELVPLL
Subjt:  RAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899; e value 7.6e-221; identity 87.89%
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQ+LLRTK G       +GRSGKD    DPKSKD+G FS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGME+LLKRPEKLRGAP+RRSKDKYCRFHREHGHNTSD WELK Q+EDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFG PSGGQSGHKRK+LARAARREVCII+EQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV PEGCIDLPVT GQ++T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLK STPNG+GTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKG

Query:  SSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLLSPEKQPDLIEIGAL
        +SVCALE   S DG LEFEADLP REF AP EELELVPLLS EKQ   +++G L
Subjt:  SSVCALEGQASGDGPLEFEADLPRREFFAPTEELELVPLLSPEKQPDLIEIGAL

A0A6J1DHB3 uncharacterized protein LOC111020479; e value 8.3e-244; identity 79.93%
Query:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK
        D  E  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFSSRHYD+
Subjt:  DPTESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDK

Query:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD
        K  THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQ+LLRTKTGRPE+ ID+GR+GKD
Subjt:  KIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKD

Query:  TGKADPKSKDEGP-FSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQV
         GKAD KS+D+GP  SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GME+LLKRPEKLRG P++R+ DKYCRFHR+HGHNTS+ WELKRQ+
Subjt:  TGKADPKSKDEGP-FSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQV

Query:  EDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDAL
        EDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCII+EQRPT  I F+ ADLE VHLPHNDAL
Subjt:  EDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDAL

Query:  VIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHS
        VIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES++ EGCIDLPV+  Q+ TQVTQMAEFVVIDGRSAYNAIFGRPIIHS
Subjt:  VIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHS

Query:  FRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGD
        FRAVPSTLHQVLK ST NG+GTVRGE   SRECYAS  K SSVCALE Q   D
Subjt:  FRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGD

A0A6J1DZB9 uncharacterized protein LOC111024904; e value 3.8e-220; identity 75%
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDKKIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLP RSISTYSQLR+EF++QFSS HYD+K ATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDKKIATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQ+LLRTKTGRPE++ID+ +  ++  KAD KS+D+G  SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGME+LLKRPEKLRG  ++R+K+KYCRFHR+HGHNT+ CWELKRQ+EDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFG P+GGQSG+KRKELAR ARREVCII+E +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT GQ+ TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLK STPN +G VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVC

Query:  ALEGQASGDGPLEFEADLP---RREFFAPTEELELVPLLSPEKQ
        ALE Q +     E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALEGQASGDGPLEFEADLP---RREFFAPTEELELVPLLSPEKQ

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
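In the alignment blocks of this Homology section, the unlabeled line between each Query and Subjt row marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. A minimal sketch of tallying one row is given below. The function name `midline_stats` is hypothetical, and the sketch assumes the query and the middle line are column-aligned, which scraped text does not always preserve. The percentages reported for each hit are computed over the whole alignment, so a single row will not generally reproduce them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment row.

    Assumes `query` and `midline` cover exactly the same columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is an identity when the midline repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and m not in " +"
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First ten columns of the first row of the first GenBank hit.
identities, positives, columns = midline_stats("DPTESPFTSD", "D  ESPFTSD")
print(identities, positives, columns)  # prints: 8 8 10
print(f"{100 * identities / columns:.1f}% identity over this slice")
```

To approximate a hit-level figure, the tallies of all rows of that hit would be summed before dividing.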

Sequences
CDS sequence
ATGACTGTTAGATCATCCAAATCGGAGCTCAAACGGAGAAGTTACGATCGAAACAAATTGGAAGGAACAATACCCGAGAAGATGGCGTGGACAGTTCGTAAGCTG
ACAAGTGACAAGTTTCTTTTTAAATGTGATCCTCACCGTGGAGAGGATCGTCTTTGCTCTTGTGAAGAGCAGTGTACCGGTTTAACTCCTACCATGGAAAGAGTC
AGAATTTCAATTTACAACCCAAGAGGTTTCATTCCTTTTACTGATAGTCAATCTATTACTAAGCCTCCTTTCTTTAATGAGGGATTTAAAATTCCAACAAAAGAT
GTTGATGGTGTTAGAGTTGTAAAGCCTAAAGAAGAATGGTCTATAATTGAAAAGAAAGCATGTTCTTTAAATGCTAAAGCTATTAATTGCTTGTTTTGTGCTTTG
AATGAAGTTGAATTTACTAACATTGTCAATGCTTTAGAAGGACTTGGAAAAGAATATTCAAATCTTGAGAAGGTAAAGAAGCTCTTATGGTCCTTGCCTAAACAA
TGGGAGCCTAAAGTCACCGCCATTCAAGAGGCAAAGGATCTCAAGACTCTCTCCATGGACGAACTCATTGGTTCGTTGATGACACATGAGATAAAGATCAAGAAA
AACATGGAGGATGAGAAGAAAAAGAAAGAGAAGAGCATAGCATTAAAGGCCATCACCTTGGAAGTTGACTCCGAAGGTGAGAATGCTCTTGATGAAGATGATGTG
GCCTATCTCTCACGTAAGTATAAAAATTTCATCAAGAGAAAGAAACAATTCAAGAAGAATTTCTTCAACCAAAAAGAGTCAAAAAGTGAAAAGAGCAAAAAGGAT
GAGGTAATTTGTTATGAATGCAAAAAACCGGGTCATATTAGAACCGATTGTCCTCTTCTTAAATCATCCAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGG
GATGATAGTGATGAAAGTGGAAGTGAAAGTGAGAATGAAGAAGTGACCAACTTTTGCTTCATGGCTCATAGTGACAAGGAGGATGAACAAGATGAGGTAACTCTT
GATCCCCTTTCTTATGATGAGTTGTTTGAAGCTTTTGAGAATATGCAAAATGATTTAGAAAAGCTTGGTTCTAAATATGTTATGCTTAAAAAGAAATACAATGTC
TTAACTAGTGAAAATAAGTCTTTACTTGATGATATTGCTTGCTTAAAGAAAAATGAGCATGATGTTGTAAATATCTCTTGTGATAAGCATGTTCTTGATTGTGAT
GAGAAAAATACATTACTTGATAAAATTAGATTTCTTAAGCATGATGGCTGTGAAAAAGATAATTTGATTAAATTGCTTAAGAAAAATGAATCAAATGCTTTAGTG
GAACTTGATAAGGCTAAAGATTCTATTAAAAAATTAACAATAGGTGCTCAAAGGTTGGACAAGATTATTGAAGTAGGTAAACCTTATGGTGATAAAAGAGGTTTA
GGCTATATTGATGAATGCTCTACTCCTTCAAGTTCTAAAACTATCTTTGTTAAAGCATCTCCTAATATGCCCAAACTTGTTGCTCCTAAAGTTGTATCTAAACAT
GCTAAAATTAACTTTGTGCCTATATGTCATTATTGTGGTGTTGAAGAAAGAAATTTTGGAGATTTACTTGTTAGTGACAAAAGCAAAGAGATTGCTTCAAGTAAC
CAAGAAGTGAGCATCAACGAAAATAAGGTGGACGGTTTTTCATCCATGCCTAAGGAGTGGAAGTATGCTCCATCTCATCCTAAGGATTTAATTCTTGGTGATCCC
GAACAAGGGTTGCATACCGATCCAACAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGAT
GGGTCGAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACT
GGTAGCGCGCGTTTGTGGTATCGGAGACTGCCAACCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGAC
AAAAAGATAGCGACCCATCTCGCCACCATCAGGCAGAAGGAAGGTGAGACGCTACGGGAGTATGTCACCAGGTTCCAGGAGGAACAATTGAAGGTCGCACACTGC
TCCGACGACTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAAGCCCTCACGGTGAAGCTAGGAGAGGAGGCTCCGGCAACCTTCGCCGAAGTGCTACAA
AAGGCGAAGAAAGTCATCGACGGGCAGAAGCTCCTCCGAACCAAAACCGGCCGACCAGAGAGAAAAATCGACCGGGGCAGAAGTGGAAAAGATACAGGAAAGGCG
GATCCCAAGTCCAAGGACGAGGGACCTTTCTCCAGTGGCCGAGCTGAGTATCGTAGGGCGGAGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCG
ACCACTATTCCAATCTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAGACTCCTCAAACGACCTGAGAAGCTTCGGGGAGCCCCGAAGAGGCGCAGC
AAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCATAACACGTCAGATTGCTGGGAATTAAAGCGCCAAGTAGAGGATCTTATTCAAGATGGCTACTTCAAG
AAATTTGTGGGGAAGCCCAGGACCAGCTCGGCAGAAAAGAAGGAAGAGAGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACC
ATTTTCGGATGGCCAAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAAGGAGCAGAGGCCGACCTGC
CCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGGTACTA
GTAGACGGGGGCGCGTCTGCTAACATCCTGTCCCTACCAACATACCTTGCCCTGGGTTGGACAAGGTCGCAATTGAAGAAAAGCCCGACACCACTAGTTGGGTTC
TCTGGAGAGTCGGTCACCCCAGAGGGTTGCATCGACTTGCCGGTCACATTTGGGCAAGAAAAAACACAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGT
AGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCGTTCCCTCAACACTTCATCAAGTTTTGAAGAATTCCACCCCTAATGGCATG
GGCACAGTCCGAGGAGAGCAAACCGCTTCGAGGGAATGCTATGCCTCCGCACTCAAAGGGTCATCGGTATGCGCCCTCGAAGGTCAAGCCAGTGGGGATGGGCCG
CTCGAGTTCGAGGCCGACCTGCCGAGAAGGGAGTTTTTCGCGCCTACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAGCCAGATCTGATAGAG
ATTGGCGCTCTAGAGCCCTCATGGATGGACCCGATTATGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTAGCGAGGAAGGCAGCT
CGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCCAACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGCCCAAACCCATGTGGGTACCCTTGACCCGAAC
TGGGAGGGGCCGTTTGAAGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCTTCGCGCACCCATGGAACGTGGAACACCTG
AAGCGTTATTACCCTTGA
mRNA sequence
ATGACTGTTAGATCATCCAAATCGGAGCTCAAACGGAGAAGTTACGATCGAAACAAATTGGAAGGAACAATACCCGAGAAGATGGCGTGGACAGTTCGTAAGCTG
ACAAGTGACAAGTTTCTTTTTAAATGTGATCCTCACCGTGGAGAGGATCGTCTTTGCTCTTGTGAAGAGCAGTGTACCGGTTTAACTCCTACCATGGAAAGAGTC
AGAATTTCAATTTACAACCCAAGAGGTTTCATTCCTTTTACTGATAGTCAATCTATTACTAAGCCTCCTTTCTTTAATGAGGGATTTAAAATTCCAACAAAAGAT
GTTGATGGTGTTAGAGTTGTAAAGCCTAAAGAAGAATGGTCTATAATTGAAAAGAAAGCATGTTCTTTAAATGCTAAAGCTATTAATTGCTTGTTTTGTGCTTTG
AATGAAGTTGAATTTACTAACATTGTCAATGCTTTAGAAGGACTTGGAAAAGAATATTCAAATCTTGAGAAGGTAAAGAAGCTCTTATGGTCCTTGCCTAAACAA
TGGGAGCCTAAAGTCACCGCCATTCAAGAGGCAAAGGATCTCAAGACTCTCTCCATGGACGAACTCATTGGTTCGTTGATGACACATGAGATAAAGATCAAGAAA
AACATGGAGGATGAGAAGAAAAAGAAAGAGAAGAGCATAGCATTAAAGGCCATCACCTTGGAAGTTGACTCCGAAGGTGAGAATGCTCTTGATGAAGATGATGTG
GCCTATCTCTCACGTAAGTATAAAAATTTCATCAAGAGAAAGAAACAATTCAAGAAGAATTTCTTCAACCAAAAAGAGTCAAAAAGTGAAAAGAGCAAAAAGGAT
GAGGTAATTTGTTATGAATGCAAAAAACCGGGTCATATTAGAACCGATTGTCCTCTTCTTAAATCATCCAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGG
GATGATAGTGATGAAAGTGGAAGTGAAAGTGAGAATGAAGAAGTGACCAACTTTTGCTTCATGGCTCATAGTGACAAGGAGGATGAACAAGATGAGGTAACTCTT
GATCCCCTTTCTTATGATGAGTTGTTTGAAGCTTTTGAGAATATGCAAAATGATTTAGAAAAGCTTGGTTCTAAATATGTTATGCTTAAAAAGAAATACAATGTC
TTAACTAGTGAAAATAAGTCTTTACTTGATGATATTGCTTGCTTAAAGAAAAATGAGCATGATGTTGTAAATATCTCTTGTGATAAGCATGTTCTTGATTGTGAT
GAGAAAAATACATTACTTGATAAAATTAGATTTCTTAAGCATGATGGCTGTGAAAAAGATAATTTGATTAAATTGCTTAAGAAAAATGAATCAAATGCTTTAGTG
GAACTTGATAAGGCTAAAGATTCTATTAAAAAATTAACAATAGGTGCTCAAAGGTTGGACAAGATTATTGAAGTAGGTAAACCTTATGGTGATAAAAGAGGTTTA
GGCTATATTGATGAATGCTCTACTCCTTCAAGTTCTAAAACTATCTTTGTTAAAGCATCTCCTAATATGCCCAAACTTGTTGCTCCTAAAGTTGTATCTAAACAT
GCTAAAATTAACTTTGTGCCTATATGTCATTATTGTGGTGTTGAAGAAAGAAATTTTGGAGATTTACTTGTTAGTGACAAAAGCAAAGAGATTGCTTCAAGTAAC
CAAGAAGTGAGCATCAACGAAAATAAGGTGGACGGTTTTTCATCCATGCCTAAGGAGTGGAAGTATGCTCCATCTCATCCTAAGGATTTAATTCTTGGTGATCCC
GAACAAGGGTTGCATACCGATCCAACAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGAT
GGGTCGAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACT
GGTAGCGCGCGTTTGTGGTATCGGAGACTGCCAACCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGAC
AAAAAGATAGCGACCCATCTCGCCACCATCAGGCAGAAGGAAGGTGAGACGCTACGGGAGTATGTCACCAGGTTCCAGGAGGAACAATTGAAGGTCGCACACTGC
TCCGACGACTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAAGCCCTCACGGTGAAGCTAGGAGAGGAGGCTCCGGCAACCTTCGCCGAAGTGCTACAA
AAGGCGAAGAAAGTCATCGACGGGCAGAAGCTCCTCCGAACCAAAACCGGCCGACCAGAGAGAAAAATCGACCGGGGCAGAAGTGGAAAAGATACAGGAAAGGCG
GATCCCAAGTCCAAGGACGAGGGACCTTTCTCCAGTGGCCGAGCTGAGTATCGTAGGGCGGAGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCG
ACCACTATTCCAATCTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAGACTCCTCAAACGACCTGAGAAGCTTCGGGGAGCCCCGAAGAGGCGCAGC
AAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCATAACACGTCAGATTGCTGGGAATTAAAGCGCCAAGTAGAGGATCTTATTCAAGATGGCTACTTCAAG
AAATTTGTGGGGAAGCCCAGGACCAGCTCGGCAGAAAAGAAGGAAGAGAGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACC
ATTTTCGGATGGCCAAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAAGGAGCAGAGGCCGACCTGC
CCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGGTACTA
GTAGACGGGGGCGCGTCTGCTAACATCCTGTCCCTACCAACATACCTTGCCCTGGGTTGGACAAGGTCGCAATTGAAGAAAAGCCCGACACCACTAGTTGGGTTC
TCTGGAGAGTCGGTCACCCCAGAGGGTTGCATCGACTTGCCGGTCACATTTGGGCAAGAAAAAACACAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGT
AGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCGTTCCCTCAACACTTCATCAAGTTTTGAAGAATTCCACCCCTAATGGCATG
GGCACAGTCCGAGGAGAGCAAACCGCTTCGAGGGAATGCTATGCCTCCGCACTCAAAGGGTCATCGGTATGCGCCCTCGAAGGTCAAGCCAGTGGGGATGGGCCG
CTCGAGTTCGAGGCCGACCTGCCGAGAAGGGAGTTTTTCGCGCCTACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAGCCAGATCTGATAGAG
ATTGGCGCTCTAGAGCCCTCATGGATGGACCCGATTATGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTAGCGAGGAAGGCAGCT
CGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCCAACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGCCCAAACCCATGTGGGTACCCTTGACCCGAAC
TGGGAGGGGCCGTTTGAAGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCTTCGCGCACCCATGGAACGTGGAACACCTG
AAGCGTTATTACCCTTGA
Protein sequence
MTVRSSKSELKRRSYDRNKLEGTIPEKMAWTVRKLTSDKFLFKCDPHRGEDRLCSCEEQCTGLTPTMERVRISIYNPRGFIPFTDSQSITKPPFFNEGFKIPTKD
VDGVRVVKPKEEWSIIEKKACSLNAKAINCLFCALNEVEFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKVTAIQEAKDLKTLSMDELIGSLMTHEIKIKK
NMEDEKKKKEKSIALKAITLEVDSEGENALDEDDVAYLSRKYKNFIKRKKQFKKNFFNQKESKSEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATW
DDSDESGSESENEEVTNFCFMAHSDKEDEQDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYNVLTSENKSLLDDIACLKKNEHDVVNISCDKHVLDCD
EKNTLLDKIRFLKHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKTIFVKASPNMPKLVAPKVVSKH
AKINFVPICHYCGVEERNFGDLLVSDKSKEIASSNQEVSINENKVDGFSSMPKEWKYAPSHPKDLILGDPEQGLHTDPTESPFTSDVLEAPIPPKFKAPTVKPYD
GSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTYSQLRREFLAQFSSRHYDKKIATHLATIRQKEGETLREYVTRFQEEQLKVAHC
SDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQKLLRTKTGRPERKIDRGRSGKDTGKADPKSKDEGPFSSGRAEYRRAENGPTRSRPYERFTP
TTIPISEILTNIEESGMERLLKRPEKLRGAPKRRSKDKYCRFHREHGHNTSDCWELKRQVEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT
IFGWPSGGQSGHKRKELARAARREVCIIKEQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
SGESVTPEGCIDLPVTFGQEKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKNSTPNGMGTVRGEQTASRECYASALKGSSVCALEGQASGDGP
LEFEADLPRREFFAPTEELELVPLLSPEKQPDLIEIGALEPSWMDPIMDFIRGNSPQDPKERRKLARKAARMARHYNARVRPPTFQVGHLVLRKAQTHVGTLDPN
WEGPFEVKGIVRPGTYILADLKGDVFAHPWNVEHLKRYYP
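The protein sequence should correspond to a translation of the CDS sequence. The sketch below illustrates that check on the first and last codons of the CDS. It assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), listed in
# TCAG order for the first, second and third base of each codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 1; optionally stop at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from pasted text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last six codons of the CDS shown on this page.
print(translate("ATGACTGTTAGATCATCCAAATCGGAGCTC"))  # prints: MTVRSSKSEL
print(translate("AAGCGTTATTACCCTTGA"))              # prints: KRYYP
```

Both fragments agree with the start (`MTVRSSKSEL`) and end (`KRYYP`) of the protein sequence above; passing the full CDS text would allow the whole protein to be compared.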