; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g03460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g03460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:3016132..3027637
RNA-Seq ExpressionMoc07g03460
SyntenyMoc07g03460
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR005162 - Retrotransposon gag domain
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]7.1e-24384.28Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDL ESPFTSDVLE PIPPKFKAPTVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEAL
                IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQ EGETLREYVTRFQEEQLKVAHCSDDSAMCY LTGLADEAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEAL

Query:  TVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE
        TVKLG+EAPATFAE                         IGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAE+G TRS+PYERFTPTTIPISEILTNIE
Subjt:  TVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFK FVGKPRTSSAE+KEERK SRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRK+LARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]9.6e-24073.53Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE---------
        SSNQQAESSHNPA   G+ITREEFDQLRGKL+AQVEALKAKCEQK+  LNDGDL ESPFTSDVLE        APTVK YDG+KDPKDYVE         
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLA
                    IALTGSARLW                                                     FQE+QLKVA  SDDSAMCY LTGLA
Subjt:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLA

Query:  DEALTVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEIL
        DEALTVKLG EAPATFAE                         I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA +G TRS+PYERFTPTTIPISEIL
Subjt:  DEALTVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK FVGKPRTSSAE+KEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYN IFGRPIIHSFRAIPSTLHQVLKY TP+GVG V GEQ  SRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRKEFDAPTEELELVPLL
          RDGTLEF+A+LPR+EF APTEELELVPLL
Subjt:  AGRDGTLEFEADLPRKEFDAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]5.9e-21387.07Show/hide
Query:  MCYLLTGLADEALTVKLGDEAPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISE
        MCY LTGLADEALTVKL +EAPATFAE                  IG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAE+G TRS+PYERFTPTTIPISE
Subjt:  MCYLLTGLADEALTVKLGDEAPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFK FVGKPRTSSAE+KEERK SRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL
        GGPSGGQSGHKRKKLARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL
Subjt:  GGPSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYN IFGRPIIHSFRAIPSTLHQVLKY TP+GVGTV GEQT SRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRKEFDAPTEELELVPLLSPEKQPDL
        TL  RDGTLEFEADLP +EF AP EELELVPLLS EKQ  L
Subjt:  TLAGRDGTLEFEADLPRKEFDAPTEELELVPLLSPEKQPDL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.5e-22971.67Show/hide
Query:  QAESSHNP--AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE--------------
        +AESS+NP   G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDL E  F+SD+LE  IPPKFK PT+KPYDG+KDPKDYVE              
Subjt:  QAESSHNP--AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE--------------

Query:  -------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALT
               IALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQ EGETLREYVTRF EEQLKVAHCSDDSAMCY LTGLADE LT
Subjt:  -------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALT

Query:  VKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE
        VKL +EAPATFAE                         I +GR+GKD  +AD KS+DKG S SS R +YRR+ S H +S+PYE +TPTTIPI EILTNIE
Subjt:  VKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG
        E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFK FVGKPR++S E+KEERK  RTPPRR DRPAVIN         
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
            K+K+LAR ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID VLVRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRD
        S+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKY T +GVGTV GE  TSRECYAS  K SSVCALE    RD
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.7e-19468.44Show/hide
Query:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALTVKLGDE
        +IALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQ E ETLREYVTRFQEEQLKVAHCSDDSAMCY LT LADE LTVKLG+E
Subjt:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALTVKLGDE

Query:  APATFAEI----------------GRGRSGKDVE---------RADPKSKDKGSFSS-GRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIEESGMEK
        AP TF E+                  GR  K ++         +AD KS+DKGS SS  R EYRR ESG +RS+PYER+T +TIPISEILTNIEESGMEK
Subjt:  APATFAEI----------------GRGRSGKDVE---------RADPKSKDKGSFSS-GRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGGQSGHKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFK FVGKPR++S E+KEERK SRTPPRR DRPAVINTIFGGP+GGQSG+KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGGQSGHKR

Query:  KKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG
        K+LAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH LVRRVL+DG                                        G
Subjt:  KKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG

Query:  CIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRDGTLEFE
        CIDLPVT+GQD T+VTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKY TP+ VG V GEQ TSRECYASALKGS+VCALE    R    E E
Subjt:  CIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRDGTLEFE

Query:  ADLP---RKEFDAPTEELELVPLLSPEKQPD------LMEIGAPE
        ADLP   +++F  PTEELELVPLLSPE+Q +      ++E+ AP+
Subjt:  ADLP---RKEFDAPTEELELVPLLSPEKQPD------LMEIGAPE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.4e-24384.28Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDL ESPFTSDVLE PIPPKFKAPTVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEAL
                IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQ EGETLREYVTRFQEEQLKVAHCSDDSAMCY LTGLADEAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEAL

Query:  TVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE
        TVKLG+EAPATFAE                         IGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAE+G TRS+PYERFTPTTIPISEILTNIE
Subjt:  TVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFK FVGKPRTSSAE+KEERK SRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRK+LARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188234.7e-24073.53Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE---------
        SSNQQAESSHNPA   G+ITREEFDQLRGKL+AQVEALKAKCEQK+  LNDGDL ESPFTSDVLE        APTVK YDG+KDPKDYVE         
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLA
                    IALTGSARLW                                                     FQE+QLKVA  SDDSAMCY LTGLA
Subjt:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLA

Query:  DEALTVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEIL
        DEALTVKLG EAPATFAE                         I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA +G TRS+PYERFTPTTIPISEIL
Subjt:  DEALTVKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK FVGKPRTSSAE+KEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYN IFGRPIIHSFRAIPSTLHQVLKY TP+GVG V GEQ  SRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRKEFDAPTEELELVPLL
          RDGTLEF+A+LPR+EF APTEELELVPLL
Subjt:  AGRDGTLEFEADLPRKEFDAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198992.8e-21387.07Show/hide
Query:  MCYLLTGLADEALTVKLGDEAPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISE
        MCY LTGLADEALTVKL +EAPATFAE                  IG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAE+G TRS+PYERFTPTTIPISE
Subjt:  MCYLLTGLADEALTVKLGDEAPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGHTRSQPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFK FVGKPRTSSAE+KEERK SRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL
        GGPSGGQSGHKRKKLARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL
Subjt:  GGPSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYN IFGRPIIHSFRAIPSTLHQVLKY TP+GVGTV GEQT SRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRKEFDAPTEELELVPLLSPEKQPDL
        TL  RDGTLEFEADLP +EF AP EELELVPLLS EKQ  L
Subjt:  TLAGRDGTLEFEADLPRKEFDAPTEELELVPLLSPEKQPDL

A0A6J1DHB3 uncharacterized protein LOC1110204797.4e-23071.67Show/hide
Query:  QAESSHNP--AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE--------------
        +AESS+NP   G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDL E  F+SD+LE  IPPKFK PT+KPYDG+KDPKDYVE              
Subjt:  QAESSHNP--AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVE--------------

Query:  -------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALT
               IALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQ EGETLREYVTRF EEQLKVAHCSDDSAMCY LTGLADE LT
Subjt:  -------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALT

Query:  VKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE
        VKL +EAPATFAE                         I +GR+GKD  +AD KS+DKG S SS R +YRR+ S H +S+PYE +TPTTIPI EILTNIE
Subjt:  VKLGDEAPATFAE-------------------------IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG
        E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFK FVGKPR++S E+KEERK  RTPPRR DRPAVIN         
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
            K+K+LAR ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID VLVRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRD
        S+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKY T +GVGTV GE  TSRECYAS  K SSVCALE    RD
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249041.3e-19468.44Show/hide
Query:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALTVKLGDE
        +IALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQ E ETLREYVTRFQEEQLKVAHCSDDSAMCY LT LADE LTVKLG+E
Subjt:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALTVKLGDE

Query:  APATFAEI----------------GRGRSGKDVE---------RADPKSKDKGSFSS-GRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIEESGMEK
        AP TF E+                  GR  K ++         +AD KS+DKGS SS  R EYRR ESG +RS+PYER+T +TIPISEILTNIEESGMEK
Subjt:  APATFAEI----------------GRGRSGKDVE---------RADPKSKDKGSFSS-GRAEYRRAESGHTRSQPYERFTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGGQSGHKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFK FVGKPR++S E+KEERK SRTPPRR DRPAVINTIFGGP+GGQSG+KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPPRRTDRPAVINTIFGGPSGGQSGHKR

Query:  KKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG
        K+LAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH LVRRVL+DG                                        G
Subjt:  KKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG

Query:  CIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRDGTLEFE
        CIDLPVT+GQD T+VTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKY TP+ VG V GEQ TSRECYASALKGS+VCALE    R    E E
Subjt:  CIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRDGTLEFE

Query:  ADLP---RKEFDAPTEELELVPLLSPEKQPD------LMEIGAPE
        ADLP   +++F  PTEELELVPLLSPE+Q +      ++E+ AP+
Subjt:  ADLP---RKEFDAPTEELELVPLLSPEKQPD------LMEIGAPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAATTGCGTTGCTCCTAAGAAAATACTACATATATATGTAGAGCACCCTAACAATATTTTTGATGAAGGAGAAAGCTTTATGGATGTTGAAGTGATTGCAGAAAC
ACCTATTGCCATAAAAGAGACACATAATAGTTTACAATTAGATGATATTTGTGGTAGTAGCGATAGAGGAATGCACATAGAAGAACTACTAAACCTTGATTATGATGGAG
AAGATGATGACGAATTGTTTATTGGAAAGTTTGATGGTGTGGGGTTAAACTTAAGAGATGGTGGTGTAGGTTTACATTTAGAGAATGAAGAATTGTGTAATTCTGAATGT
GGTTCATTTAGTCACCTAAACTCACCCATTCAATCTGATGATGATGTTGTCGAGGGATCACAAAAGTATGTTGAGTTTCATCCTTCTATGTTGAAGGATAACATAGAATT
TACATTAGGGATGAAATTTGGAACATTAACTGAATTGAGAGATGCAATAAGAGAATATTCCTTACGATCTGGACATGAATTGATTTACCAGAAAAATGAACCTGAAAGGG
TTACAGTGAGATGTCAAGATGGGTGCAATTGGAGATTTAGGGCTCAACTGGAAAAGGATTGGAAAACAATTGTAGATCTTGTGAGAAAGGACTTTAATATTAACATATCA
AGACATCAAGCATATAGAGCAAGGAAGATTGCAATGCATAAGCTTGAAGGCACAGTTGAGATGCAATATGCCAAACTACGAGACTACTGTGAGGAATTGTGGAGATCTAA
TCCGGATAGTTCTTATTTACTAAGACCAGTAGAAAAAAATAGATGGGTGTTTATGACTGATCAACAAAAGGGTTTAATTCCCGCCATTGAGAAAGTGGCTCCTAACGCTG
AACATAGGATTTGTGTATGCCATTTGTATGGTAATTTCAAGAAACATTTCAAAGGTGCTGAAAGATTTGAGGTAAAAGATGGGAGGAAACAGTTCATAGTAGACCTCAAT
ATGCATACATGTACTTGCTATAGATGGGATTTGACAGGAATACCGTGCGCTCACTCCATTCAATGTATTTGGTTCATGAGGAAAGACTCCAAAGATTATGTGCATAGTAG
GTGTGGAAAAAATGGCCATAATATTAGATCTTGCAAAGGCCAAGTCAAAGGAAAATCATCAACTTCTAAACTTCCTCTCGGACGGCCGAGATCACCACTACTAGCGTTGA
CAGACGCTAAGTATGAGGGCCGAGGTGAACCTGGCCAAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATGCGAACTACTGTCAGCGGCGTGATTATCGAGCATAAT
CGTCGGTCCCGAGATTACCGGGTCCGCTCAAGTGTTCAGGTCGGTCCGCAGGCCGAGTTCGAGCTGCAATTTGAAATACGTTGTTATGGTTCAACCCGCAAACTCGACCA
ATACGGCGGATCGAAGGACCCTCGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGC
AGGTCGGCACGGATCACCGCGCCTGCCTTACAACCTGCGCACCCGAGGACGTCCAAGGCCACTCGTGGCCGAGGTGGACCTCTAAGAAGGGCGTCCGGGGTCCAGCCCCG
GCTCCAACAAGCGAGAACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGC
AGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGAGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGT
ACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACATAGGAGCTCCAACCAGCAGGCT
GAATCCTCTCATAATCCCGCAGGGATAATCACAAGAGAGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGA
CGATTCACTAAACGATGGCGACTTGAGAGAATCGCCATTCACCTCGGACGTTTTGGAATTACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGA
CGAAGGACCCCAAGGACTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAG
TTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGGCAAAATGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCA
GGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATCTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGATGAGGCCCCGG
CCACCTTCGCCGAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGACCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGG
GCGGAGAGCGGACATACCAGGAGCCAACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACT
CAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGC
GCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGATATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGGAAAAGGAAGAGCGAAAGTGTTCAAGGACGCCGCCC
CGGCGCACTGACCGACCAGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGAAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTG
CATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATC
ATGTGCTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCG
ACACCGCTGGTCGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGT
GGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCA
GTGGCGTGGGCACGGTCCTAGGAGAACAAACCACTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGG
ACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGACGCACCCACTGAGGAGCTGGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGCCAGATCTAATGGAGAT
CGGCGCTCCAGAATTCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGAAACTCACCACAAGACCCCAAGGAGTGCAGAAAGTTGGCAAGGCAGGCAGCTCGGTTCG
TGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAACCTACGACAAATGAGGAA
GAGCTGGTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAACTACGCCTGGCTGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACT
TCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGCTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTG
GGACGTACGTATTGGCCGATCTACAAGGAGACGTTCTCGCGCACCCGTGGAACGCTGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAATTGCGTTGCTCCTAAGAAAATACTACATATATATGTAGAGCACCCTAACAATATTTTTGATGAAGGAGAAAGCTTTATGGATGTTGAAGTGATTGCAGAAAC
ACCTATTGCCATAAAAGAGACACATAATAGTTTACAATTAGATGATATTTGTGGTAGTAGCGATAGAGGAATGCACATAGAAGAACTACTAAACCTTGATTATGATGGAG
AAGATGATGACGAATTGTTTATTGGAAAGTTTGATGGTGTGGGGTTAAACTTAAGAGATGGTGGTGTAGGTTTACATTTAGAGAATGAAGAATTGTGTAATTCTGAATGT
GGTTCATTTAGTCACCTAAACTCACCCATTCAATCTGATGATGATGTTGTCGAGGGATCACAAAAGTATGTTGAGTTTCATCCTTCTATGTTGAAGGATAACATAGAATT
TACATTAGGGATGAAATTTGGAACATTAACTGAATTGAGAGATGCAATAAGAGAATATTCCTTACGATCTGGACATGAATTGATTTACCAGAAAAATGAACCTGAAAGGG
TTACAGTGAGATGTCAAGATGGGTGCAATTGGAGATTTAGGGCTCAACTGGAAAAGGATTGGAAAACAATTGTAGATCTTGTGAGAAAGGACTTTAATATTAACATATCA
AGACATCAAGCATATAGAGCAAGGAAGATTGCAATGCATAAGCTTGAAGGCACAGTTGAGATGCAATATGCCAAACTACGAGACTACTGTGAGGAATTGTGGAGATCTAA
TCCGGATAGTTCTTATTTACTAAGACCAGTAGAAAAAAATAGATGGGTGTTTATGACTGATCAACAAAAGGGTTTAATTCCCGCCATTGAGAAAGTGGCTCCTAACGCTG
AACATAGGATTTGTGTATGCCATTTGTATGGTAATTTCAAGAAACATTTCAAAGGTGCTGAAAGATTTGAGGTAAAAGATGGGAGGAAACAGTTCATAGTAGACCTCAAT
ATGCATACATGTACTTGCTATAGATGGGATTTGACAGGAATACCGTGCGCTCACTCCATTCAATGTATTTGGTTCATGAGGAAAGACTCCAAAGATTATGTGCATAGTAG
GTGTGGAAAAAATGGCCATAATATTAGATCTTGCAAAGGCCAAGTCAAAGGAAAATCATCAACTTCTAAACTTCCTCTCGGACGGCCGAGATCACCACTACTAGCGTTGA
CAGACGCTAAGTATGAGGGCCGAGGTGAACCTGGCCAAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATGCGAACTACTGTCAGCGGCGTGATTATCGAGCATAAT
CGTCGGTCCCGAGATTACCGGGTCCGCTCAAGTGTTCAGGTCGGTCCGCAGGCCGAGTTCGAGCTGCAATTTGAAATACGTTGTTATGGTTCAACCCGCAAACTCGACCA
ATACGGCGGATCGAAGGACCCTCGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGC
AGGTCGGCACGGATCACCGCGCCTGCCTTACAACCTGCGCACCCGAGGACGTCCAAGGCCACTCGTGGCCGAGGTGGACCTCTAAGAAGGGCGTCCGGGGTCCAGCCCCG
GCTCCAACAAGCGAGAACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGC
AGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGAGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGT
ACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACATAGGAGCTCCAACCAGCAGGCT
GAATCCTCTCATAATCCCGCAGGGATAATCACAAGAGAGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGA
CGATTCACTAAACGATGGCGACTTGAGAGAATCGCCATTCACCTCGGACGTTTTGGAATTACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGA
CGAAGGACCCCAAGGACTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAG
TTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGGCAAAATGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCA
GGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATCTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGATGAGGCCCCGG
CCACCTTCGCCGAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGACCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGG
GCGGAGAGCGGACATACCAGGAGCCAACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACT
CAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGC
GCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGATATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGGAAAAGGAAGAGCGAAAGTGTTCAAGGACGCCGCCC
CGGCGCACTGACCGACCAGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGAAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTG
CATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATC
ATGTGCTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCG
ACACCGCTGGTCGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGT
GGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCA
GTGGCGTGGGCACGGTCCTAGGAGAACAAACCACTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGG
ACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGACGCACCCACTGAGGAGCTGGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGCCAGATCTAATGGAGAT
CGGCGCTCCAGAATTCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGAAACTCACCACAAGACCCCAAGGAGTGCAGAAAGTTGGCAAGGCAGGCAGCTCGGTTCG
TGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAACCTACGACAAATGAGGAA
GAGCTGGTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAACTACGCCTGGCTGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACT
TCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGCTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTG
GGACGTACGTATTGGCCGATCTACAAGGAGACGTTCTCGCGCACCCGTGGAACGCTGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MFNCVAPKKILHIYVEHPNNIFDEGESFMDVEVIAETPIAIKETHNSLQLDDICGSSDRGMHIEELLNLDYDGEDDDELFIGKFDGVGLNLRDGGVGLHLENEELCNSEC
GSFSHLNSPIQSDDDVVEGSQKYVEFHPSMLKDNIEFTLGMKFGTLTELRDAIREYSLRSGHELIYQKNEPERVTVRCQDGCNWRFRAQLEKDWKTIVDLVRKDFNINIS
RHQAYRARKIAMHKLEGTVEMQYAKLRDYCEELWRSNPDSSYLLRPVEKNRWVFMTDQQKGLIPAIEKVAPNAEHRICVCHLYGNFKKHFKGAERFEVKDGRKQFIVDLN
MHTCTCYRWDLTGIPCAHSIQCIWFMRKDSKDYVHSRCGKNGHNIRSCKGQVKGKSSTSKLPLGRPRSPLLALTDAKYEGRGEPGQGPTYREARWGPMRTTVSGVIIEHN
RRSRDYRVRSSVQVGPQAEFELQFEIRCYGSTRKLDQYGGSKDPRCQRCPPEGGRSSSGRGARSRRPSNRTPPQVGTDHRACLTTCAPEDVQGHSWPRWTSKKGVRGPAP
APTSENFDALQREMEAMRTQMRSMEEMYNEMMLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQA
ESSHNPAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLELPIPPKFKAPTVKPYDGTKDPKDYVEIALTGSARLWYRRLPARSISTYSQLRRE
FLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVAHCSDDSAMCYLLTGLADEALTVKLGDEAPATFAEIGRGRSGKDVERADPKSKDKGSFSSGRAEYRR
AESGHTRSQPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKIFVGKPRTSSAEEKEERKCSRTPP
RRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVLGEQTTSRECYASALKGSSVCALETLAGRDG
TLEFEADLPRKEFDAPTEELELVPLLSPEKQPDLMEIGAPEFSWMDPIADFIRGNSPQDPKECRKLARQAARFVVRDGALYRRGFSLPLLRCLTPEEGLVEHYEPTTNEE
ELVLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRLRAFQVGHLVLRRVQTHVAALDPAWEGPFEVKGIVRPGTYVLADLQGDVLAHPWNAEHLKRYYP