; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g17840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g17840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:14097256..14112528
RNA-Seq ExpressionMoc09g17840
SyntenyMoc09g17840
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.7e-27793.37Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVE LKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IA TGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTK GRP+RKIGRGRSGKD+E ADP+SKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVV+ RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]5.0e-25380.72Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVE LKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YD +KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIA TGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTK GRP+R I RGRSGKD E+AD +SKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG
        P+GGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVV+RRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVG
Subjt:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYA
        FS ESVIPEGCIDLPVTLG DQT+VTQMAEFVVIDGRS YNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA
Subjt:  FSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYA

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.8e-1152.33Show/hide
Query:  ACRKHHASALKGSSVCALETLAGRDGPFEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMKIGAPE
        A R+ +ASALKGSSVCALETL  RDG  EF+A+LPR+EFAAPTEELELVPLL  +   ++     ++   + + ++ D+   G PE
Subjt:  ACRKHHASALKGSSVCALETLAGRDGPFEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMKIGAPE

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.3e-22495.02Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDF AASDAIKCRAFQIA TGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTK GRP RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKI

Query:  GRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDC
        GRGRSGKDVERADP+SKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSDC
Subjt:  GRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVLRRVL
        LPHNDA VIAPLIDHVV+RRVL
Subjt:  LPHNDALVIAPLIDHVVLRRVL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.4e-25478.25Show/hide
Query:  QAESSHNP--AGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDA
        +AESS+NP   G+ITREEFDQL+ + DAQVE LKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYD +KDPKDYVEVFE LMDFQAA+DA
Subjt:  QAESSHNP--AGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
        IKC AFQIA TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT
Subjt:  IKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT

Query:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        VKL EEAPATFAEVLQK KKVIDGQELLRTK GRP++ I +GR+GKD  +AD +S+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE
Subjt:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN         
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
            K+KELAR ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V++RR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYALI
        S+  EGCIDLPV++ QD T+VTQMAEFVVIDGRS YNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYA +
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYALI

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia]6.8e-21887.55Show/hide
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTK
        + KRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVE LKAKCEQKDDSLNDGDLGE PFTSDVLEAPIPPKFKAPTVKPYD TK
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTK

Query:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
        DPKDYVEVFEGLMDFQAASDAIKCRAFQIA TGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA

Query:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSR
        HCSDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTK GRP+RKIGRGRSGKDVERADP+SKDKGSFSSGRAEYRRAENGPT  R
Subjt:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSR

Query:  PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNIEESGMEKLLKR EKLRGAPERR KDKYCRFHREH HNTS+CWELKRQIEDLIQDGYFKKFVG PRTSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEE
        PRRTDRPAVINTIFGGP+GGQS HKRK+LARAARREVCIIREQGPTCPITFD ADLEE
Subjt:  PRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088138.2e-27893.37Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVE LKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IA TGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTK GRP+RKIGRGRSGKD+E ADP+SKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVV+ RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.4e-25380.72Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVE LKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YD +KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIA TGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTK GRP+R I RGRSGKD E+AD +SKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG
        P+GGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVV+RRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVG
Subjt:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYA
        FS ESVIPEGCIDLPVTLG DQT+VTQMAEFVVIDGRS YNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA
Subjt:  FSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYA

A0A6J1D9E1 uncharacterized protein LOC1110188231.8e-1152.33Show/hide
Query:  ACRKHHASALKGSSVCALETLAGRDGPFEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMKIGAPE
        A R+ +ASALKGSSVCALETL  RDG  EF+A+LPR+EFAAPTEELELVPLL  +   ++     ++   + + ++ D+   G PE
Subjt:  ACRKHHASALKGSSVCALETLAGRDGPFEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMKIGAPE

A0A6J1D9E1 uncharacterized protein LOC1110188236.2e-22595.02Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDF AASDAIKCRAFQIA TGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTK GRP RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKI

Query:  GRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDC
        GRGRSGKDVERADP+SKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSDC
Subjt:  GRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVLRRVL
        LPHNDA VIAPLIDHVV+RRVL
Subjt:  LPHNDALVIAPLIDHVVLRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204791.7e-25478.25Show/hide
Query:  QAESSHNP--AGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDA
        +AESS+NP   G+ITREEFDQL+ + DAQVE LKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYD +KDPKDYVEVFE LMDFQAA+DA
Subjt:  QAESSHNP--AGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
        IKC AFQIA TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT
Subjt:  IKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT

Query:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        VKL EEAPATFAEVLQK KKVIDGQELLRTK GRP++ I +GR+GKD  +AD +S+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE
Subjt:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN         
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
            K+KELAR ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V++RR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVLRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYALI
        S+  EGCIDLPV++ QD T+VTQMAEFVVIDGRS YNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYA +
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYALI

A0A6J1DXR9 uncharacterized protein LOC1110251093.3e-21887.55Show/hide
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTK
        + KRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVE LKAKCEQKDDSLNDGDLGE PFTSDVLEAPIPPKFKAPTVKPYD TK
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTK

Query:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
        DPKDYVEVFEGLMDFQAASDAIKCRAFQIA TGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA

Query:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSR
        HCSDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTK GRP+RKIGRGRSGKDVERADP+SKDKGSFSSGRAEYRRAENGPT  R
Subjt:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKRKIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSR

Query:  PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNIEESGMEKLLKR EKLRGAPERR KDKYCRFHREH HNTS+CWELKRQIEDLIQDGYFKKFVG PRTSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEE
        PRRTDRPAVINTIFGGP+GGQS HKRK+LARAARREVCIIREQGPTCPITFD ADLEE
Subjt:  PRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAAGCGCAGGGTCTCGATCTGAAAATCGAGTGGTGCGCGTGGACGT
ACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGACCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACA
GAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACGAGGGAG
GAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGACCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTT
CACCTCGGACGTGTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATCGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCC
TCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGTTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTTGCCACCATCAGGCAGAAGGAGGGTGAGACGCT
GCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGG
TGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTTCTCCGAACCAAAATCGGCCGACCGAAGCGA
AAGATCGGCCGGGGCAGAAGTGGGAAAGATGTAGAAAGGGCAGATCCCGAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGG
ACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATTGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTG
AGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGTTTCCATCGGGAGCACAGCCACAACACGTCGGATTGTTGGGAGTTGAAGCGCCAAATTGAG
GATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGACGCACCGA
CCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGG
AACAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTACCCCATAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGCTC
AGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGGAGCCCGACACCGCTAGT
TGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATTGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGATG
GTAGATCGACCTATAACGCCATCTTTGGGCGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGC
ACGGTCCGAGGAGAACAGGCCGCTTCGAGGGAGTGTTATGCCTTGATTACGCTTGTTTACATGCGTAAAAAGAAGGTTTTAGATGCTGTTTTTACTATGACCATTTGGGA
GTGCAAATTAATCAAAAGAAGCAAAAAGAGGGAAAAACCTTCATGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAA
CGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTACGTTAAAGTGCATGAGCACAAGA
TCTTTTCTTCTACCCCTTGACCCTGAGATTGAACGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAAT
CAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCGATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGAA
ATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTT
AGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACA
TAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAA
TGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAAC
GATCTCATTTGTTCATTCTGCAGTGAAAACCATATCTATGATAATTGTCCACATAACTCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATA
TTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGCGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCT
ATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTTCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAG
GAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGG
TTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGACATATGATGGACCAACAATGCCAACAACAG
ATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAAT
AACTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAGTCCTATTGA
AAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATA
ACGACACTTTACCAGTTCGAGAAGTCATGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGT
CATCCATCTTCCACTGTTAGTTTCAAAGTCTTTCTACGAGTAGCTGTGGTTGAAACCGATGTTAGTGTGTGGAAACCTCAGGGGCTAGTAGCCGGTCTTTCAGATGTAGA
AAGTCTTAGTGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGGAAAAACCTTCATGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACATCATGCCT
CCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCTGGCAGGGATGGGCCGTTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAG
GAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGAA
GATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGGCAACCCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGT
TCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACGCATGTGGGTGCCCTTGAT
CCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCT
GAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAAGCGCAGGGTCTCGATCTGAAAATCGAGTGGTGCGCGTGGACGT
ACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGACCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACA
GAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACGAGGGAG
GAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGACCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTT
CACCTCGGACGTGTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATCGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCC
TCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGTTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTTGCCACCATCAGGCAGAAGGAGGGTGAGACGCT
GCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGG
TGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTTCTCCGAACCAAAATCGGCCGACCGAAGCGA
AAGATCGGCCGGGGCAGAAGTGGGAAAGATGTAGAAAGGGCAGATCCCGAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGG
ACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATTGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTG
AGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGTTTCCATCGGGAGCACAGCCACAACACGTCGGATTGTTGGGAGTTGAAGCGCCAAATTGAG
GATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGACGCACCGA
CCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGG
AACAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTACCCCATAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGCTC
AGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGGAGCCCGACACCGCTAGT
TGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATTGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGATG
GTAGATCGACCTATAACGCCATCTTTGGGCGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGC
ACGGTCCGAGGAGAACAGGCCGCTTCGAGGGAGTGTTATGCCTTGATTACGCTTGTTTACATGCGTAAAAAGAAGGTTTTAGATGCTGTTTTTACTATGACCATTTGGGA
GTGCAAATTAATCAAAAGAAGCAAAAAGAGGGAAAAACCTTCATGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAA
CGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTACGTTAAAGTGCATGAGCACAAGA
TCTTTTCTTCTACCCCTTGACCCTGAGATTGAACGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAAT
CAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCGATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGAA
ATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTT
AGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACA
TAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAA
TGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAAC
GATCTCATTTGTTCATTCTGCAGTGAAAACCATATCTATGATAATTGTCCACATAACTCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATA
TTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGCGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCT
ATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTTCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAG
GAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGG
TTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGACATATGATGGACCAACAATGCCAACAACAG
ATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAAT
AACTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAGTCCTATTGA
AAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATA
ACGACACTTTACCAGTTCGAGAAGTCATGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGT
CATCCATCTTCCACTGTTAGTTTCAAAGTCTTTCTACGAGTAGCTGTGGTTGAAACCGATGTTAGTGTGTGGAAACCTCAGGGGCTAGTAGCCGGTCTTTCAGATGTAGA
AAGTCTTAGTGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGGAAAAACCTTCATGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACATCATGCCT
CCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCTGGCAGGGATGGGCCGTTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAG
GAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGAA
GATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGGCAACCCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGT
TCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACGCATGTGGGTGCCCTTGAT
CCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCT
GAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MEAMRTQMRSMEEMYNEMMLAASAGSRSENRVVRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITRE
EFDQLRGELDAQVETLKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDRTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAFTGSARLWYRRLPARSIS
TYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKIGRPKR
KIGRGRSGKDVERADPESKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIE
DLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVL
RRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVG
TVRGEQAASRECYALITLVYMRKKKVLDAVFTMTIWECKLIKRSKKREKPSWRRQAPGKPAENSFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGTLKCMSTR
SFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIIEHFF
RGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVN
DLICSFCSENHIYDNCPHNSASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTSPVQNNNSNLENMMK
EYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLTYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAIN
NLNPVMFDEFYDLLVTEIEEELDKIAEGPEDVASPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVREVMQHIYNLRASLDFAVLPSWPPALAAILG
HPSSTVSFKVFLRVAVVETDVSVWKPQGLVAGLSDVESLSDHLGVQINQKKQKDGKTFMEAPGAWEACRKHHASALKGSSVCALETLAGRDGPFEFEADLPRKEFAAPTE
ELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMKIGAPESSWMDPIADFIKGNPPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLRCLTPEEGLRVQTHVGALD
PAWEGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP