; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:9586367..9591911
RNA-Seq ExpressionMoc03g14220
SyntenyMoc03g14220
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.4e-24987.5Show/hide
Query:  QAESSHN---PAGVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAGVITREEFDQLRG+LDAQVEALKAKCEQKE  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRH DKKTATHLATIRQKEGETLREYV+RFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTGRPERKIG GRSGKDIE ADPKSKDKGSFS  RA YRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LI+DGYFKKFVGKPRTSSAEKKEERKRSRTP RRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE
        QSG KRKELARAARREVCIIRE                                     RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]2.7e-20593.18Show/hide
Query:  VITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        +ITREEFDQLRG+LDAQ EALKAKCEQKE  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTG
Subjt:  VITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFT
        SARLWYRRLPARSISTYSQLRREFLAQFSSRH DKKTATHLATIRQKEGETLREYV+RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP+TFT
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFT

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI  GRSGKDIE+ DPKSKDKGSFS  R  YRRAENGPTRSRPYERFTPTTIPI EILT IEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELKRQIEDLI+DGYFKKFVGKPRTSSAEKKEERKRSRTP RRTDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKEL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.0e-24876.59Show/hide
Query:  SSNQQAESSHNPA---GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   GVITREEFDQLRGKL+AQVEALKAKCEQKE  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATF EVLQKAKKVIDGQELLRTKTGRPER I  GRSGKD E+AD KSKDKGSFS  RA +RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLI+D YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG
        PSGGQSGHKRKELARAARREVCIIRE                                     RVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGVLEFEADLSRREFAAPTEELELI
          RDG LEF+A+L RREFAAPTEELEL+
Subjt:  AGRDGVLEFEADLSRREFAAPTEELELI

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]6.0e-25364.62Show/hide
Query:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LA +  HQREVGA  VEGQGH+ L TEPL R ARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMQSMEEMYNEMILAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGSPSRSHRSSNQQAESSHNP--AGVITREEFDQLRGKLDAQVEALK
                                                                           AESS+NP   GVITREEFDQL+ K DAQVEALK
Subjt:  TQMQSMEEMYNEMILAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGSPSRSHRSSNQQAESSHNP--AGVITREEFDQLRGKLDAQVEALK

Query:  AKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRR
        A+CE+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+
Subjt:  AKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRR

Query:  EFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGR
        EF++QFSSRH D+KT THLATIRQKEGETLREYV+RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATF EVLQK KKVIDGQELLRTKTGR
Subjt:  EFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGR

Query:  PERKIGWGRSGKDIERADPKSKDKG-SFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHG
        PE+ I  GR+GKD  +AD KS+DKG S S  R  YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HG
Subjt:  PERKIGWGRSGKDIERADPKSKDKG-SFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHG

Query:  HNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE------------
        HNTS+ WELKRQIEDLI+DGYFKKFVGKPR++S EKKEERKR RTP RR DRPAVIN             K+KELAR ARREVCIIRE            
Subjt:  HNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE------------

Query:  -------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRS
                                 R+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRS
Subjt:  -------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        AYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  AYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]1.3e-21595.56Show/hide
Query:  GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQVEALKAKCEQK+DSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRH DKKTATHLATIRQKEGETLREYV+RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  TEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE
         EVLQKAKKVIDGQELLRTKTGRPERKIG GRSGKD+ERADPKSKDKGSFS  RA YRRAENGPTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Subjt:  TEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLI+DGYFKKFVGKPRTSSAEKKEERKRSRTP RRTDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.7e-25087.5Show/hide
Query:  QAESSHN---PAGVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAGVITREEFDQLRG+LDAQVEALKAKCEQKE  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRH DKKTATHLATIRQKEGETLREYV+RFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTGRPERKIG GRSGKDIE ADPKSKDKGSFS  RA YRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LI+DGYFKKFVGKPRTSSAEKKEERKRSRTP RRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE
        QSG KRKELARAARREVCIIRE                                     RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1CKB3 uncharacterized protein LOC1110120811.3e-20593.18Show/hide
Query:  VITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        +ITREEFDQLRG+LDAQ EALKAKCEQKE  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTG
Subjt:  VITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFT
        SARLWYRRLPARSISTYSQLRREFLAQFSSRH DKKTATHLATIRQKEGETLREYV+RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP+TFT
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFT

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI  GRSGKDIE+ DPKSKDKGSFS  R  YRRAENGPTRSRPYERFTPTTIPI EILT IEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELKRQIEDLI+DGYFKKFVGKPRTSSAEKKEERKRSRTP RRTDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKEL

A0A6J1D9E1 uncharacterized protein LOC1110188239.7e-24976.59Show/hide
Query:  SSNQQAESSHNPA---GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   GVITREEFDQLRGKL+AQVEALKAKCEQKE  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATF EVLQKAKKVIDGQELLRTKTGRPER I  GRSGKD E+AD KSKDKGSFS  RA +RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLI+D YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG
        PSGGQSGHKRKELARAARREVCIIRE                                     RVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGVLEFEADLSRREFAAPTEELELI
          RDG LEF+A+L RREFAAPTEELEL+
Subjt:  AGRDGVLEFEADLSRREFAAPTEELELI

A0A6J1DHB3 uncharacterized protein LOC1110204792.9e-25364.62Show/hide
Query:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LA +  HQREVGA  VEGQGH+ L TEPL R ARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMQSMEEMYNEMILAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGSPSRSHRSSNQQAESSHNP--AGVITREEFDQLRGKLDAQVEALK
                                                                           AESS+NP   GVITREEFDQL+ K DAQVEALK
Subjt:  TQMQSMEEMYNEMILAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGSPSRSHRSSNQQAESSHNP--AGVITREEFDQLRGKLDAQVEALK

Query:  AKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRR
        A+CE+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+
Subjt:  AKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRR

Query:  EFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGR
        EF++QFSSRH D+KT THLATIRQKEGETLREYV+RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATF EVLQK KKVIDGQELLRTKTGR
Subjt:  EFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGR

Query:  PERKIGWGRSGKDIERADPKSKDKG-SFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHG
        PE+ I  GR+GKD  +AD KS+DKG S S  R  YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HG
Subjt:  PERKIGWGRSGKDIERADPKSKDKG-SFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHG

Query:  HNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE------------
        HNTS+ WELKRQIEDLI+DGYFKKFVGKPR++S EKKEERKR RTP RR DRPAVIN             K+KELAR ARREVCIIRE            
Subjt:  HNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE------------

Query:  -------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRS
                                 R+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRS
Subjt:  -------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        AYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  AYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DS95 uncharacterized protein LOC1110234216.4e-21695.56Show/hide
Query:  GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQVEALKAKCEQK+DSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRH DKKTATHLATIRQKEGETLREYV+RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLREYVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  TEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE
         EVLQKAKKVIDGQELLRTKTGRPERKIG GRSGKD+ERADPKSKDKGSFS  RA YRRAENGPTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Subjt:  TEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLI+DGYFKKFVGKPRTSSAEKKEERKRSRTP RRTDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTTGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCAGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGGTGATCACAAGGGAGGAGTTCG
ACCAGTTGCGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGATTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCG
GACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGA
CTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACT
CTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTGTGACAAAAAAACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGGGAA
TATGTCAGCAGGTTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACT
TGGAGAGGAGGCCCCGGCCACCTTCACCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAACGAAAGATCG
GCTGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCTGCCATCGAGCTGGGTATCGAAGGGCGGAGAACGGACCTACC
AGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCT
TCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAA
TTGAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGTCCCGGCGCACTGACCGACCT
GCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAAGTGTGCATCATCAGGGAGAGGGT
GCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACGCCGCTGGTTGGGTTCT
CTGGAGAATCAGTCATTCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGATCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGATGGTAGATCG
GCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCG
AGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGTTCCTCGGTCTGCGCCCTCGAAACTCTGGCCGGTAGGGATGGGGTGCTCGAGTTCGAGGCCG
ACCTGTCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTATCGGCGCTCCAGAATCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCA
CAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGATCGGTTCGTGGTCCGAAGTGGAGTATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAAC
CCCTGAAGAGGGCCTGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCG
ATCTGAAAGGGGACGTCCTTGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTTGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCAGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGGTGATCACAAGGGAGGAGTTCG
ACCAGTTGCGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGATTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCG
GACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGA
CTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACT
CTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTGTGACAAAAAAACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGGGAA
TATGTCAGCAGGTTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACT
TGGAGAGGAGGCCCCGGCCACCTTCACCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAACGAAAGATCG
GCTGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCTGCCATCGAGCTGGGTATCGAAGGGCGGAGAACGGACCTACC
AGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCT
TCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAA
TTGAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGTCCCGGCGCACTGACCGACCT
GCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAAGTGTGCATCATCAGGGAGAGGGT
GCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACGCCGCTGGTTGGGTTCT
CTGGAGAATCAGTCATTCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGATCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGATGGTAGATCG
GCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCG
AGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGTTCCTCGGTCTGCGCCCTCGAAACTCTGGCCGGTAGGGATGGGGTGCTCGAGTTCGAGGCCG
ACCTGTCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTATCGGCGCTCCAGAATCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCA
CAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGATCGGTTCGTGGTCCGAAGTGGAGTATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAAC
CCCTGAAGAGGGCCTGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCG
ATCTGAAAGGGGACGTCCTTGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMQSMEEMY
NEMILAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGSPSRSHRSSNQQAESSHNPAGVITREEFDQLRGKLDAQVEALKAKCEQKEDSLNDGDLGESPFTS
DVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHCDKKTATHLATIRQKEGETLRE
YVSRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFTEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIERADPKSKDKGSFSCHRAGYRRAENGPT
RSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIEDGYFKKFVGKPRTSSAEKKEERKRSRTPSRRTDRP
AVINTIFGGPSGGQSGHKRKELARAARREVCIIRERVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRS
AYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGVLEFEADLSRREFAAPTEELELIGAPESSWMDPIADFIRGNSP
QDPKERRKLARRADRFVVRSGVLYRRGFSLPLLRCLTPEEGLRVQTHVGALDPAWEGPFEVKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP