; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g07150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g07150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:5177740..5181998
RNA-Seq ExpressionMoc06g07150
SyntenyMoc06g07150
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.2e-26289.77Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPK+YVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGS RLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                         IGRG+SGKDIE ADPKSKD GSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSGG
        ESGMEKLLKR EKLRGAPERRSKDKYCR HREHGHNTSD WELKRQIE+LIQDG+FKKFVGK RTSSAEKKEERKRSRTPPRRTDRPAVINTIF GPSGG
Subjt:  ESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSGG

Query:  QSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAAR EVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGYIDLPVTLGQDQTQVSQMAEFV
        SVIPEG+IDLPVTLGQDQTQV+QMAEFV
Subjt:  SVIPEGYIDLPVTLGQDQTQVSQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.0e-26178.92Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPK+YVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGS RLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAE                         I RG+SGKD EKAD KSKD GSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRG
        TNIEESGMEKLLKR EKLRGAPERR+KDKYCR HREH HNTSD WELKRQIEDLIQD +FKKFVGK RTSSAEKKEERK SRTP RR DRPAVINTIF G
Subjt:  TNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRG

Query:  PSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELARAAR EVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETL
        FS ESVIPEG IDLPVTLG DQTQV+QMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGS VCALETL
Subjt:  FSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETL

Query:  AGRDGTLEFEANLPRREFAAPTEELELVPLL
          RDGTLEF+ANLPRREFAAPTEELELVPLL
Subjt:  AGRDGTLEFEANLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]2.2e-21187.73Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAE------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEAPATFAE                  IG+G+SGKD+E  DPKSKD GSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAE------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIF
        ILTNIEESGMEKLLKR EKLRGAPERRSKDKYCR HREHGHNTSD WELK QIEDLIQDG+FKKFVGK RTSSAEKKEERKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIF

Query:  RGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPL
         GPSGGQSGHKRK+LARAAR EVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL KSPTPL
Subjt:  RGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPL

Query:  VGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALE
        VGFSGESV+PEG IDLPVTLGQDQT+V+QMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG+ VCALE
Subjt:  VGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALE

Query:  TLAGRDGTLEFEANLPRREFAAPTEELELVPLLSPEKQGQ
        TL  RDGTLEFEA+LP REFAAP EELELVPLLS EKQ Q
Subjt:  TLAGRDGTLEFEANLPRREFAAPTEELELVPLLSPEKQGQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.6e-24174.88Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD
        +AESS+NP TP GVITR EFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPK+YVEVFE LMDFQAA+D
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGS RLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI
        TVKL EEAPATFAE                         I +G++GKD  KAD KS+D G S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSG
        EE+GMEKLLKR EKLRG PE+R+ DKYCR HR+HGHNTS+ WELKRQIEDLIQDG+FKKFVGK R++S EKKEERKR RTPPRR DRPAVIN        
Subjt:  EESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSG

Query:  GQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSG
             K+KELAR AR EVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL KSPTPLVGFSG
Subjt:  GQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSG

Query:  ESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETLAGR
        ES+  EG IDLPV++ QD TQV+QMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K S VCALE    R
Subjt:  ESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETLAGR

Query:  D
        D
Subjt:  D

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]4.6e-20170.77Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGS RLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF E                         I + K  ++  KAD KS+D GS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKR EKLRG  E+R+K+KYCR HR+HGHNT+ CWELKRQIEDLIQDG+FKKFVGK R++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFRGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP
        TIF GP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVL+DGG                         
Subjt:  TIFRGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVC
                        IDLPVT+GQD TQV+QMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS VC
Subjt:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVC

Query:  ALETLAGRDGTLEFEANLP---RREFAAPTEELELVPLLSPEKQ
        ALE    R    E EA+LP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGTLEFEANLP---RREFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.0e-26389.77Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPK+YVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGS RLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                         IGRG+SGKDIE ADPKSKD GSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSGG
        ESGMEKLLKR EKLRGAPERRSKDKYCR HREHGHNTSD WELKRQIE+LIQDG+FKKFVGK RTSSAEKKEERKRSRTPPRRTDRPAVINTIF GPSGG
Subjt:  ESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSGG

Query:  QSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAAR EVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGYIDLPVTLGQDQTQVSQMAEFV
        SVIPEG+IDLPVTLGQDQTQV+QMAEFV
Subjt:  SVIPEGYIDLPVTLGQDQTQVSQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.1e-26278.92Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPK+YVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGS RLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAE                         I RG+SGKD EKAD KSKD GSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRG
        TNIEESGMEKLLKR EKLRGAPERR+KDKYCR HREH HNTSD WELKRQIEDLIQD +FKKFVGK RTSSAEKKEERK SRTP RR DRPAVINTIF G
Subjt:  TNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRG

Query:  PSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELARAAR EVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETL
        FS ESVIPEG IDLPVTLG DQTQV+QMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGS VCALETL
Subjt:  FSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETL

Query:  AGRDGTLEFEANLPRREFAAPTEELELVPLL
          RDGTLEF+ANLPRREFAAPTEELELVPLL
Subjt:  AGRDGTLEFEANLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198991.1e-21187.73Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAE------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEAPATFAE                  IG+G+SGKD+E  DPKSKD GSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAE------------------IGRGKSGKDIEKADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIF
        ILTNIEESGMEKLLKR EKLRGAPERRSKDKYCR HREHGHNTSD WELK QIEDLIQDG+FKKFVGK RTSSAEKKEERKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIF

Query:  RGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPL
         GPSGGQSGHKRK+LARAAR EVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL KSPTPL
Subjt:  RGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPL

Query:  VGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALE
        VGFSGESV+PEG IDLPVTLGQDQT+V+QMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG+ VCALE
Subjt:  VGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALE

Query:  TLAGRDGTLEFEANLPRREFAAPTEELELVPLLSPEKQGQ
        TL  RDGTLEFEA+LP REFAAP EELELVPLLS EKQ Q
Subjt:  TLAGRDGTLEFEANLPRREFAAPTEELELVPLLSPEKQGQ

A0A6J1DHB3 uncharacterized protein LOC1110204797.6e-24274.88Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD
        +AESS+NP TP GVITR EFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPK+YVEVFE LMDFQAA+D
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGS RLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI
        TVKL EEAPATFAE                         I +G++GKD  KAD KS+D G S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSG
        EE+GMEKLLKR EKLRG PE+R+ DKYCR HR+HGHNTS+ WELKRQIEDLIQDG+FKKFVGK R++S EKKEERKR RTPPRR DRPAVIN        
Subjt:  EESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVINTIFRGPSG

Query:  GQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSG
             K+KELAR AR EVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL KSPTPLVGFSG
Subjt:  GQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSG

Query:  ESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETLAGR
        ES+  EG IDLPV++ QD TQV+QMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K S VCALE    R
Subjt:  ESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVCALETLAGR

Query:  D
        D
Subjt:  D

A0A6J1DZB9 uncharacterized protein LOC1110249042.2e-20170.77Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGS RLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF E                         I + K  ++  KAD KS+D GS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAE-------------------------IGRGKSGKDIEKADPKSKDNGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKR EKLRG  E+R+K+KYCR HR+HGHNT+ CWELKRQIEDLIQDG+FKKFVGK R++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFRGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP
        TIF GP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVL+DGG                         
Subjt:  TIFRGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVC
                        IDLPVT+GQD TQV+QMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS VC
Subjt:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWVC

Query:  ALETLAGRDGTLEFEANLP---RREFAAPTEELELVPLLSPEKQ
        ALE    R    E EA+LP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGTLEFEANLP---RREFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCAATGCGCACACAAATGCGGTCCATGAAGGAAATGTATAACGAGATAATACTAGCTGCAGGCGCAGGATCCCAATCTGAAAATCGAATGACGCGCATTGACAT
ACGCGAGCAAAGGGGTTCCCACCTCGGCCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGTCGGAGGGGAGACCTCCGTGAGCATCTCAACA
GAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCTGCAGGGGTGATC
ACAAGGGCAGAGTTCGACCAGCTGAGGGGCAAGCTTGACGCTCAGGTGGAGGCTTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGA
ATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGAATTATGTTGAGGTCT
TTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGTGCGATTGTGGTATCGGAGACTGCCAGCCAGG
TCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGG
TGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCTGACGAAG
CCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATCGGTCGGGGCAAAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAATGGA
TCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGGCCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACGACGATTCCAATTTCCGAGATCCTAAC
GAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTTCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCCTCCATCGGGAGCACG
GCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCCACTTCAAGAAATTTGTGGGAAAGCACAGGACCAGCTCGGCAGAGAAA
AAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGAACTGACCGACCTGCGGTCATCAATACCATTTTCAGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAA
GGAGTTAGCTCGTGCAGCTAGGCCCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACA
ATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTG
GGATGGACGAGGTCGCAATTGACGAAAAGTCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTACATCGACTTGCCGGTCACACTTGGGCAGGA
CCAAACTCAGGTCAGCCAAATGGCCGAGTTCGTGGTAATTGATGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGA
CACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATGGGTC
TGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCAACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACT
TAGTCCCGAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGC
TTTTCCCTGCCTCTATTGAGATGCCTATCCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCAATGCGCACACAAATGCGGTCCATGAAGGAAATGTATAACGAGATAATACTAGCTGCAGGCGCAGGATCCCAATCTGAAAATCGAATGACGCGCATTGACAT
ACGCGAGCAAAGGGGTTCCCACCTCGGCCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGTCGGAGGGGAGACCTCCGTGAGCATCTCAACA
GAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCTGCAGGGGTGATC
ACAAGGGCAGAGTTCGACCAGCTGAGGGGCAAGCTTGACGCTCAGGTGGAGGCTTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGA
ATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGAATTATGTTGAGGTCT
TTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGTGCGATTGTGGTATCGGAGACTGCCAGCCAGG
TCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGG
TGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCTGACGAAG
CCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATCGGTCGGGGCAAAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAATGGA
TCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGGCCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACGACGATTCCAATTTCCGAGATCCTAAC
GAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTTCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCCTCCATCGGGAGCACG
GCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCCACTTCAAGAAATTTGTGGGAAAGCACAGGACCAGCTCGGCAGAGAAA
AAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGAACTGACCGACCTGCGGTCATCAATACCATTTTCAGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAA
GGAGTTAGCTCGTGCAGCTAGGCCCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACA
ATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTG
GGATGGACGAGGTCGCAATTGACGAAAAGTCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTACATCGACTTGCCGGTCACACTTGGGCAGGA
CCAAACTCAGGTCAGCCAAATGGCCGAGTTCGTGGTAATTGATGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGA
CACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATGGGTC
TGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCAACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACT
TAGTCCCGAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGC
TTTTCCCTGCCTCTATTGAGATGCCTATCCCCTGA
Protein sequenceShow/hide protein sequence
MEAMRTQMRSMKEMYNEIILAAGAGSQSENRMTRIDIREQRGSHLGPIEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVI
TRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSVRLWYRRLPAR
SISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEIGRGKSGKDIEKADPKSKDNG
SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRSEKLRGAPERRSKDKYCRLHREHGHNTSDCWELKRQIEDLIQDGHFKKFVGKHRTSSAEK
KEERKRSRTPPRRTDRPAVINTIFRGPSGGQSGHKRKELARAARPEVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL
GWTRSQLTKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSWV
CALETLAGRDGTLEFEANLPRREFAAPTEELELVPLLSPEKQGQFTTRPQGAQKVGKASSSVRGPRWSIVPTRLFPASIEMPIP