CuGenDBv2

Moc03g01140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g01140
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr3:820959..825482
RNA-Seq Expression: Moc03g01140
Synteny: Moc03g01140
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
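
The genome location field above uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the locus span can be derived as follows; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr3:820959..825482")
span = end - start + 1  # inclusive span of the gene locus in bp
print(chrom, start, end, span)
```

Under that assumption the locus covers 4524 bp, which is longer than the CDS listed further down, so the gene model presumably includes non-coding sequence.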


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]; e-value: 1.4e-248; % identity: 86.74
Query:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD
        +AESS N    AG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDG KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIE
        TVKLGEE  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPI EILTNIE
Subjt:  TVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIE

Query:  ESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD---------------
        ESGM+KLLKRPEKLRGAPERR+KDKYCRFHREHGHNTSD WELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP  TD               
Subjt:  ESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD---------------

Query:  -------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
                           +REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  -------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVILEGCIDLPVTLGQDQTRVTQMAEFV
        SVI EG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVILEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]; e-value: 6.7e-246; % identity: 75.59
Query:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ
        SSNQQAESSHN A   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEIL
        DEALTVKLG+E  ATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPT IPI EIL
Subjt:  DEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEIL

Query:  TNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP----------------
        TNIEESGM+KLLKRPEKLRGAPERRNKDKYCRFHREH HNTSD WELKRQIE+LIQD YFKKFVGKPRTSSAEKKEERK SRTP                
Subjt:  TNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP----------------

Query:  PTD--------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG
        P+                     +REQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVG
Subjt:  PTD--------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG

Query:  FSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETL
        FS ESVI EGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETL

Query:  AGRDGPLEFEADLPRKEFAAPTEELELVPLL
          RDG LEF+A+LPR+EFAAPTEELELVPLL
Subjt:  AGRDGPLEFEADLPRKEFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]; e-value: 1.3e-196; % identity: 82.7
Query:  MCYFLTGLADEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EE  ATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TMIPIFEILTNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD--
        T IPI EILTNIEESGM+KLLKRPEKLRGAPERR+KDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP  TD  
Subjt:  TMIPIFEILTNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD--

Query:  --------------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
                                        +REQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  --------------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKG
        K+SPTPLVGFSGESV+ EGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQ ASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKG

Query:  SSVCALETLAGRDGPLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ
Subjt:  SSVCALETLAGRDGPLEFEADLPRKEFAAPTEELELVPLLSPEKQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]; e-value: 1.2e-258; % identity: 64.22
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARSPAPAPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARSPAPAPTSENFDALKREMEAMR

Query:  TQMRSMEEIYNEMMLAAGAGSRSENRVTRVDIREQRGSHLGLAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLQKGQSPSRSHRSSNQQAESSHN--L
                                                                                                   AESS+N   
Subjt:  TQMRSMEEIYNEMMLAAGAGSRSENRVTRVDIREQRGSHLGLAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLQKGQSPSRSHRSSNQQAESSHN--L

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVRAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EE  AT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVRAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPIFEILTNIEE+GM+KLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLKR

Query:  PEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPTD-----------------------LREQ
        PEKLRG PE+RN DKYCRFHR+HGHNTS+ WELKRQIE+LIQDGYFKKFVGKPR++S EKKEERKR RTPP                         +REQ
Subjt:  PEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPTD-----------------------LREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+ LEGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQ

Query:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD
        MAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]; e-value: 2.3e-198; % identity: 96.51
Query:  NLAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        ++ GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDG KDPKDYVEVFEGLMDFQAASDAIKCRAFQI
Subjt:  NLAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVR
        ALTGSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEE  
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVR

Query:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLK
        ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPIFEILTNIEESGM+KLLK
Subjt:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLK

Query:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP
        RPEKLRGAPERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP
Subjt:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813; e-value: 7.0e-249; % identity: 86.74
Query:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD
        +AESS N    AG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDG KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIE
        TVKLGEE  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPI EILTNIE
Subjt:  TVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIE

Query:  ESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD---------------
        ESGM+KLLKRPEKLRGAPERR+KDKYCRFHREHGHNTSD WELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP  TD               
Subjt:  ESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD---------------

Query:  -------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
                           +REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  -------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVILEGCIDLPVTLGQDQTRVTQMAEFV
        SVI EG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVILEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823; e-value: 3.3e-246; % identity: 75.59
Query:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ
        SSNQQAESSHN A   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEIL
        DEALTVKLG+E  ATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPT IPI EIL
Subjt:  DEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEIL

Query:  TNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP----------------
        TNIEESGM+KLLKRPEKLRGAPERRNKDKYCRFHREH HNTSD WELKRQIE+LIQD YFKKFVGKPRTSSAEKKEERK SRTP                
Subjt:  TNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP----------------

Query:  PTD--------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG
        P+                     +REQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVG
Subjt:  PTD--------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVG

Query:  FSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETL
        FS ESVI EGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETL

Query:  AGRDGPLEFEADLPRKEFAAPTEELELVPLL
          RDG LEF+A+LPR+EFAAPTEELELVPLL
Subjt:  AGRDGPLEFEADLPRKEFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899; e-value: 6.2e-197; % identity: 82.7
Query:  MCYFLTGLADEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EE  ATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TMIPIFEILTNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD--
        T IPI EILTNIEESGM+KLLKRPEKLRGAPERR+KDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP  TD  
Subjt:  TMIPIFEILTNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP--TD--

Query:  --------------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
                                        +REQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  --------------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKG
        K+SPTPLVGFSGESV+ EGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQ ASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKG

Query:  SSVCALETLAGRDGPLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ
Subjt:  SSVCALETLAGRDGPLEFEADLPRKEFAAPTEELELVPLLSPEKQ

A0A6J1DHB3 uncharacterized protein LOC111020479; e-value: 5.7e-259; % identity: 64.22
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARSPAPAPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARSPAPAPTSENFDALKREMEAMR

Query:  TQMRSMEEIYNEMMLAAGAGSRSENRVTRVDIREQRGSHLGLAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLQKGQSPSRSHRSSNQQAESSHN--L
                                                                                                   AESS+N   
Subjt:  TQMRSMEEIYNEMMLAAGAGSRSENRVTRVDIREQRGSHLGLAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLQKGQSPSRSHRSSNQQAESSHN--L

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVRAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EE  AT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVRAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPIFEILTNIEE+GM+KLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLKR

Query:  PEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPTD-----------------------LREQ
        PEKLRG PE+RN DKYCRFHR+HGHNTS+ WELKRQIE+LIQDGYFKKFVGKPR++S EKKEERKR RTPP                         +REQ
Subjt:  PEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPTD-----------------------LREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+ LEGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDLPVTLGQDQTRVTQ

Query:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD
        MAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD

A0A6J1DS95 uncharacterized protein LOC111023421; e-value: 1.1e-198; % identity: 96.51
Query:  NLAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        ++ GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDG KDPKDYVEVFEGLMDFQAASDAIKCRAFQI
Subjt:  NLAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVR
        ALTGSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEE  
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVR

Query:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLK
        ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPIFEILTNIEESGM+KLLK
Subjt:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLK

Query:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP
        RPEKLRGAPERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP
Subjt:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGAGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATATAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCTAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCATGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCAAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCTCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCAGCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGATGGGATGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAAATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGTCAGGGCCACCTTCGCCGAGGTGCTTC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGAT
CCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGTCGAGCCGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCATGAT
TCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGGAATGAAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAACAAGGACAAGTATT
GCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGAATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCC
AGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCGACCGACCTGCGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTT
GGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGT
CCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCTAGAGGGTTGCATCGAC
TTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCA
CTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGGCCGCTTCGAGGGAGTGTTATGCCT
CCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGGGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAG
GAGCTCGAGCTTGTTCCTCTGCTGAGTCCCGAGAAGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTTCCTAC
CTCTATTGAGATGCCTAACCCCTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGAGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATATAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCTAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCATGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCAAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCTCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCAGCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGATGGGATGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAAATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGTCAGGGCCACCTTCGCCGAGGTGCTTC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGAT
CCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGTCGAGCCGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCATGAT
TCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGGAATGAAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAACAAGGACAAGTATT
GCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGAATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCC
AGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCGACCGACCTGCGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTT
GGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGT
CCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCTAGAGGGTTGCATCGAC
TTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCA
CTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGGCCGCTTCGAGGGAGTGTTATGCCT
CCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGGGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAG
GAGCTCGAGCTTGTTCCTCTGCTGAGTCCCGAGAAGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTTCCTAC
CTCTATTGAGATGCCTAACCCCTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARSPAPAPTSENFDALKREMEAMRTQMRSMEEIY
NEMMLAAGAGSRSENRVTRVDIREQRGSHLGLAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLQKGQSPSRSHRSSNQQAESSHNLAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDGDLGESPFTSDVLEAAIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVRATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERAD
PKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPIFEILTNIEESGMKKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIENLIQDGYFKKFVGKP
RTSSAEKKEERKRSRTPPTDLREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCID
LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRDGPLEFEADLPRKEFAAPTE
ELELVPLLSPEKQKVGKAGSSVRGPRWGIVPTWLFPTSIEMPNP
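
The CDS and protein sequences above can be checked against each other by translation. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard code applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 12 nt of the CDS sequence listed above.
head = translate("ATGGTTCAACCAGCGAACTCGACCAATACG")
tail = translate("CCTAACCCCTGA")
print(head, tail)
```

The translated start (`MVQPANSTNT`) and end (`PNP` followed by a stop) agree with the first and last residues of the protein sequence listed above. Passing the full CDS to `translate` should reproduce the whole protein in the same way.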