; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g24630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g24630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:17811640..17817234
RNA-Seq ExpressionMoc04g24630
SyntenyMoc04g24630
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.3e-22582.77Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA
        G+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYRRLPA
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------
         SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEE PATFAE          
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------

Query:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
                       IGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
Subjt:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQ
        KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRT P   TD                                  +REQ
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ
         PTCPITFDGADLEEVHLPHNDALVIAPLID VVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ

Query:  MAEFV
        MAEFV
Subjt:  MAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]3.2e-18081.09Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------I
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+ P TFAE                         I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------I

Query:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQGPTCPITFDGADLEEV
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT P   TD                                  +REQGPTCPITFDGAD EEV
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQGPTCPITFDGADLEEV

Query:  HLPHNDALVIAPLIDRVVVRRVL
        HLPHNDA VIAPLID VVVRRVL
Subjt:  HLPHNDALVIAPLIDRVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.3e-22171.81Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA
        G+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW      
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------
                                                       FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+E PATFAE          
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------

Query:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
                       I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KD
Subjt:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT------LPGALTD---------------------------LREQG
        KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRT       P  +                             +REQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT------LPGALTD---------------------------LREQG

Query:  PTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM
        PTCPITFD ADLEEVHLPHNDALVIAPLID VVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQM
Subjt:  PTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM

Query:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELV
        AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL  RDGTLEF+A+LPR+EFAAPTEELELV
Subjt:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELV

Query:  PLL
        PLL
Subjt:  PLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]7.7e-19080.91Show/hide
Query:  MCYFLTGLADEALTVKLGEEVPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EE PATFAE                  IG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEVPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD--------
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT P   TD        
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD--------

Query:  --------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP
                                  +REQ PTCPITFD ADL EVHLPHNDALVIAPLID VVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTP
Subjt:  --------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP

Query:  LVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCAL
        LVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LKG+SVCAL
Subjt:  LVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCAL

Query:  ETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        ETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  ETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.6e-23862.62Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SKA       +           T E FD LK       
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALKREMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAGGGELDAQVEALKAKCE
                                                                                              + DAQVEALKA+CE
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAGGGELDAQVEALKAKCE

Query:  QKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLA
        +K+ S +DGDLGE  F+SD+LEA IP KFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++
Subjt:  QKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLA

Query:  QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------
        QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EE PATFAE                         
Subjt:  QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------

Query:  IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS
        I +GR+GKD  +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS
Subjt:  IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS

Query:  DCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLP--------------------GALTDLREQGPTCPITFDGADLEEVHLPHNDALVIAP
        + WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RT P                      +  +REQ PT  I F+ ADLE VHLPHNDALVIAP
Subjt:  DCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLP--------------------GALTDLREQGPTCPITFDGADLEEVHLPHNDALVIAP

Query:  LIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAI
        LID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+
Subjt:  LIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAI

Query:  PSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  PSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.6e-22582.77Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA
        G+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYRRLPA
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------
         SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEE PATFAE          
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------

Query:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
                       IGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
Subjt:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQ
        KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRT P   TD                                  +REQ
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ
         PTCPITFDGADLEEVHLPHNDALVIAPLID VVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ

Query:  MAEFV
        MAEFV
Subjt:  MAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188236.3e-22271.81Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA
        G+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW      
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------
                                                       FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+E PATFAE          
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE----------

Query:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
                       I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KD
Subjt:  ---------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT------LPGALTD---------------------------LREQG
        KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRT       P  +                             +REQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT------LPGALTD---------------------------LREQG

Query:  PTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM
        PTCPITFD ADLEEVHLPHNDALVIAPLID VVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQM
Subjt:  PTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM

Query:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELV
        AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL  RDGTLEF+A+LPR+EFAAPTEELELV
Subjt:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELV

Query:  PLL
        PLL
Subjt:  PLL

A0A6J1D9W7 uncharacterized protein LOC1110187081.6e-18081.09Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------I
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+ P TFAE                         I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------I

Query:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQGPTCPITFDGADLEEV
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT P   TD                                  +REQGPTCPITFDGAD EEV
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD----------------------------------LREQGPTCPITFDGADLEEV

Query:  HLPHNDALVIAPLIDRVVVRRVL
        HLPHNDA VIAPLID VVVRRVL
Subjt:  HLPHNDALVIAPLIDRVVVRRVL

A0A6J1DD03 uncharacterized protein LOC1110198993.7e-19080.91Show/hide
Query:  MCYFLTGLADEALTVKLGEEVPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EE PATFAE                  IG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEVPATFAE------------------IGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD--------
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT P   TD        
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTD--------

Query:  --------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP
                                  +REQ PTCPITFD ADL EVHLPHNDALVIAPLID VVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTP
Subjt:  --------------------------LREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP

Query:  LVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCAL
        LVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LKG+SVCAL
Subjt:  LVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCAL

Query:  ETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        ETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  ETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204791.3e-23862.62Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SKA       +           T E FD LK       
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALKREMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAGGGELDAQVEALKAKCE
                                                                                              + DAQVEALKA+CE
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAGGGELDAQVEALKAKCE

Query:  QKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLA
        +K+ S +DGDLGE  F+SD+LEA IP KFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++
Subjt:  QKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLA

Query:  QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------
        QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EE PATFAE                         
Subjt:  QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAE-------------------------

Query:  IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS
        I +GR+GKD  +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS
Subjt:  IGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS

Query:  DCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLP--------------------GALTDLREQGPTCPITFDGADLEEVHLPHNDALVIAP
        + WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RT P                      +  +REQ PT  I F+ ADLE VHLPHNDALVIAP
Subjt:  DCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLP--------------------GALTDLREQGPTCPITFDGADLEEVHLPHNDALVIAP

Query:  LIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAI
        LID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+
Subjt:  LIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAI

Query:  PSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  PSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGTAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCCGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGGATAATCACAAGGGAGGAGTTCGACCAGCTG
GGGGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTT
TTGGAAGCACCAATCCCTTCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAAGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCA
AGCGGCATCAGACGCAATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGATTGCCAGCCAGGTCGATCTCAACCTACTCTCAGC
TGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGAGAATATGTC
ACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGA
GGAGGTCCCGGCCACCTTTGCCGAGATCGGCCGGGGCAGAAGTGGGAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTG
AGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTATGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATG
GAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAACACGGCCATAACACGTCAGACTGCTG
GGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAA
GGACGCTGCCCGGCGCACTGACCGACCTGCGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACTTGCCCCACAATGATGCACTT
GTGATTGCTCCCTTGATTGATCGTGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAG
GTCGCAATTGAAGAGAAGTCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGG
TCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAA
GTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGA
AACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGA
AGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCAAGATCTGATGGAGATCGGCGCTCTAGAG
TCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCC
TACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACA
ATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAG
GGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGTAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCCGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGGATAATCACAAGGGAGGAGTTCGACCAGCTG
GGGGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTT
TTGGAAGCACCAATCCCTTCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAAGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCA
AGCGGCATCAGACGCAATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGATTGCCAGCCAGGTCGATCTCAACCTACTCTCAGC
TGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGAGAATATGTC
ACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGA
GGAGGTCCCGGCCACCTTTGCCGAGATCGGCCGGGGCAGAAGTGGGAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTG
AGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTATGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATG
GAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAACACGGCCATAACACGTCAGACTGCTG
GGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAA
GGACGCTGCCCGGCGCACTGACCGACCTGCGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACTTGCCCCACAATGATGCACTT
GTGATTGCTCCCTTGATTGATCGTGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAG
GTCGCAATTGAAGAGAAGTCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGG
TCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAA
GTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGA
AACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGA
AGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCAAGATCTGATGGAGATCGGCGCTCTAGAG
TCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCC
TACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACA
ATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAG
GGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALKREMEAMRTQMRSMEEMY
NEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAGGGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV
LEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYV
TRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPATFAEIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGM
EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTLPGALTDLREQGPTCPITFDGADLEEVHLPHNDAL
VIAPLIDRVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ
VLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEQDLMEIGALE
SSWMDPIADFIRGNSPQDPKERRKLARRAARVEHYEPTTNEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIK
GIVRPGTYILADLKGDVLAHPWNAEHLKRYYP