; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g29890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g29890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:22546493..22548784
RNA-Seq ExpressionMoc09g29890
SyntenyMoc09g29890
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.5e-18567.3Show/hide
Query:  QVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        + ESS NP  P G+IT+EEFDQL+ + DAQVEALKA+C+ KE   +DGDLGESPFTSD+LEAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETL
        AIKCRAF+IALTGSA LWYRRLPA                       +KT THL TIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYF+TGLADE L
Subjt:  AIKCRAFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETL

Query:  T-----------------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNI
        T                             TKTGR E+KI + +SG+D   AD KSKDKGSF SS + +YR++++G  R RPYER+TPTTIPISEILTNI
Subjt:  T-----------------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNI

Query:  EETGMEKLL-------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSG
        EE+GMEKLL                          HNT   WELKRQIE+LIQDGYFKKF+GK R++S +KK+E+K SRTPPRR DRPAVINTIFGGPSG
Subjt:  EETGMEKLL-------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSG

Query:  SQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLG
         QS  KRKELAR ARREVC+IREQ+PTC ITF+ +DLE VHLP NDALVIAPLIDHV+V RVLVDGG S+NILSL TYLALGW R+QLKKSPTPLVGF G
Subjt:  SQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLG

Query:  ESVSPEGCIDLPVTIGQDNTQVTQMAEFV
        ESV PEG IDLPVT+GQD TQVTQMAEFV
Subjt:  ESVSPEGCIDLPVTIGQDNTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.2e-18264.86Show/hide
Query:  NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SNQQ ESS+NP  P+G+IT+EEFDQL+ K +AQVEALKA+C+ KE   +DGDLGESPFTSD+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSALLWYRRLPARKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT------------------
        AA+DAIKCRAFQIALTGSA LW                              FQE+QLKV   SDDSAMCYF+TGLADE LT                  
Subjt:  AATDAIKCRAFQIALTGSALLWYRRLPARKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT------------------

Query:  -----------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL----------
                   TKTGR E+ ID+ +SG+D+ KAD KSKDKGSF SS + ++R++ +G  R RPYER+TPTTIPISEILTNIEE+GMEKLL          
Subjt:  -----------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL----------

Query:  ---------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVC
                       +HNT   WELKRQIEDLIQD YFKKF+GK R++S +KK+E+K SRTP RR DRPAVINTIFGGPSG QS +KRKELAR ARREVC
Subjt:  ---------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVC

Query:  VIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCIDLPVTIGQDN
        +IREQ+PTC ITF+ +DLE VHLP NDALVIAPLIDHV+VRRVLVD G S+NI+SL TYLALGW R+QLKKS TPLVGF  ESV PEGCIDLPVT+G D 
Subjt:  VIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCIDLPVTIGQDN

Query:  TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE
        TQVTQMAEFVVIDGRSAYNAIFGRPIIHSF  +PSTLHQVLKYSTP GVG VRGE
Subjt:  TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.8e-21559.29Show/hide
Query:  MVQPASSTNTADRRSLADNSG-QREMEAKVAEDQIPEGLGTEQLRRSARITTLVLPPAHLKPTKTNRGRGGASRRATRGATPAPSRENFDALQKEMEAMR
        MVQPA+STNTADRR+LA N G QRE+ A+V E Q  E LGTE L RSARITT VLPPAH KP+K                                    
Subjt:  MVQPASSTNTADRRSLADNSG-QREMEAKVAEDQIPEGLGTEQLRRSARITTLVLPPAHLKPTKTNRGRGGASRRATRGATPAPSRENFDALQKEMEAMR

Query:  AQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESSYNPIV
                                                                                                    ESSYNPI 
Subjt:  AQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESSYNPIV

Query:  PEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        P G+IT+EEFDQLKSKFDAQVEALKARC+ KES+FDDGDLGE  F+SDILEA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT---------
        LTGSA LWYRRLPA                       RKT THL TIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYF+TGLADETLT         
Subjt:  LTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT---------

Query:  --------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL-
                            TKTGR EK IDQ ++G+DKGKADSKS+DKG  SSS++ DYR+S+S  N+ RPYE YTPTTIPI EILTNIEETGMEKLL 
Subjt:  --------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL-

Query:  ------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKEL
                                 HNT   WELKRQIEDLIQDGYFKKF+GK RSNSV+KK+E+K  RTPPRRDDRPAVI             NK+KEL
Subjt:  ------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKEL

Query:  AREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCID
        AREARREVC+IREQ+PT  I FN +DLEGVHLP NDALVIAPLID VLVRR+LVDGGAS+NILSL+TYLALGW R+QLKKSPTPLVGF GES+S EGCID
Subjt:  AREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCID

Query:  LPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE
        LPV+I QD+TQVTQMAEFVVIDGRSAYNAIFGRPIIHSF  VPSTLHQVLKYST  GVGTVRGE
Subjt:  LPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]5.0e-18665.11Show/hide
Query:  LNPIDEEYPRGDENVEYSRQKNDLRDHL-NRKRSSSHRGGRTPTCSHK--NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFD
        + P + E    DE V+YS + NDLR HL ++K+ +S     + + S +  NSN + +S Y P++PE +I +EEFD +K +FD QVEALKARC+ KES FD
Subjt:  LNPIDEEYPRGDENVEYSRQKNDLRDHL-NRKRSSSHRGGRTPTCSHK--NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFD

Query:  DGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSALLWYRRLPA----------------------
        D DLGESPFTSDI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSA LW RRLPA                      
Subjt:  DGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSALLWYRRLPA----------------------

Query:  -RKTTTHLTTIRQKEGETL-----REYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDY
         RKT THL TIRQKE ETL      E    F E         D             E L TKT R EK+IDQK+  Q K K DSKSKDKGS SS ++T+Y
Subjt:  -RKTTTHLTTIRQKEGETL-----REYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDY

Query:  RKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIF
        R+S+SG +R RPYER                              CWELKRQIEDLIQD YFKKF+GK RSNSV+KK+E+K SRTPPRR+DRPAVINTIF
Subjt:  RKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIF

Query:  GGPSGSQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPL
        GGPSG Q ENKRKELA EARR+V +IREQKPTC ITF D+DLEGVHLP NDALVIAPLIDHVLVRRVLVDGGAS+NILSL TYLAL   R+QLKKSPTPL
Subjt:  GGPSGSQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPL

Query:  VGFLGESVSPEGCIDLPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE
        VGF  ESVSPEGCIDLPVTIGQD+TQVTQMAEFVVIDGR AYNAIF RPIIHSF  VPS LHQVLKYSTP GVGTVRGE
Subjt:  VGFLGESVSPEGCIDLPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]5.8e-18264.25Show/hide
Query:  MEAMRAQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESS
        MEAMR Q+RTMEEMYN+MVQ AGA SR  DQVVHE+VH Q DLH +P+DEE+  G           DLRDHLNRKR+SSHRG RT T  HKNSNQQ ESS
Subjt:  MEAMRAQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESS

Query:  YNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
        YNPI PEG+IT+EEF+QLKSKFDAQVEALK RC+ KESAFDDGDLGESPFTSDILEA IPPKFKTPTMK YDGSKDPKDYVEVFEGLMDFQAATDAIKCR
Subjt:  YNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR

Query:  AFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADET------
        AFQIALTGSA LWYRRLPA                       RKTTTHL TIRQKEG+TL+EY+TRFQEEQLKVVHCSDDS+MCYF+TGLADET      
Subjt:  AFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADET------

Query:  -----------------------LTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGM
                               L TKT R EK+IDQKKS QDK KADSKSKDKGS SS+++TDY                                   
Subjt:  -----------------------LTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGM

Query:  EKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVCVIREQKPTCFI
                                          RSNSV+KK+E+K SRTPPR DDRPAVINTIFGGPSG QS NKRKELAREA REVC+IREQ+PTC +
Subjt:  EKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVCVIREQKPTCFI

Query:  TFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCI
        TF+DSDLEGVHLP NDALVIAPLIDHVLVRRVLVDGGAS+NILS    LALGW R+QLKKSPTPLVGF  ESVS +G +
Subjt:  TFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCI

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.2e-18567.3Show/hide
Query:  QVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        + ESS NP  P G+IT+EEFDQL+ + DAQVEALKA+C+ KE   +DGDLGESPFTSD+LEAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETL
        AIKCRAF+IALTGSA LWYRRLPA                       +KT THL TIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYF+TGLADE L
Subjt:  AIKCRAFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETL

Query:  T-----------------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNI
        T                             TKTGR E+KI + +SG+D   AD KSKDKGSF SS + +YR++++G  R RPYER+TPTTIPISEILTNI
Subjt:  T-----------------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNI

Query:  EETGMEKLL-------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSG
        EE+GMEKLL                          HNT   WELKRQIE+LIQDGYFKKF+GK R++S +KK+E+K SRTPPRR DRPAVINTIFGGPSG
Subjt:  EETGMEKLL-------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSG

Query:  SQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLG
         QS  KRKELAR ARREVC+IREQ+PTC ITF+ +DLE VHLP NDALVIAPLIDHV+V RVLVDGG S+NILSL TYLALGW R+QLKKSPTPLVGF G
Subjt:  SQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLG

Query:  ESVSPEGCIDLPVTIGQDNTQVTQMAEFV
        ESV PEG IDLPVT+GQD TQVTQMAEFV
Subjt:  ESVSPEGCIDLPVTIGQDNTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.6e-18364.86Show/hide
Query:  NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SNQQ ESS+NP  P+G+IT+EEFDQL+ K +AQVEALKA+C+ KE   +DGDLGESPFTSD+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSALLWYRRLPARKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT------------------
        AA+DAIKCRAFQIALTGSA LW                              FQE+QLKV   SDDSAMCYF+TGLADE LT                  
Subjt:  AATDAIKCRAFQIALTGSALLWYRRLPARKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT------------------

Query:  -----------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL----------
                   TKTGR E+ ID+ +SG+D+ KAD KSKDKGSF SS + ++R++ +G  R RPYER+TPTTIPISEILTNIEE+GMEKLL          
Subjt:  -----------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL----------

Query:  ---------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVC
                       +HNT   WELKRQIEDLIQD YFKKF+GK R++S +KK+E+K SRTP RR DRPAVINTIFGGPSG QS +KRKELAR ARREVC
Subjt:  ---------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVC

Query:  VIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCIDLPVTIGQDN
        +IREQ+PTC ITF+ +DLE VHLP NDALVIAPLIDHV+VRRVLVD G S+NI+SL TYLALGW R+QLKKS TPLVGF  ESV PEGCIDLPVT+G D 
Subjt:  VIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCIDLPVTIGQDN

Query:  TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE
        TQVTQMAEFVVIDGRSAYNAIFGRPIIHSF  +PSTLHQVLKYSTP GVG VRGE
Subjt:  TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE

A0A6J1DHB3 uncharacterized protein LOC1110204798.6e-21659.29Show/hide
Query:  MVQPASSTNTADRRSLADNSG-QREMEAKVAEDQIPEGLGTEQLRRSARITTLVLPPAHLKPTKTNRGRGGASRRATRGATPAPSRENFDALQKEMEAMR
        MVQPA+STNTADRR+LA N G QRE+ A+V E Q  E LGTE L RSARITT VLPPAH KP+K                                    
Subjt:  MVQPASSTNTADRRSLADNSG-QREMEAKVAEDQIPEGLGTEQLRRSARITTLVLPPAHLKPTKTNRGRGGASRRATRGATPAPSRENFDALQKEMEAMR

Query:  AQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESSYNPIV
                                                                                                    ESSYNPI 
Subjt:  AQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESSYNPIV

Query:  PEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        P G+IT+EEFDQLKSKFDAQVEALKARC+ KES+FDDGDLGE  F+SDILEA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT---------
        LTGSA LWYRRLPA                       RKT THL TIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYF+TGLADETLT         
Subjt:  LTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLT---------

Query:  --------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL-
                            TKTGR EK IDQ ++G+DKGKADSKS+DKG  SSS++ DYR+S+S  N+ RPYE YTPTTIPI EILTNIEETGMEKLL 
Subjt:  --------------------TKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLL-

Query:  ------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKEL
                                 HNT   WELKRQIEDLIQDGYFKKF+GK RSNSV+KK+E+K  RTPPRRDDRPAVI             NK+KEL
Subjt:  ------------------------NHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKEL

Query:  AREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCID
        AREARREVC+IREQ+PT  I FN +DLEGVHLP NDALVIAPLID VLVRR+LVDGGAS+NILSL+TYLALGW R+QLKKSPTPLVGF GES+S EGCID
Subjt:  AREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCID

Query:  LPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE
        LPV+I QD+TQVTQMAEFVVIDGRSAYNAIFGRPIIHSF  VPSTLHQVLKYST  GVGTVRGE
Subjt:  LPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE

A0A6J1DPC9 uncharacterized protein LOC1110222802.4e-18665.11Show/hide
Query:  LNPIDEEYPRGDENVEYSRQKNDLRDHL-NRKRSSSHRGGRTPTCSHK--NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFD
        + P + E    DE V+YS + NDLR HL ++K+ +S     + + S +  NSN + +S Y P++PE +I +EEFD +K +FD QVEALKARC+ KES FD
Subjt:  LNPIDEEYPRGDENVEYSRQKNDLRDHL-NRKRSSSHRGGRTPTCSHK--NSNQQVESSYNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFD

Query:  DGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSALLWYRRLPA----------------------
        D DLGESPFTSDI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSA LW RRLPA                      
Subjt:  DGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSALLWYRRLPA----------------------

Query:  -RKTTTHLTTIRQKEGETL-----REYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDY
         RKT THL TIRQKE ETL      E    F E         D             E L TKT R EK+IDQK+  Q K K DSKSKDKGS SS ++T+Y
Subjt:  -RKTTTHLTTIRQKEGETL-----REYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDY

Query:  RKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIF
        R+S+SG +R RPYER                              CWELKRQIEDLIQD YFKKF+GK RSNSV+KK+E+K SRTPPRR+DRPAVINTIF
Subjt:  RKSDSGSNRGRPYERYTPTTIPISEILTNIEETGMEKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIF

Query:  GGPSGSQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPL
        GGPSG Q ENKRKELA EARR+V +IREQKPTC ITF D+DLEGVHLP NDALVIAPLIDHVLVRRVLVDGGAS+NILSL TYLAL   R+QLKKSPTPL
Subjt:  GGPSGSQSENKRKELAREARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPL

Query:  VGFLGESVSPEGCIDLPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE
        VGF  ESVSPEGCIDLPVTIGQD+TQVTQMAEFVVIDGR AYNAIF RPIIHSF  VPS LHQVLKYSTP GVGTVRGE
Subjt:  VGFLGESVSPEGCIDLPVTIGQDNTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE

A0A6J1DPN4 uncharacterized protein LOC1110230602.8e-18264.25Show/hide
Query:  MEAMRAQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESS
        MEAMR Q+RTMEEMYN+MVQ AGA SR  DQVVHE+VH Q DLH +P+DEE+  G           DLRDHLNRKR+SSHRG RT T  HKNSNQQ ESS
Subjt:  MEAMRAQIRTMEEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESS

Query:  YNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
        YNPI PEG+IT+EEF+QLKSKFDAQVEALK RC+ KESAFDDGDLGESPFTSDILEA IPPKFKTPTMK YDGSKDPKDYVEVFEGLMDFQAATDAIKCR
Subjt:  YNPIVPEGMITKEEFDQLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR

Query:  AFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADET------
        AFQIALTGSA LWYRRLPA                       RKTTTHL TIRQKEG+TL+EY+TRFQEEQLKVVHCSDDS+MCYF+TGLADET      
Subjt:  AFQIALTGSALLWYRRLPA-----------------------RKTTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADET------

Query:  -----------------------LTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGM
                               L TKT R EK+IDQKKS QDK KADSKSKDKGS SS+++TDY                                   
Subjt:  -----------------------LTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPYERYTPTTIPISEILTNIEETGM

Query:  EKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVCVIREQKPTCFI
                                          RSNSV+KK+E+K SRTPPR DDRPAVINTIFGGPSG QS NKRKELAREA REVC+IREQ+PTC +
Subjt:  EKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELAREARREVCVIREQKPTCFI

Query:  TFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCI
        TF+DSDLEGVHLP NDALVIAPLIDHVLVRRVLVDGGAS+NILS    LALGW R+QLKKSPTPLVGF  ESVS +G +
Subjt:  TFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCGGCGAGCTCAACCAACACAGCAGACCGAAGATCTCTGGCTGATAATAGTGGCCAAAGAGAGATGGAGGCGAAAGTGGCAGAGGATCAAATTCCG
GAAGGCCTAGGAACCGAACAGCTCCGTAGGTCGGCACGCATCACCACACTAGTCTTGCCACCAGCACATCTAAAACCTACTAAGACCAACCGTGGCCGAGGTGGT
GCCTCCAGAAGAGCCACTCGAGGAGCAACACCAGCTCCTAGTAGGGAGAATTTTGATGCCCTCCAGAAAGAAATGGAGGCAATGCGCGCCCAGATACGCACCATG
GAAGAGATGTACAATGAAATGGTGCAAGCTGCTGGTGCTACGTCTCGATTTGAAGACCAAGTGGTGCATGAGGAAGTGCACGTGCAAAGGGATCTGCACCTCAAT
CCAATCGACGAAGAATACCCGAGGGGCGATGAAAATGTGGAGTATAGTCGCCAGAAGAACGATCTTCGCGACCATCTTAACAGAAAGAGAAGCTCGTCCCACCGA
GGTGGACGAACCCCAACATGCTCGCACAAGAACTCCAACCAGCAGGTCGAATCCTCCTATAACCCAATAGTTCCTGAAGGAATGATTACAAAGGAGGAGTTCGAC
CAGCTCAAGAGCAAATTTGATGCTCAAGTTGAAGCCTTAAAGGCAAGGTGCAAGGTGAAAGAAAGTGCATTTGATGATGGCGACTTGGGAGAATCGCCATTCACC
TCGGATATCTTGGAGGCTCCAATCCCCCCAAAGTTCAAAACTCCCACTATGAAGCCATATGATGGGTCTAAGGACCCAAAAGATTACGTTGAGGTCTTTGAAGGC
CTCATGGATTTCCAAGCGGCAACAGACGCCATAAAGTGTCGCGCCTTCCAGATCGCGCTAACCGGCAGCGCGCTCCTGTGGTATAGAAGACTGCCGGCTAGAAAA
ACAACGACTCACCTTACCACCATCAGACAGAAGGAGGGTGAGACGCTTAGAGAATATGTCACCAGGTTCCAGGAGGAGCAGCTGAAAGTCGTGCACTGCTCCGAT
GACTCAGCCATGTGCTACTTCATCACCGGCTTGGCCGATGAGACCCTCACTACCAAGACTGGCCGATCAGAGAAGAAGATCGACCAGAAAAAGTCCGGCCAAGAC
AAAGGAAAGGCCGATTCCAAGTCCAAAGACAAGGGGTCATTCTCATCTAGCAACAAGACGGATTATCGCAAGTCAGACAGCGGTTCCAACAGAGGCAGACCTTAC
GAACGATATACTCCAACCACAATTCCCATCTCGGAAATACTTACAAACATTGAAGAAACTGGGATGGAGAAACTCCTCAACCATAATACATTGATTTGCTGGGAA
CTCAAACGCCAGATTGAAGACCTCATTCAAGATGGCTATTTCAAAAAATTTCTTGGCAAACTAAGGTCTAACTCGGTAAAGAAAAAAGATGAAAAAAAGTGTTCA
AGGACGCCACCTCGCCGAGATGACCGACCAGCGGTCATTAATACTATCTTTGGAGGCCCAAGTGGGAGCCAGTCTGAAAACAAAAGGAAAGAATTAGCTCGAGAA
GCTAGGCGCGAGGTGTGCGTCATTAGGGAGCAGAAACCGACCTGCTTCATCACCTTCAATGATTCCGACCTGGAGGGGGTCCACTTGCCCGATAATGATGCACTT
GTGATTGCACCTCTCATCGATCACGTCTTGGTTCGGAGAGTATTGGTAGATGGGGGTGCATCTTCCAATATTCTGTCTCTTACAACGTATCTTGCCTTGGGATGG
ATCAGGGCACAATTGAAGAAGAGTCCAACACCATTGGTTGGATTTTTGGGAGAATCAGTCTCCCCAGAAGGGTGTATTGACTTGCCGGTTACAATTGGGCAAGAC
AATACACAAGTCACGCAGATGGCCGAGTTCGTTGTGATCGACGGCAGGTCGGCCTATAATGCTATCTTTGGGAGACCTATCATCCATTCATTCTGGGTTGTTCCT
TCAACGCTTCACCAAGTTCTGAAGTACTCAACTCCCGTTGGAGTGGGCACTGTCCGAGGAGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCGGCGAGCTCAACCAACACAGCAGACCGAAGATCTCTGGCTGATAATAGTGGCCAAAGAGAGATGGAGGCGAAAGTGGCAGAGGATCAAATTCCG
GAAGGCCTAGGAACCGAACAGCTCCGTAGGTCGGCACGCATCACCACACTAGTCTTGCCACCAGCACATCTAAAACCTACTAAGACCAACCGTGGCCGAGGTGGT
GCCTCCAGAAGAGCCACTCGAGGAGCAACACCAGCTCCTAGTAGGGAGAATTTTGATGCCCTCCAGAAAGAAATGGAGGCAATGCGCGCCCAGATACGCACCATG
GAAGAGATGTACAATGAAATGGTGCAAGCTGCTGGTGCTACGTCTCGATTTGAAGACCAAGTGGTGCATGAGGAAGTGCACGTGCAAAGGGATCTGCACCTCAAT
CCAATCGACGAAGAATACCCGAGGGGCGATGAAAATGTGGAGTATAGTCGCCAGAAGAACGATCTTCGCGACCATCTTAACAGAAAGAGAAGCTCGTCCCACCGA
GGTGGACGAACCCCAACATGCTCGCACAAGAACTCCAACCAGCAGGTCGAATCCTCCTATAACCCAATAGTTCCTGAAGGAATGATTACAAAGGAGGAGTTCGAC
CAGCTCAAGAGCAAATTTGATGCTCAAGTTGAAGCCTTAAAGGCAAGGTGCAAGGTGAAAGAAAGTGCATTTGATGATGGCGACTTGGGAGAATCGCCATTCACC
TCGGATATCTTGGAGGCTCCAATCCCCCCAAAGTTCAAAACTCCCACTATGAAGCCATATGATGGGTCTAAGGACCCAAAAGATTACGTTGAGGTCTTTGAAGGC
CTCATGGATTTCCAAGCGGCAACAGACGCCATAAAGTGTCGCGCCTTCCAGATCGCGCTAACCGGCAGCGCGCTCCTGTGGTATAGAAGACTGCCGGCTAGAAAA
ACAACGACTCACCTTACCACCATCAGACAGAAGGAGGGTGAGACGCTTAGAGAATATGTCACCAGGTTCCAGGAGGAGCAGCTGAAAGTCGTGCACTGCTCCGAT
GACTCAGCCATGTGCTACTTCATCACCGGCTTGGCCGATGAGACCCTCACTACCAAGACTGGCCGATCAGAGAAGAAGATCGACCAGAAAAAGTCCGGCCAAGAC
AAAGGAAAGGCCGATTCCAAGTCCAAAGACAAGGGGTCATTCTCATCTAGCAACAAGACGGATTATCGCAAGTCAGACAGCGGTTCCAACAGAGGCAGACCTTAC
GAACGATATACTCCAACCACAATTCCCATCTCGGAAATACTTACAAACATTGAAGAAACTGGGATGGAGAAACTCCTCAACCATAATACATTGATTTGCTGGGAA
CTCAAACGCCAGATTGAAGACCTCATTCAAGATGGCTATTTCAAAAAATTTCTTGGCAAACTAAGGTCTAACTCGGTAAAGAAAAAAGATGAAAAAAAGTGTTCA
AGGACGCCACCTCGCCGAGATGACCGACCAGCGGTCATTAATACTATCTTTGGAGGCCCAAGTGGGAGCCAGTCTGAAAACAAAAGGAAAGAATTAGCTCGAGAA
GCTAGGCGCGAGGTGTGCGTCATTAGGGAGCAGAAACCGACCTGCTTCATCACCTTCAATGATTCCGACCTGGAGGGGGTCCACTTGCCCGATAATGATGCACTT
GTGATTGCACCTCTCATCGATCACGTCTTGGTTCGGAGAGTATTGGTAGATGGGGGTGCATCTTCCAATATTCTGTCTCTTACAACGTATCTTGCCTTGGGATGG
ATCAGGGCACAATTGAAGAAGAGTCCAACACCATTGGTTGGATTTTTGGGAGAATCAGTCTCCCCAGAAGGGTGTATTGACTTGCCGGTTACAATTGGGCAAGAC
AATACACAAGTCACGCAGATGGCCGAGTTCGTTGTGATCGACGGCAGGTCGGCCTATAATGCTATCTTTGGGAGACCTATCATCCATTCATTCTGGGTTGTTCCT
TCAACGCTTCACCAAGTTCTGAAGTACTCAACTCCCGTTGGAGTGGGCACTGTCCGAGGAGAGTAG
Protein sequenceShow/hide protein sequence
MVQPASSTNTADRRSLADNSGQREMEAKVAEDQIPEGLGTEQLRRSARITTLVLPPAHLKPTKTNRGRGGASRRATRGATPAPSRENFDALQKEMEAMRAQIRTM
EEMYNEMVQAAGATSRFEDQVVHEEVHVQRDLHLNPIDEEYPRGDENVEYSRQKNDLRDHLNRKRSSSHRGGRTPTCSHKNSNQQVESSYNPIVPEGMITKEEFD
QLKSKFDAQVEALKARCKVKESAFDDGDLGESPFTSDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSALLWYRRLPARK
TTTHLTTIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFITGLADETLTTKTGRSEKKIDQKKSGQDKGKADSKSKDKGSFSSSNKTDYRKSDSGSNRGRPY
ERYTPTTIPISEILTNIEETGMEKLLNHNTLICWELKRQIEDLIQDGYFKKFLGKLRSNSVKKKDEKKCSRTPPRRDDRPAVINTIFGGPSGSQSENKRKELARE
ARREVCVIREQKPTCFITFNDSDLEGVHLPDNDALVIAPLIDHVLVRRVLVDGGASSNILSLTTYLALGWIRAQLKKSPTPLVGFLGESVSPEGCIDLPVTIGQD
NTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFWVVPSTLHQVLKYSTPVGVGTVRGE