; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g30220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g30220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:22726423..22733830
RNA-Seq ExpressionMoc06g30220
SyntenyMoc06g30220
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]8.2e-20673.77Show/hide
Query:  LKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITREEFD ++ + D QVEALKA+CE+KE   ++GDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADET
        DAIKC AF+IALTGSARLWY+RLPA SISTYSQLR+EF++ FSSRHY +KTATHLATIRQKEGETL EYVTRFQEEQLKVAHCSDDSAMCYFLT LADE 
Subjt:  DAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADET

Query:  LTVKLGEEAPATFAEIDQK-------------KLNQEKRK------------ADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAE+ QK             K  + +RK            AD KS+DKGS SS  R EYRR+E+GP+RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEIDQK-------------KLNQEKRK------------ADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPS
        IEESGMEKLL                              + WELKRQIE+LIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPS

Query:  GGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFS
        GGQSG KRK LAR ARREVC+IREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFS
Subjt:  GGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFS

Query:  GESVSPEGCIDLPVTIGQDATQVTQMAEFV
        GESV PEG IDLPVT+GQD TQVTQMAEFV
Subjt:  GESVSPEGCIDLPVTIGQDATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]8.0e-20163.17Show/hide
Query:  NSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P TP+ VITREEFD ++ K + QVEALKA+CE+KE   ++GDLGESPFTSD+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLA
        AA+DAIKC AFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLT LA
Subjt:  AATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLA

Query:  DETLTVKLGEEAPATFAEIDQK-------------KLNQEKR-----------KADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEIL
        DE LTVKLG+EAPATFAE+ QK             K  + +R           KAD KS+DKGS SS  R E+RR+ +GP+RSRPYER+TPTTIPISEIL
Subjt:  DETLTVKLGEEAPATFAEIDQK-------------KLNQEKR-----------KADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEIL

Query:  TNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGG
        TNIEESGMEKLL                              + WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGG

Query:  PSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG
        PSGGQSG+KRK LAR ARREVC+IREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG

Query:  FSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQ
        FS ESV PEGCIDLPVT+G D TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVC LE  
Subjt:  FSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQ

Query:  TD-------QGDLPREIKRQFSPPAEELELTDLARSVPVEILDSPSILE
                 + +LPR   R+F+ P EELEL  L R    E +D    L+
Subjt:  TD-------QGDLPREIKRQFSPPAEELELTDLARSVPVEILDSPSILE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.9e-23975.79Show/hide
Query:  KAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+TP  VITREEFD +K KFD QVEALKARCEKKE SFD+GDLGE  F+SDILEA IP KFKTPT+KPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETL
        AIKCCAFQIALTGSARLWY+RLPAR ISTYSQLRKEFISQFSSRHY RKT THLATIRQKEGETL EYVTRF EEQLKVAHCSDDSAMCYFLT LADETL
Subjt:  AIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETL

Query:  TVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNI
        TVKL EEAPATFAE                         IDQ +  ++K KADSKSRDKG SSS+SR +YRRS S  ++SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSG
        EE+GMEKLL                              N WELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRR+DRPAVI         
Subjt:  EESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSG

Query:  GQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSG
            NK+K LAR+ARREVC+IREQ+PT SI F   DLEGVHLPHNDALVIAPLID VLVRR+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSG
Subjt:  GQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSG

Query:  ESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQ
        ES+S EGCIDLPV+I QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVC LEEQT +
Subjt:  ESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQ

Query:  GDL
         +L
Subjt:  GDL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]1.0e-26478.11Show/hide
Query:  VPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRESREPEDSPSYSREFSNSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEAL
        +PGAPGEKGAPSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+ S EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEAL
Subjt:  VPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRESREPEDSPSYSREFSNSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEAL

Query:  KARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLR
        KARCEKKE  FD+ DLGESPFTSDI+EA IP KFKTPT+KPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW +RLPARSISTYSQLR
Subjt:  KARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLR

Query:  KEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPATFAE--------------------
        KEFI QFS RHY RKTATHLATIRQKE                                   DETLTVKLGEEAPATFAE                    
Subjt:  KEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPATFAE--------------------

Query:  -----IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLNCWELKRQIEDLIQDGYFKKFVGKP
             IDQK+L+Q+KRK DSKS+DKGSSSS SRTEYRRSESGPSRSRPYER                         CWELKRQIEDLIQD YFKKFVGKP
Subjt:  -----IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLNCWELKRQIEDLIQDGYFKKFVGKP

Query:  RSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLV
        RSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQ  NKRK LA +ARR+V +IREQKPTCSITF D DLEGVHLPHNDALVIAPLIDHVLVRRVLV
Subjt:  RSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLV

Query:  DGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST
        DGGASANILSLPTYLAL  TRSQLKKS TPLVGFS ESVSPEGCIDLPVTIGQD+TQVTQMAEFVVIDGR AYNAIF RPIIHSF+AVPS LHQVLKYST
Subjt:  DGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST

Query:  PNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQGDLPREIK
        PNGVGTVRGEQKTSRECYASALK SSVC LEEQT Q DLPRE K
Subjt:  PNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQGDLPREIK

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]7.0e-20574.72Show/hide
Query:  MDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAATDAIKC AFQIALTGSARLWY+RLPARSISTYSQLRKEFISQFSS HY RKTATHLATIRQKE ETL EYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TSLADETLTVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIP
        TSLADETLTVKLGEEAP TF E                         IDQKKL+QEKRKADSKSRDKGSSSSASRTEYRR ESGPSRSRPYERYT +TIP
Subjt:  TSLADETLTVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN
        ISEILTNIEESGMEKLL                              +CWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN
Subjt:  ISEILTNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN

Query:  TIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSL
        TIFGGP+GGQSGNKRK LAR+ARREVC+IRE KPTCSITFGD DLEGVHLPHNDALVIA LIDH LVRRVL+DG                          
Subjt:  TIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSL

Query:  TPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC
                      GCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VC
Subjt:  TPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC

Query:  VLEEQTDQG-------DLPREIKRQFSPPAEELELTDL
         LEEQT++G       DLP+E KRQF PP EELEL  L
Subjt:  VLEEQTDQG-------DLPREIKRQFSPPAEELELTDL

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088134.0e-20673.77Show/hide
Query:  LKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITREEFD ++ + D QVEALKA+CE+KE   ++GDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADET
        DAIKC AF+IALTGSARLWY+RLPA SISTYSQLR+EF++ FSSRHY +KTATHLATIRQKEGETL EYVTRFQEEQLKVAHCSDDSAMCYFLT LADE 
Subjt:  DAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADET

Query:  LTVKLGEEAPATFAEIDQK-------------KLNQEKRK------------ADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAE+ QK             K  + +RK            AD KS+DKGS SS  R EYRR+E+GP+RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEIDQK-------------KLNQEKRK------------ADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPS
        IEESGMEKLL                              + WELKRQIE+LIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPS

Query:  GGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFS
        GGQSG KRK LAR ARREVC+IREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFS
Subjt:  GGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFS

Query:  GESVSPEGCIDLPVTIGQDATQVTQMAEFV
        GESV PEG IDLPVT+GQD TQVTQMAEFV
Subjt:  GESVSPEGCIDLPVTIGQDATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.9e-20163.17Show/hide
Query:  NSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P TP+ VITREEFD ++ K + QVEALKA+CE+KE   ++GDLGESPFTSD+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLA
        AA+DAIKC AFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLT LA
Subjt:  AATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLA

Query:  DETLTVKLGEEAPATFAEIDQK-------------KLNQEKR-----------KADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEIL
        DE LTVKLG+EAPATFAE+ QK             K  + +R           KAD KS+DKGS SS  R E+RR+ +GP+RSRPYER+TPTTIPISEIL
Subjt:  DETLTVKLGEEAPATFAEIDQK-------------KLNQEKR-----------KADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEIL

Query:  TNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGG
        TNIEESGMEKLL                              + WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGG

Query:  PSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG
        PSGGQSG+KRK LAR ARREVC+IREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG

Query:  FSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQ
        FS ESV PEGCIDLPVT+G D TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVC LE  
Subjt:  FSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQ

Query:  TD-------QGDLPREIKRQFSPPAEELELTDLARSVPVEILDSPSILE
                 + +LPR   R+F+ P EELEL  L R    E +D    L+
Subjt:  TD-------QGDLPREIKRQFSPPAEELELTDLARSVPVEILDSPSILE

A0A6J1DHB3 uncharacterized protein LOC1110204799.4e-24075.79Show/hide
Query:  KAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+TP  VITREEFD +K KFD QVEALKARCEKKE SFD+GDLGE  F+SDILEA IP KFKTPT+KPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETL
        AIKCCAFQIALTGSARLWY+RLPAR ISTYSQLRKEFISQFSSRHY RKT THLATIRQKEGETL EYVTRF EEQLKVAHCSDDSAMCYFLT LADETL
Subjt:  AIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETL

Query:  TVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNI
        TVKL EEAPATFAE                         IDQ +  ++K KADSKSRDKG SSS+SR +YRRS S  ++SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSG
        EE+GMEKLL                              N WELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRR+DRPAVI         
Subjt:  EESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSG

Query:  GQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSG
            NK+K LAR+ARREVC+IREQ+PT SI F   DLEGVHLPHNDALVIAPLID VLVRR+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSG
Subjt:  GQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSG

Query:  ESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQ
        ES+S EGCIDLPV+I QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVC LEEQT +
Subjt:  ESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQ

Query:  GDL
         +L
Subjt:  GDL

A0A6J1DPC9 uncharacterized protein LOC1110222804.9e-26578.11Show/hide
Query:  VPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRESREPEDSPSYSREFSNSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEAL
        +PGAPGEKGAPSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+ S EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEAL
Subjt:  VPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRESREPEDSPSYSREFSNSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEAL

Query:  KARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLR
        KARCEKKE  FD+ DLGESPFTSDI+EA IP KFKTPT+KPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW +RLPARSISTYSQLR
Subjt:  KARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLR

Query:  KEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPATFAE--------------------
        KEFI QFS RHY RKTATHLATIRQKE                                   DETLTVKLGEEAPATFAE                    
Subjt:  KEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPATFAE--------------------

Query:  -----IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLNCWELKRQIEDLIQDGYFKKFVGKP
             IDQK+L+Q+KRK DSKS+DKGSSSS SRTEYRRSESGPSRSRPYER                         CWELKRQIEDLIQD YFKKFVGKP
Subjt:  -----IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLNCWELKRQIEDLIQDGYFKKFVGKP

Query:  RSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLV
        RSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQ  NKRK LA +ARR+V +IREQKPTCSITF D DLEGVHLPHNDALVIAPLIDHVLVRRVLV
Subjt:  RSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLV

Query:  DGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST
        DGGASANILSLPTYLAL  TRSQLKKS TPLVGFS ESVSPEGCIDLPVTIGQD+TQVTQMAEFVVIDGR AYNAIF RPIIHSF+AVPS LHQVLKYST
Subjt:  DGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST

Query:  PNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQGDLPREIK
        PNGVGTVRGEQKTSRECYASALK SSVC LEEQT Q DLPRE K
Subjt:  PNGVGTVRGEQKTSRECYASALKGSSVCVLEEQTDQGDLPREIK

A0A6J1DZB9 uncharacterized protein LOC1110249043.4e-20574.72Show/hide
Query:  MDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAATDAIKC AFQIALTGSARLWY+RLPARSISTYSQLRKEFISQFSS HY RKTATHLATIRQKE ETL EYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAATDAIKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TSLADETLTVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIP
        TSLADETLTVKLGEEAP TF E                         IDQKKL+QEKRKADSKSRDKGSSSSASRTEYRR ESGPSRSRPYERYT +TIP
Subjt:  TSLADETLTVKLGEEAPATFAE-------------------------IDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN
        ISEILTNIEESGMEKLL                              +CWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN
Subjt:  ISEILTNIEESGMEKLL------------------------------NCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN

Query:  TIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSL
        TIFGGP+GGQSGNKRK LAR+ARREVC+IRE KPTCSITFGD DLEGVHLPHNDALVIA LIDH LVRRVL+DG                          
Subjt:  TIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSL

Query:  TPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC
                      GCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VC
Subjt:  TPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC

Query:  VLEEQTDQG-------DLPREIKRQFSPPAEELELTDL
         LEEQT++G       DLP+E KRQF PP EELEL  L
Subjt:  VLEEQTDQG-------DLPREIKRQFSPPAEELELTDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTCTTAAGTGCATCAATGGAAGTAACATCGCCAATATCGGTCCATCTCAGCCCTGCCGAGGAGACCGGGTTCGCCCGTTATAAAAGATTATGGGCACCAAGGGT
CGACCTGAATAGGGTCCGACCTGCTCGGAACCCGACAGGTCGGAACCAGAGACCGGGTTCGAGCTTGATTCGTGAAGAACCGTTGTGCAAACGCTTGCATAAACATTTGG
CGCCGTCTGTGGGGAAGGACGATCTAAGTCATCCCGATCTGAGAAACATACGCAAAGATGTCGAGGACCAGGTCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTG
CCCGCCATGCGAACCAGAAGTTACCCCCTGCTCACCCGAAACCCTCAAAGGCCAACCGAGGCCGAGCGCGAGTTGGATGATATGCGCCATCGGTTGCGCACAATGGAAGA
AATGTACGCTGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGGGTTCCGGGCGCACCCGGTGAGAAGGGAGCTCCATCTATCCAACCTGGCGACCGCGAGCCCA
TTCCCAACGATGGAGGAGTGGATTACAGCTTGCGGGATAACGATCTGAGAAAACATCTCACTGAAAAGAAAAAGAGAGAATCTCGGGAGCCGGAAGACTCTCCTTCCTAC
TCCCGAGAGTTCTCGAATTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTGACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGA
TGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAGTGCTCGTTCGACAATGGCGACTTAGGAGAATCACCATTCACCTCGGACATCTTGGAGGCTCAAATCC
CTCTGAAGTTCAAAACTCCCACAATAAAACCTTATGATGGGTCTAAGGACCCAAAAGACTATGTCGAGGTCTTCGAGGGCCTCATGGACTTTCAGGCGGCAACAGATGCA
ATAAAATGCTGCGCCTTCCAGATCGCGCTTACCGGCAGCGCACGCTTGTGGTACCAGAGACTACCGGCTAGGTCAATCTCGACATACTCTCAGCTGAGAAAGGAATTCAT
CAGCCAATTCTCTTCTCGGCATTACGGTAGAAAAACAGCGACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGGGAGAGTATGTCACAAGGTTTCAGGAGG
AGCAGCTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCAGCCTGGCCGACGAGACCTTAACTGTAAAACTTGGGGAAGAGGCTCCAGCCACC
TTCGCCGAAATCGACCAGAAGAAATTGAACCAAGAGAAGAGGAAGGCTGATTCCAAGTCTAGAGATAAAGGATCGTCCTCTTCTGCCAGCAGAACAGAGTACCGTAGGTC
GGAGAGCGGCCCCAGCCGGAGCCGACCTTATGAACGGTATACCCCAACCACCATCCCCATCTCCGAGATACTCACGAACATCGAGGAAAGCGGGATGGAAAAGCTTCTCA
ATTGCTGGGAGCTGAAGCGCCAAATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAGTTCGTGGGCAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAG
CGTTCAAGAACACCGCCTCGACGAGAGGACCGACCTGCAGTCATCAACACTATTTTTGGAGGTCCAAGTGGAGGCCAGTCCGGAAACAAAAGGAAAGGGCTAGCTCGCAA
GGCCAGGCGCGAGGTATGCGTCATCAGGGAGCAGAAACCCACTTGCTCCATCACTTTCGGCGATGTCGATCTGGAGGGGGTCCATTTGCCCCATAATGACGCACTCGTGA
TCGCTCCTCTCATCGACCACGTCCTTGTCCGAAGAGTACTGGTTGATGGAGGCGCATCTGCCAACATCTTGTCCCTCCCAACGTATCTTGCCTTGGGATGGACTAGGTCA
CAGTTGAAAAAGAGTCTAACACCCTTGGTTGGATTCTCTGGAGAGTCGGTCTCCCCAGAAGGGTGCATTGACCTGCCGGTCACGATCGGGCAAGATGCTACCCAAGTAAC
ACAGATGGCCGAGTTCGTGGTGATCGACGGCAGATCGGCCTACAACGCCATCTTCGGGAGACCCATCATTCACTCATTTCGGGCCGTCCCCTCCACACTGCATCAAGTCC
TGAAGTACTCAACCCCTAATGGAGTGGGCACGGTTCGAGGTGAGCAAAAGACTTCAAGGGAATGTTATGCATCCGCGCTTAAAGGGTCATCAGTATGTGTCCTAGAAGAA
CAGACCGATCAAGGCGACCTGCCTAGGGAAATCAAAAGGCAGTTCTCTCCACCGGCAGAGGAGCTCGAGCTTACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGA
CAGTCCTTCAATCTTGGAGCTAGATGTGGTGGAGATTGACACTCCATCACCCTCTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAATCCACCGAAAGATCCGAAGG
AGCAAAAGAAGATGGCGCGGAGCGCAGCTCGGTTCACACTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCTTGCCTCTGCTCAAGGTAGAGCAGTACGAGCCAGCG
AGGAACGAGGAAGAGCTACTCCTTAACCTGGACTTATTGGAAGGGAAAAGGGAAATGGCTCAGCTGCGCTTAGCGGAATATCAGAACAGAATGGCCAGACATTACAATGC
CCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTGGTTTTGAGAAAAATTCAGAGTCATGTTGGCACCCTTGATCCGAGTTGGGAGGGATCGTTCGAAGTCAAAGGCA
TAGTCCAACGTGGAACTTACATGCTGGCCGACCTGGAAGGAAGAATGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTCTTAAGTGCATCAATGGAAGTAACATCGCCAATATCGGTCCATCTCAGCCCTGCCGAGGAGACCGGGTTCGCCCGTTATAAAAGATTATGGGCACCAAGGGT
CGACCTGAATAGGGTCCGACCTGCTCGGAACCCGACAGGTCGGAACCAGAGACCGGGTTCGAGCTTGATTCGTGAAGAACCGTTGTGCAAACGCTTGCATAAACATTTGG
CGCCGTCTGTGGGGAAGGACGATCTAAGTCATCCCGATCTGAGAAACATACGCAAAGATGTCGAGGACCAGGTCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTG
CCCGCCATGCGAACCAGAAGTTACCCCCTGCTCACCCGAAACCCTCAAAGGCCAACCGAGGCCGAGCGCGAGTTGGATGATATGCGCCATCGGTTGCGCACAATGGAAGA
AATGTACGCTGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGGGTTCCGGGCGCACCCGGTGAGAAGGGAGCTCCATCTATCCAACCTGGCGACCGCGAGCCCA
TTCCCAACGATGGAGGAGTGGATTACAGCTTGCGGGATAACGATCTGAGAAAACATCTCACTGAAAAGAAAAAGAGAGAATCTCGGGAGCCGGAAGACTCTCCTTCCTAC
TCCCGAGAGTTCTCGAATTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTGACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGA
TGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAGTGCTCGTTCGACAATGGCGACTTAGGAGAATCACCATTCACCTCGGACATCTTGGAGGCTCAAATCC
CTCTGAAGTTCAAAACTCCCACAATAAAACCTTATGATGGGTCTAAGGACCCAAAAGACTATGTCGAGGTCTTCGAGGGCCTCATGGACTTTCAGGCGGCAACAGATGCA
ATAAAATGCTGCGCCTTCCAGATCGCGCTTACCGGCAGCGCACGCTTGTGGTACCAGAGACTACCGGCTAGGTCAATCTCGACATACTCTCAGCTGAGAAAGGAATTCAT
CAGCCAATTCTCTTCTCGGCATTACGGTAGAAAAACAGCGACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGGGAGAGTATGTCACAAGGTTTCAGGAGG
AGCAGCTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCAGCCTGGCCGACGAGACCTTAACTGTAAAACTTGGGGAAGAGGCTCCAGCCACC
TTCGCCGAAATCGACCAGAAGAAATTGAACCAAGAGAAGAGGAAGGCTGATTCCAAGTCTAGAGATAAAGGATCGTCCTCTTCTGCCAGCAGAACAGAGTACCGTAGGTC
GGAGAGCGGCCCCAGCCGGAGCCGACCTTATGAACGGTATACCCCAACCACCATCCCCATCTCCGAGATACTCACGAACATCGAGGAAAGCGGGATGGAAAAGCTTCTCA
ATTGCTGGGAGCTGAAGCGCCAAATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAGTTCGTGGGCAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAG
CGTTCAAGAACACCGCCTCGACGAGAGGACCGACCTGCAGTCATCAACACTATTTTTGGAGGTCCAAGTGGAGGCCAGTCCGGAAACAAAAGGAAAGGGCTAGCTCGCAA
GGCCAGGCGCGAGGTATGCGTCATCAGGGAGCAGAAACCCACTTGCTCCATCACTTTCGGCGATGTCGATCTGGAGGGGGTCCATTTGCCCCATAATGACGCACTCGTGA
TCGCTCCTCTCATCGACCACGTCCTTGTCCGAAGAGTACTGGTTGATGGAGGCGCATCTGCCAACATCTTGTCCCTCCCAACGTATCTTGCCTTGGGATGGACTAGGTCA
CAGTTGAAAAAGAGTCTAACACCCTTGGTTGGATTCTCTGGAGAGTCGGTCTCCCCAGAAGGGTGCATTGACCTGCCGGTCACGATCGGGCAAGATGCTACCCAAGTAAC
ACAGATGGCCGAGTTCGTGGTGATCGACGGCAGATCGGCCTACAACGCCATCTTCGGGAGACCCATCATTCACTCATTTCGGGCCGTCCCCTCCACACTGCATCAAGTCC
TGAAGTACTCAACCCCTAATGGAGTGGGCACGGTTCGAGGTGAGCAAAAGACTTCAAGGGAATGTTATGCATCCGCGCTTAAAGGGTCATCAGTATGTGTCCTAGAAGAA
CAGACCGATCAAGGCGACCTGCCTAGGGAAATCAAAAGGCAGTTCTCTCCACCGGCAGAGGAGCTCGAGCTTACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGA
CAGTCCTTCAATCTTGGAGCTAGATGTGGTGGAGATTGACACTCCATCACCCTCTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAATCCACCGAAAGATCCGAAGG
AGCAAAAGAAGATGGCGCGGAGCGCAGCTCGGTTCACACTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCTTGCCTCTGCTCAAGGTAGAGCAGTACGAGCCAGCG
AGGAACGAGGAAGAGCTACTCCTTAACCTGGACTTATTGGAAGGGAAAAGGGAAATGGCTCAGCTGCGCTTAGCGGAATATCAGAACAGAATGGCCAGACATTACAATGC
CCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTGGTTTTGAGAAAAATTCAGAGTCATGTTGGCACCCTTGATCCGAGTTGGGAGGGATCGTTCGAAGTCAAAGGCA
TAGTCCAACGTGGAACTTACATGCTGGCCGACCTGGAAGGAAGAATGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MVFLSASMEVTSPISVHLSPAEETGFARYKRLWAPRVDLNRVRPARNPTGRNQRPGSSLIREEPLCKRLHKHLAPSVGKDDLSHPDLRNIRKDVEDQVEQGKREICRADL
PAMRTRSYPLLTRNPQRPTEAERELDDMRHRLRTMEEMYAEATRANRTASPSRVPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRESREPEDSPSY
SREFSNSNLKAQSKYKPLTPEAVITREEFDLMKHKFDEQVEALKARCEKKECSFDNGDLGESPFTSDILEAQIPLKFKTPTIKPYDGSKDPKDYVEVFEGLMDFQAATDA
IKCCAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYGRKTATHLATIRQKEGETLGEYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPAT
FAEIDQKKLNQEKRKADSKSRDKGSSSSASRTEYRRSESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLNCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERK
RSRTPPRREDRPAVINTIFGGPSGGQSGNKRKGLARKARREVCVIREQKPTCSITFGDVDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRS
QLKKSLTPLVGFSGESVSPEGCIDLPVTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCVLEE
QTDQGDLPREIKRQFSPPAEELELTDLARSVPVEILDSPSILELDVVEIDTPSPSWMDPIVEFIKGNPPKDPKEQKKMARSAARFTLREGVLYRRGFSLPLLKVEQYEPA
RNEEELLLNLDLLEGKREMAQLRLAEYQNRMARHYNARVRPRSFQVGHLVLRKIQSHVGTLDPSWEGSFEVKGIVQRGTYMLADLEGRMLAHPWNAEHLKRYYP