CuGenDBv2

Moc06g16150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g16150
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:12717702..12723277
RNA-Seq Expression: Moc06g16150
Synteny: Moc06g16150
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]; e value: 2.1e-233; % identity: 82.11
Query:  QVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASD
        + ESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KD KDYVEVFE LMDFQAASD
Subjt:  QVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEAL
        AIKCRAF+I LTGSARLWYRRLPA SISTYS+LRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKV HCSDDSAMCYFLTGL DEAL
Subjt:  AIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEAL

Query:  TVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS-----AGAEVE------KMKGQIPSPR-------------------TRDPFPAAELSILT
        TVKLGE+APATFAEVLQKAKKVIDGQELL  K T   ER      +G ++E      K KG   S R                   T    P +E  ILT
Subjt:  TVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS-----AGAEVE------KMKGQIPSPR-------------------TRDPFPAAELSILT

Query:  NIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGP
        NIEESG+EKLLKRPEKLRGAPERRS DKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKF+G+ RTSSAEKKE+RKRSRTPPRRTDRPAVINTIFGGP
Subjt:  NIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGF
        SGGQSG KRK+LARAAR EVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVL+DGG SANILSLPTYLALGWTRSQLKKSP PLVGF
Subjt:  SGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTWVTQMAEFV
        SGESVIPEG IDLPVTLGQDQT VTQMAEFV
Subjt:  SGESVIPEGCIDLPVTLGQDQTWVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]; e value: 3.4e-236; % identity: 69.35
Query:  SSNQQVESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQ
        SSNQQ ESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KD KDYVEVFEGLMDFQ
Subjt:  SSNQQVESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLV
        AASDAIKCRAFQI LTGSARLW                                                     FQE QLKV   SDDSAMCYFLTGL 
Subjt:  AASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLV

Query:  DEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPK---PTDLNERSAGAEVEKM------KGQIPSPR-------------------TRDPFPAAELSI
        DEALTVKLG++APATFAEVLQKAKKVIDGQELL  K   P    +R    + EK       KG   S R                   T    P +E  I
Subjt:  DEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPK---PTDLNERSAGAEVEKM------KGQIPSPR-------------------TRDPFPAAELSI

Query:  LTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFG
        LTNIEESG+EKLLKRPEKLRGAPERR+ DKYCRFHREH HNTSD WELKRQIEDLIQD YFKKF+G+ RTSSAEKKE+RK SRTP RR DRPAVINTIFG
Subjt:  LTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFG

Query:  GPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLV
        GPSGGQSGHKRK+LARAAR EVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVL+D G SANI+SL TYLALGWTRSQLKKS  PLV
Subjt:  GPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALET
        GFS ESVIPEGCIDLPVTLG DQT VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQV+KY TPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALET

Query:  LAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMENGAPE
        L  RDGTLEF+A+LPR+EFAAPTE+LELVPLL  +      +E +L     +  +D+      D+   G PE
Subjt:  LAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMENGAPE

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]; e value: 1.0e-192; % identity: 81.55
Query:  MCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELL----------------EPKPTDLNERSAG-AEVEKMKGQIPSPRTRDPFPAAEL---S
        MCYFLTGL DEALTVKL E+APATFAEVLQKAKKVIDGQELL                +PK  D    S G AE  + +      R  + F    +    
Subjt:  MCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELL----------------EPKPTDLNERSAG-AEVEKMKGQIPSPRTRDPFPAAEL---S

Query:  ILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF
        ILTNIEESG+EKLLKRPEKLRGAPERRS DKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKF+G+ RTSSAEKKE+RKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPL
        GGPSGGQSGHKRKKLARAAR EVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVL+DGGASANILSLPTYLALGWTRSQLKKSP PL
Subjt:  GGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQT VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQV+KY TPNGVGTVRGEQTASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQL
        TL  RDGTLEFEADLP +EFAAP E+LELVPLLS EKQ+
Subjt:  TLAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]; e value: 3.6e-241; % identity: 60.18
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITVPAVPPVHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P +PP HP+ SKA                                   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITVPAVPPVHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEMYNEMMLAAGAGSRSESRVTRVGVREQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--
                                                                                                    ESS+NP  
Subjt:  TQMRTMEEMYNEMMLAAGAGSRSESRVTRVGVREQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KD KDYVEVFE LMDFQAA+DAIKC AFQI L
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVL

Query:  TGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPAT
        TGSARLWYRRLPAR ISTYS+LR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF E+QLKV HCSDDSAMCYFLTGL DE LTVKL E+APAT
Subjt:  TGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPAT

Query:  FAEVLQKAKKVIDGQELLEPKPTDLNERSAGAEVEKMKGQIPSPRTRDPFPAAELS-----------------------------ILTNIEESGLEKLLK
        FAEVLQK KKVIDGQELL  K     +        K KG+  S ++RD  P++  S                             ILTNIEE+G+EKLLK
Subjt:  FAEVLQKAKKVIDGQELLEPKPTDLNERSAGAEVEKMKGQIPSPRTRDPFPAAELS-----------------------------ILTNIEESGLEKLLK

Query:  RPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKF+G+ R++S EKKE+RKR RTPPRR DRPAVIN             K+K+L
Subjt:  RPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKL

Query:  ARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCID
        AR AR EVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+L+DGGASANILSL TYLALGWTRSQLKKSP PLVGFSGES+  EGCID
Subjt:  ARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        LPV++ QD T VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQV+KY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]; e value: 5.8e-191; % identity: 60.47
Query:  EQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DL +HL  K+  +  + +   S SR   +SN + +S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQ
        K+   +D DLGESPFTSD++EAPIPPKFK PT+KPYDG+KD KDYVEVFEGLMDFQAA+DAIKC AFQI LTGSARLW RRLPARSISTYS+LR+EF+ Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGE+APATFAEVLQ AKKVIDGQELL  K TD  E+ 
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS

Query:  AGAEVEKMKGQIPSPRTRDPFPAAELSILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSS
           +    K +    +++D               SG     +R E   G    R  ++              CWELKRQIEDLIQD YFKKF+G+ R++S
Subjt:  AGAEVEKMKGQIPSPRTRDPFPAAELSILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSS

Query:  AEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGA
         EKKE+RKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRK+LA  AR +V IIREQ PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVL+DGGA
Subjt:  AEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGA

Query:  SANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGV
        SANILSLPTYLAL  TRSQLKKSP PLVGFS ESV PEGCIDLPVT+GQD T VTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQV+KY TPNGV
Subjt:  SANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGV

Query:  GTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRK
        GTVRGEQ  SRECYASALK SSVCALE    +D       DLPR+
Subjt:  GTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813; e value: 1.0e-233; % identity: 82.11
Query:  QVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASD
        + ESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KD KDYVEVFE LMDFQAASD
Subjt:  QVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEAL
        AIKCRAF+I LTGSARLWYRRLPA SISTYS+LRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKV HCSDDSAMCYFLTGL DEAL
Subjt:  AIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEAL

Query:  TVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS-----AGAEVE------KMKGQIPSPR-------------------TRDPFPAAELSILT
        TVKLGE+APATFAEVLQKAKKVIDGQELL  K T   ER      +G ++E      K KG   S R                   T    P +E  ILT
Subjt:  TVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS-----AGAEVE------KMKGQIPSPR-------------------TRDPFPAAELSILT

Query:  NIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGP
        NIEESG+EKLLKRPEKLRGAPERRS DKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKF+G+ RTSSAEKKE+RKRSRTPPRRTDRPAVINTIFGGP
Subjt:  NIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGF
        SGGQSG KRK+LARAAR EVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVL+DGG SANILSLPTYLALGWTRSQLKKSP PLVGF
Subjt:  SGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTWVTQMAEFV
        SGESVIPEG IDLPVTLGQDQT VTQMAEFV
Subjt:  SGESVIPEGCIDLPVTLGQDQTWVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823; e value: 1.7e-236; % identity: 69.35
Query:  SSNQQVESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQ
        SSNQQ ESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KD KDYVEVFEGLMDFQ
Subjt:  SSNQQVESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLV
        AASDAIKCRAFQI LTGSARLW                                                     FQE QLKV   SDDSAMCYFLTGL 
Subjt:  AASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLV

Query:  DEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPK---PTDLNERSAGAEVEKM------KGQIPSPR-------------------TRDPFPAAELSI
        DEALTVKLG++APATFAEVLQKAKKVIDGQELL  K   P    +R    + EK       KG   S R                   T    P +E  I
Subjt:  DEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPK---PTDLNERSAGAEVEKM------KGQIPSPR-------------------TRDPFPAAELSI

Query:  LTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFG
        LTNIEESG+EKLLKRPEKLRGAPERR+ DKYCRFHREH HNTSD WELKRQIEDLIQD YFKKF+G+ RTSSAEKKE+RK SRTP RR DRPAVINTIFG
Subjt:  LTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFG

Query:  GPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLV
        GPSGGQSGHKRK+LARAAR EVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVL+D G SANI+SL TYLALGWTRSQLKKS  PLV
Subjt:  GPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALET
        GFS ESVIPEGCIDLPVTLG DQT VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQV+KY TPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALET

Query:  LAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMENGAPE
        L  RDGTLEF+A+LPR+EFAAPTE+LELVPLL  +      +E +L     +  +D+      D+   G PE
Subjt:  LAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMENGAPE

A0A6J1DD03 uncharacterized protein LOC111019899; e value: 5.1e-193; % identity: 81.55
Query:  MCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELL----------------EPKPTDLNERSAG-AEVEKMKGQIPSPRTRDPFPAAEL---S
        MCYFLTGL DEALTVKL E+APATFAEVLQKAKKVIDGQELL                +PK  D    S G AE  + +      R  + F    +    
Subjt:  MCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELL----------------EPKPTDLNERSAG-AEVEKMKGQIPSPRTRDPFPAAEL---S

Query:  ILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF
        ILTNIEESG+EKLLKRPEKLRGAPERRS DKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKF+G+ RTSSAEKKE+RKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPL
        GGPSGGQSGHKRKKLARAAR EVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVL+DGGASANILSLPTYLALGWTRSQLKKSP PL
Subjt:  GGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQT VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQV+KY TPNGVGTVRGEQTASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQL
        TL  RDGTLEFEADLP +EFAAP E+LELVPLLS EKQ+
Subjt:  TLAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC111020479; e value: 1.7e-241; % identity: 60.18
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITVPAVPPVHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P +PP HP+ SKA                                   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITVPAVPPVHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEMYNEMMLAAGAGSRSESRVTRVGVREQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--
                                                                                                    ESS+NP  
Subjt:  TQMRTMEEMYNEMMLAAGAGSRSESRVTRVGVREQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KD KDYVEVFE LMDFQAA+DAIKC AFQI L
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVL

Query:  TGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPAT
        TGSARLWYRRLPAR ISTYS+LR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF E+QLKV HCSDDSAMCYFLTGL DE LTVKL E+APAT
Subjt:  TGSARLWYRRLPARSISTYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPAT

Query:  FAEVLQKAKKVIDGQELLEPKPTDLNERSAGAEVEKMKGQIPSPRTRDPFPAAELS-----------------------------ILTNIEESGLEKLLK
        FAEVLQK KKVIDGQELL  K     +        K KG+  S ++RD  P++  S                             ILTNIEE+G+EKLLK
Subjt:  FAEVLQKAKKVIDGQELLEPKPTDLNERSAGAEVEKMKGQIPSPRTRDPFPAAELS-----------------------------ILTNIEESGLEKLLK

Query:  RPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKF+G+ R++S EKKE+RKR RTPPRR DRPAVIN             K+K+L
Subjt:  RPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKL

Query:  ARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCID
        AR AR EVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+L+DGGASANILSL TYLALGWTRSQLKKSP PLVGFSGES+  EGCID
Subjt:  ARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        LPV++ QD T VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQV+KY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DPC9 uncharacterized protein LOC111022280; e value: 2.8e-191; % identity: 60.47
Query:  EQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DL +HL  K+  +  + +   S SR   +SN + +S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQVESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQ
        K+   +D DLGESPFTSD++EAPIPPKFK PT+KPYDG+KD KDYVEVFEGLMDFQAA+DAIKC AFQI LTGSARLW RRLPARSISTYS+LR+EF+ Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVLTGSARLWYRRLPARSISTYSKLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGE+APATFAEVLQ AKKVIDGQELL  K TD  E+ 
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPKPTDLNERS

Query:  AGAEVEKMKGQIPSPRTRDPFPAAELSILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSS
           +    K +    +++D               SG     +R E   G    R  ++              CWELKRQIEDLIQD YFKKF+G+ R++S
Subjt:  AGAEVEKMKGQIPSPRTRDPFPAAELSILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRTSS

Query:  AEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGA
         EKKE+RKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRK+LA  AR +V IIREQ PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVL+DGGA
Subjt:  AEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGA

Query:  SANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGV
        SANILSLPTYLAL  TRSQLKKSP PLVGFS ESV PEGCIDLPVT+GQD T VTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQV+KY TPNGV
Subjt:  SANILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGV

Query:  GTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRK
        GTVRGEQ  SRECYASALK SSVCALE    +D       DLPR+
Subjt:  GTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAAGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGTGCCTGCCGTACCGCCTGTGCACCCGAGGACGTCCAAGGCCACCCGTGGCAGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAGTCGAGTGACGCGCGTGGGCGTACGCGAGCAGAGGGGTTCCCACCTA
GGCCCAACCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGAGGTACACTCGCCAGAGGGGAGACCTCCATGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGTCGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCCACCGTGAAGCCTTATGATGGGACGAAGGACCTCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGTGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCAAGGTCGATCTCG
ACCTACTCCAAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAATATGTCACTAGATTCCAAGAGAAGCAGTTGAAGGTCACACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGTCGACGAA
GCCCTCACGGTGAAACTTGGAGAGAAGGCCCCGGCCACCTTCGCCGAGGTACTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCGAACCAAAACCG
ACCGACCTGAACGAAAGATCGGCCGGGGCAGAGGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATC
CTAACAAACATCGAGGAATCTGGATTGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAACGACAAGTATTGCCGATTCCAT
CGGGAGCACGGTCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAGAGTCCAGGACC
AGCTCAGCAGAGAAAAAGGAAAAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGTGGGCCAAGCGGGGGT
CAGTCCGGACATAAAAGAAAGAAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGAC
TTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGCTAGACGGGGGCGCATCCGCTAAC
ATCCTGTCCTTACCGACTTACCTTGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGATACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAG
GGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAGACTTGGGTCACTCAAATGGCCGAGTTCGTGGTAATCGACGGTAGATCGGCCTATAACGCCATCTTT
GGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTGTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACC
GCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCTGACCTGCCG
AGGAAGGAGTTTGCCGCACCCACTGAGGACCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTC
CCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGAACGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGC
AACTCACCACGAGACCCCAAGGAGCGCAGGAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGTATTGTACCGACGTGGCTTTTCCCTGCCTCTA
TTGAGATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTACGGCAAATGAGGAAGAGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCA
GTGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGAGCATCTGGTCTTAAGGAGG
GTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGTACGTACGTATTGGCCGACCTGAAAGGAGAT
GTCCTCGGGCGCACCCGTGGAACGCGGAGCACCTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAAGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGTGCCTGCCGTACCGCCTGTGCACCCGAGGACGTCCAAGGCCACCCGTGGCAGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAGTCGAGTGACGCGCGTGGGCGTACGCGAGCAGAGGGGTTCCCACCTA
GGCCCAACCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGAGGTACACTCGCCAGAGGGGAGACCTCCATGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGTCGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCCACCGTGAAGCCTTATGATGGGACGAAGGACCTCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGTGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCAAGGTCGATCTCG
ACCTACTCCAAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAATATGTCACTAGATTCCAAGAGAAGCAGTTGAAGGTCACACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGTCGACGAA
GCCCTCACGGTGAAACTTGGAGAGAAGGCCCCGGCCACCTTCGCCGAGGTACTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCGAACCAAAACCG
ACCGACCTGAACGAAAGATCGGCCGGGGCAGAGGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATC
CTAACAAACATCGAGGAATCTGGATTGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAACGACAAGTATTGCCGATTCCAT
CGGGAGCACGGTCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAGAGTCCAGGACC
AGCTCAGCAGAGAAAAAGGAAAAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGTGGGCCAAGCGGGGGT
CAGTCCGGACATAAAAGAAAGAAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGAC
TTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGCTAGACGGGGGCGCATCCGCTAAC
ATCCTGTCCTTACCGACTTACCTTGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGATACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAG
GGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAGACTTGGGTCACTCAAATGGCCGAGTTCGTGGTAATCGACGGTAGATCGGCCTATAACGCCATCTTT
GGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTGTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACC
GCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCTGACCTGCCG
AGGAAGGAGTTTGCCGCACCCACTGAGGACCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTC
CCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGAACGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGC
AACTCACCACGAGACCCCAAGGAGCGCAGGAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGTATTGTACCGACGTGGCTTTTCCCTGCCTCTA
TTGAGATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTACGGCAAATGAGGAAGAGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCA
GTGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGAGCATCTGGTCTTAAGGAGG
GTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGTACGTACGTATTGGCCGACCTGAAAGGAGAT
GTCCTCGGGCGCACCCGTGGAACGCGGAGCACCTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITVPAVPPVHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRT
MEEMYNEMMLAAGAGSRSESRVTRVGVREQRGSHLGPTEEERPEDNESERYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNPAGIITREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDLKDYVEVFEGLMDFQAASDAIKCRAFQIVLTGSARLWYRRLPARSIS
TYSKLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVTHCSDDSAMCYFLTGLVDEALTVKLGEKAPATFAEVLQKAKKVIDGQELLEPKP
TDLNERSAGAEVEKMKGQIPSPRTRDPFPAAELSILTNIEESGLEKLLKRPEKLRGAPERRSNDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGESRT
SSAEKKEKRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLLDGGASAN
ILSLPTYLALGWTRSQLKKSPIPLVGFSGESVIPEGCIDLPVTLGQDQTWVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVVKYPTPNGVGTVRGEQT
ASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEDLELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMENGAPESSWMDPIADFIRG
NSPRDPKERRKLARRAARFVVRDGVLYRRGFSLPLLRCLTPEEGLVEHYEPTANEEELLLNLDLLEERRAVAQLRLAEYQGRMARHYNARVRPRTFQVEHLVLRR
VQTHVGALDPAWEGPFEVKGIVRPGTYVLADLKGDVLGRTRGTRST