; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:1223100..1228635
RNA-Seq ExpressionMoc01g01810
SyntenyMoc01g01810
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.7e-19370.75Show/hide
Query:  LKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITR+EFD  + +   QVEALKA+CE+K+   +D DLGE PFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADET
        DAIKCRAF+IALTG+ARLWYRRLPA SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITN
        LTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  +     D KS+DK S SS  RA+Y R+E+GP+RSRPYER+TPTTIPISEI+TN
Subjt:  LTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITN

Query:  IEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS
        IEESGMEKLLKRPEK    PE   K    R                        +GYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR--------------
        GGQSG KRKELAR ARREVCIIREQ+PTC ITFD ADLE VHLPHNDA+VIAPLIDHV+V  +LVDGG SANILSLPTYLALGWTR              
Subjt:  GGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR--------------

Query:  -----------------QDATQVTQMAEFV
                         QD TQVTQMAEFV
Subjt:  -----------------QDATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.0e-18258.78Show/hide
Query:  NSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITR+EFD  + +   QVEALKA+CE+K+   +D DLGE PFTSD++EA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTG+ARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEI
        DE LTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTGRPE+ I + +  + + K D KS+DK S SS  RA++ R+ +GP+RSRPYER+TPTTIPISEI
Subjt:  DETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEI

Query:  ITNIEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFG
        +TNIEESGMEKLLKRPEK    PE   K    R                        + YFKKFVGKPR++S EKKEERK SRTP RR DRPAVINTIFG
Subjt:  ITNIEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFG

Query:  GPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR-----------
        GPSGGQSG+KRKELAR ARREVCIIREQ+PTC ITFD ADLE VHLPHNDA+VIAPLIDHV+V  +LVD G SANI+SL TYLALGWTR           
Subjt:  GPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR-----------

Query:  --------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEE
                             D TQVTQMAEFVVIDG+SAYNAIFGRPIIHSFR +PSTLHQVLKYSTPNGV  VRGEQ  SRECYASALKGS+VCALE 
Subjt:  --------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEE

Query:  QTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLSPEKQVSIGTKLGATEREEL
          +       + +LP   +++F+ P EELELVPLL  +   +I  +    E+  L
Subjt:  QTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLSPEKQVSIGTKLGATEREEL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.9e-22272.97Show/hide
Query:  KAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITR+EFD  K +F  QVEALKARCEKK+SSFDD DLGEL F+SDI+EA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETL
        AIKC AFQIALTG+ARLWYRRLPAR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADETL
Subjt:  AIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETL

Query:  TVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNI
        TVKL EEAP TFAEVLQK KKVIDGQELLRTKTGRPEK I Q +  + K K DSKSRDK  SSS+SR  Y RS S  ++SRPYE YTPTTIPI EI+TNI
Subjt:  TVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNI

Query:  EESGMEKLLKRPEK----PE----------------------ETQKSAARI--NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG
        EE+GMEKLLKRPEK    PE                      E ++    +  +GYFKKFVGKPRSNSVEKKEERKR RTPPRRDDRPAVI         
Subjt:  EESGMEKLLKRPEK----PE----------------------ETQKSAARI--NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWT----------------
            NK+KELAREARREVCIIREQ+PT SI F+ ADLEGVHLPHNDA+VIAPLID VLV  +LVDGGASANILSL TYLALGWT                
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWT----------------

Query:  ---------------RQDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNH
                       RQD TQVTQMAEFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYST NGV TVRGE KTSRECYAS  K S+VCALEEQT  
Subjt:  ---------------RQDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNH

Query:  GKL
         +L
Subjt:  GKL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]6.0e-0246.15Show/hide
Query:  MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQVRAGQEGDLPRRSACHANQELRPAHPKPLKA
        MV PANS NT ++R + A++G QR++GA +VE Q       +   RSA      L PAHPKP KA
Subjt:  MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQVRAGQEGDLPRRSACHANQELRPAHPKPLKA

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]8.1e-21780.78Show/hide
Query:  MDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTG+ARLWYRRLPARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIP
        T LADETLTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQI QKKL Q+KRK DSKSRDK SSSSASR +Y R ESGPSRSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIP

Query:  ISEIITNIEESGMEKLLKRPEK-----PEETQKSAARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVIN
        ISEI+TNIEESGMEKLLKRPEK      +  ++   R                        +GYFKKFVGKPRSNSVEKKEERKRSRTPPRR+DRPAVIN
Subjt:  ISEIITNIEESGMEKLLKRPEK-----PEETQKSAARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVIN

Query:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTRQDATQVT
        TIFGGP+GGQSGNKRKELAREARREVCIIRE KPTCSITF DADLEGVHLPHNDA+VIA LIDH LV  +L+DGG     + LP  +      QDATQVT
Subjt:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTRQDATQVT

Query:  QMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIE
        QMAEFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTPN V  VRGEQKTSRECYASALKGSAVCALEEQTN GKLQ SE DLPKEGK+QF PP E
Subjt:  QMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIE

Query:  ELELVPLLSPEKQVS
        ELELVPLLSPE+Q +
Subjt:  ELELVPLLSPEKQVS

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]1.2e-23971.75Show/hide
Query:  PDAPGEKGAPSIQPGDREPIPND-GVDYSLRDNDLRKQLTEKKKIASREPEDSPSYSREFSNSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALK
        P APGEKGAPSIQPG+REPIPND GVDYSLRDNDLRK LT+KKK AS EPEDS SYSREFSNSNLKAQSKYKPLIPEAVI R+EFDL KHRF EQVEALK
Subjt:  PDAPGEKGAPSIQPGDREPIPND-GVDYSLRDNDLRKQLTEKKKIASREPEDSPSYSREFSNSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALK

Query:  ARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRK
        ARCEKK+S FDDDDLGE PFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTG+ARLW RRLPARSISTYSQLRK
Subjt:  ARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKTATHLATIRQKE                                   DETLTVKLGEEAP TFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNIEESGMEKLLKRPEKPEETQKSAARINGYFKKFVGK
        PEKQI QK+L Q+KRK DSKS+DK SSSS SR +Y RSESGPSRSRPYER       I ++I                            + YFKKFVGK
Subjt:  PEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNIEESGMEKLLKRPEKPEETQKSAARINGYFKKFVGK

Query:  PRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LL
        PRSNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D DLEGVHLPHNDA+VIAPLIDHVLV  +L
Subjt:  PRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LL

Query:  VDGGASANILSLPTYLALGWTR-------------------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS
        VDGGASANILSLPTYLAL  TR                               QD+TQVTQMAEFVVIDG+ AYNAIF RPIIHSF+ VPS LHQVLKYS
Subjt:  VDGGASANILSLPTYLALGWTR-------------------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS

Query:  TPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLS
        TPNGV TVRGEQKTSRECYASALK S+VCALEEQT       S+ DLP+E K           L P L+
Subjt:  TPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.8e-19370.75Show/hide
Query:  LKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITR+EFD  + +   QVEALKA+CE+K+   +D DLGE PFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADET
        DAIKCRAF+IALTG+ARLWYRRLPA SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITN
        LTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  +     D KS+DK S SS  RA+Y R+E+GP+RSRPYER+TPTTIPISEI+TN
Subjt:  LTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITN

Query:  IEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS
        IEESGMEKLLKRPEK    PE   K    R                        +GYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR--------------
        GGQSG KRKELAR ARREVCIIREQ+PTC ITFD ADLE VHLPHNDA+VIAPLIDHV+V  +LVDGG SANILSLPTYLALGWTR              
Subjt:  GGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR--------------

Query:  -----------------QDATQVTQMAEFV
                         QD TQVTQMAEFV
Subjt:  -----------------QDATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188234.8e-18358.78Show/hide
Query:  NSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITR+EFD  + +   QVEALKA+CE+K+   +D DLGE PFTSD++EA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTG+ARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEI
        DE LTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTGRPE+ I + +  + + K D KS+DK S SS  RA++ R+ +GP+RSRPYER+TPTTIPISEI
Subjt:  DETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEI

Query:  ITNIEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFG
        +TNIEESGMEKLLKRPEK    PE   K    R                        + YFKKFVGKPR++S EKKEERK SRTP RR DRPAVINTIFG
Subjt:  ITNIEESGMEKLLKRPEK----PEETQKSA-ARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFG

Query:  GPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR-----------
        GPSGGQSG+KRKELAR ARREVCIIREQ+PTC ITFD ADLE VHLPHNDA+VIAPLIDHV+V  +LVD G SANI+SL TYLALGWTR           
Subjt:  GPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTR-----------

Query:  --------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEE
                             D TQVTQMAEFVVIDG+SAYNAIFGRPIIHSFR +PSTLHQVLKYSTPNGV  VRGEQ  SRECYASALKGS+VCALE 
Subjt:  --------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEE

Query:  QTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLSPEKQVSIGTKLGATEREEL
          +       + +LP   +++F+ P EELELVPLL  +   +I  +    E+  L
Subjt:  QTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLSPEKQVSIGTKLGATEREEL

A0A6J1DHB3 uncharacterized protein LOC1110204792.4e-22272.97Show/hide
Query:  KAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITR+EFD  K +F  QVEALKARCEKK+SSFDD DLGEL F+SDI+EA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETL
        AIKC AFQIALTG+ARLWYRRLPAR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADETL
Subjt:  AIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETL

Query:  TVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNI
        TVKL EEAP TFAEVLQK KKVIDGQELLRTKTGRPEK I Q +  + K K DSKSRDK  SSS+SR  Y RS S  ++SRPYE YTPTTIPI EI+TNI
Subjt:  TVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNI

Query:  EESGMEKLLKRPEK----PE----------------------ETQKSAARI--NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG
        EE+GMEKLLKRPEK    PE                      E ++    +  +GYFKKFVGKPRSNSVEKKEERKR RTPPRRDDRPAVI         
Subjt:  EESGMEKLLKRPEK----PE----------------------ETQKSAARI--NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWT----------------
            NK+KELAREARREVCIIREQ+PT SI F+ ADLEGVHLPHNDA+VIAPLID VLV  +LVDGGASANILSL TYLALGWT                
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWT----------------

Query:  ---------------RQDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNH
                       RQD TQVTQMAEFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYST NGV TVRGE KTSRECYAS  K S+VCALEEQT  
Subjt:  ---------------RQDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNH

Query:  GKL
         +L
Subjt:  GKL

A0A6J1DHB3 uncharacterized protein LOC1110204792.9e-0246.15Show/hide
Query:  MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQVRAGQEGDLPRRSACHANQELRPAHPKPLKA
        MV PANS NT ++R + A++G QR++GA +VE Q       +   RSA      L PAHPKP KA
Subjt:  MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQVRAGQEGDLPRRSACHANQELRPAHPKPLKA

A0A6J1DHB3 uncharacterized protein LOC1110204793.9e-21780.78Show/hide
Query:  MDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTG+ARLWYRRLPARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIP
        T LADETLTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQI QKKL Q+KRK DSKSRDK SSSSASR +Y R ESGPSRSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIP

Query:  ISEIITNIEESGMEKLLKRPEK-----PEETQKSAARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVIN
        ISEI+TNIEESGMEKLLKRPEK      +  ++   R                        +GYFKKFVGKPRSNSVEKKEERKRSRTPPRR+DRPAVIN
Subjt:  ISEIITNIEESGMEKLLKRPEK-----PEETQKSAARI-----------------------NGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVIN

Query:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTRQDATQVT
        TIFGGP+GGQSGNKRKELAREARREVCIIRE KPTCSITF DADLEGVHLPHNDA+VIA LIDH LV  +L+DGG     + LP  +      QDATQVT
Subjt:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTRQDATQVT

Query:  QMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIE
        QMAEFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTPN V  VRGEQKTSRECYASALKGSAVCALEEQTN GKLQ SE DLPKEGK+QF PP E
Subjt:  QMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIE

Query:  ELELVPLLSPEKQVS
        ELELVPLLSPE+Q +
Subjt:  ELELVPLLSPEKQVS

A0A6J1DPC9 uncharacterized protein LOC1110222805.6e-24071.75Show/hide
Query:  PDAPGEKGAPSIQPGDREPIPND-GVDYSLRDNDLRKQLTEKKKIASREPEDSPSYSREFSNSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALK
        P APGEKGAPSIQPG+REPIPND GVDYSLRDNDLRK LT+KKK AS EPEDS SYSREFSNSNLKAQSKYKPLIPEAVI R+EFDL KHRF EQVEALK
Subjt:  PDAPGEKGAPSIQPGDREPIPND-GVDYSLRDNDLRKQLTEKKKIASREPEDSPSYSREFSNSNLKAQSKYKPLIPEAVITRQEFDLTKHRFGEQVEALK

Query:  ARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRK
        ARCEKK+S FDDDDLGE PFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTG+ARLW RRLPARSISTYSQLRK
Subjt:  ARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGNARLWYRRLPARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKTATHLATIRQKE                                   DETLTVKLGEEAP TFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNIEESGMEKLLKRPEKPEETQKSAARINGYFKKFVGK
        PEKQI QK+L Q+KRK DSKS+DK SSSS SR +Y RSESGPSRSRPYER       I ++I                            + YFKKFVGK
Subjt:  PEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNIEESGMEKLLKRPEKPEETQKSAARINGYFKKFVGK

Query:  PRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LL
        PRSNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D DLEGVHLPHNDA+VIAPLIDHVLV  +L
Subjt:  PRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLV--LL

Query:  VDGGASANILSLPTYLALGWTR-------------------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS
        VDGGASANILSLPTYLAL  TR                               QD+TQVTQMAEFVVIDG+ AYNAIF RPIIHSF+ VPS LHQVLKYS
Subjt:  VDGGASANILSLPTYLALGWTR-------------------------------QDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS

Query:  TPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLS
        TPNGV TVRGEQKTSRECYASALK S+VCALEEQT       S+ DLP+E K           L P L+
Subjt:  TPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSETDLPKEGKKQFSPPIEELELVPLLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAAAGGGGCGTGAATGCTGATAACGGCCCTCAGCGAGACCTCGGTGCAAGAATAGTCGAGGATCAGGTC
CGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCTGCCATGCGAACCAAGAGCTACGACCTGCTCACCCGAAACCCTTAAAAGCCAACAGAGGCCGAGGA
GGGACGTCGAGGAAAACCTCCCAAAGGGCCAACCAGGCAGCAGACCCTGAAGCTTTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGGTTGCGCACA
ATGGAAGAAATGTATGCTGAAGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGAGCCCCGGACGCACCCGGTGAAAAGGGAGCTCCATCTATCCAACCTGGC
GATCGCGAGCCCATTCCCAACGATGGAGTGGATTACAGCTTGCGGGATAACGATCTGAGAAAGCAACTCACTGAGAAGAAGAAGATAGCATCTCGAGAGCCGGAA
GACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAGTACAAGCCTCTAATACCAGAAGCCGTGATCACCAGGCAAGAGTTCGAC
CTGACGAAACACAGGTTCGGTGAACAGGTCGAGGCACTTAAGGCCAGGTGCGAGAAGAAGAAGAGTTCGTTTGACGATGACGACTTGGGAGAATTGCCATTCACC
TCGGATATTATGGAGGCTCCCATCCCTCCGAAGTTCAAAACTCCCACCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGGTCTTCGAGGGC
CTCATGGACTTCCAAGCGGCAACAGATGCGATCAAGTGCCGCGCCTTCCAGATCGCGCTCACAGGCAACGCGCGCTTGTGGTACCGAAGACTGCCGGCCAGGTCG
ATCTCGACCTACTCCCAGCTGAGGAAGGAGTTCATCAGTCAGTTCTCCTCGCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAA
GGAGAGACGCTGAGAGAATATGTCACAAGGTTCCAGGAGGAGCAGCTGAAGGTTGTGCACTGCTCCGACGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCC
GATGAGACCCTCACTGTGAAGCTCGGAGAGGAGGCTCCGGTAACCTTCGCCGAAGTCTTGCAGAAGGCAAAGAAGGTCATCGATGGGCAAGAGCTCCTCCGAACC
AAGACTGGCCGACCTGAAAAGCAGATCCATCAGAAGAAATTGGTCCAACAGAAGAGGAAGACTGATTCCAAGTCTAGAGACAAGAGATCGTCTTCTTCTGCCAGC
AGAGCAAAGTACCATAGGTCGGAGAGCGGCCCCAGCCGAAGCCGACCTTATGAACGGTACACACCAACCACCATCCCCATCTCCGAGATAATCACGAACATCGAG
GAGAGCGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCCCGAGGAGACCCAGAAAAGCGCAGCAAGGATAAATGGCTACTTTAAAAAATTCGTGGGCAAACCG
AGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGAGGTCCGAGC
GGGGGCCAGTCCGGAAACAAAAGGAAAGAACTAGCTCGCGAGGCCAGGCGCGAGGTATGCATTATCAGGGAGCAAAAACCTACTTGCTCCATCACTTTCGACGAT
GCCGACTTGGAGGGGGTCCACTTGCCCCATAACGATGCAATGGTGATCGCCCCTCTGATCGATCACGTCTTGGTCCTATTGGTTGATGGAGGCGCATCTGCCAAC
ATCTTGTCCCTCCCAACATATCTAGCATTGGGATGGACCAGACAAGACGCTACCCAAGTAACGCAGATGGCTGAGTTCGTGGTGATCGACGGCAAGTCGGCCTAC
AACGCCATCTTCGGGAGACCTATTATCCACTCATTCCGGGTCGTCCCCTCCACACTGCATCAGGTCCTGAAGTACTCAACCCCTAATGGAGTAAGCACAGTCCGA
GGTGAGCAAAAGACCTCACGAGAATGCTATGCATCCGCACTCAAAGGGTCGGCCGTATGCGCCCTGGAAGAGCAGACCAATCATGGCAAGCTGCAAGGGTCCGAG
ACAGACCTGCCCAAGGAAGGCAAAAAGCAGTTCTCCCCGCCAATAGAAGAGCTCGAGCTTGTTCCTTTGCTTAGCCCCGAAAAGCAAGTAAGCATAGGAACCAAG
CTGGGGGCCACTGAGAGGGAAGAACTGATCAATTTCCTCAGATCGCACCTCGCCCAGTTTGGGACTTACGAGGTGAGTCAAGTTCCAAGATCTGAGAACTTTAAT
GCAGATGCCTTAGCCAAATTGGCATCAGCATACGAGACCGACTTGGCTAGATCGGTCCCGGTCGAAATCTTGGACACTCCTTCAATCTTGGAGCCAGAAGTAATG
GAGGTTGATACTCCATCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAGGGAGCAAAAGAAGATGGCGCGAAGAGCA
GCTCGGTTCACACTCCGAGAAGGAATGTTGTACCGACGTGGCTTATCCCTGCCTCTCCTCAAGTGTGTGACTCCCGAAGAAGGCCTTTACATTCTTAGGGAAGTT
CGTGAAGGGGTGTGTGGAAACCACTCTGGTGCCAGGTCGTTGTCGGCCAAGGTGGTTCGACAAGGGGTAGAGCAGTACGAGCCAACAAAGAACGAGGAAGAGCTA
CTCCTTAACTTGGACTTATTGGAAGGGAAAAGGGAAATGGTTCAGCTGCACTTAGCAGGTTGGGAGGGACCGTTTGTAGTCAAAGGCATAGTCCGACCTGGAACT
TATATGCTGGCCGACCTGGAAGGAAGAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAAAGGGGCGTGAATGCTGATAACGGCCCTCAGCGAGACCTCGGTGCAAGAATAGTCGAGGATCAGGTC
CGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCTGCCATGCGAACCAAGAGCTACGACCTGCTCACCCGAAACCCTTAAAAGCCAACAGAGGCCGAGGA
GGGACGTCGAGGAAAACCTCCCAAAGGGCCAACCAGGCAGCAGACCCTGAAGCTTTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGGTTGCGCACA
ATGGAAGAAATGTATGCTGAAGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGAGCCCCGGACGCACCCGGTGAAAAGGGAGCTCCATCTATCCAACCTGGC
GATCGCGAGCCCATTCCCAACGATGGAGTGGATTACAGCTTGCGGGATAACGATCTGAGAAAGCAACTCACTGAGAAGAAGAAGATAGCATCTCGAGAGCCGGAA
GACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAGTACAAGCCTCTAATACCAGAAGCCGTGATCACCAGGCAAGAGTTCGAC
CTGACGAAACACAGGTTCGGTGAACAGGTCGAGGCACTTAAGGCCAGGTGCGAGAAGAAGAAGAGTTCGTTTGACGATGACGACTTGGGAGAATTGCCATTCACC
TCGGATATTATGGAGGCTCCCATCCCTCCGAAGTTCAAAACTCCCACCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGGTCTTCGAGGGC
CTCATGGACTTCCAAGCGGCAACAGATGCGATCAAGTGCCGCGCCTTCCAGATCGCGCTCACAGGCAACGCGCGCTTGTGGTACCGAAGACTGCCGGCCAGGTCG
ATCTCGACCTACTCCCAGCTGAGGAAGGAGTTCATCAGTCAGTTCTCCTCGCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAA
GGAGAGACGCTGAGAGAATATGTCACAAGGTTCCAGGAGGAGCAGCTGAAGGTTGTGCACTGCTCCGACGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCC
GATGAGACCCTCACTGTGAAGCTCGGAGAGGAGGCTCCGGTAACCTTCGCCGAAGTCTTGCAGAAGGCAAAGAAGGTCATCGATGGGCAAGAGCTCCTCCGAACC
AAGACTGGCCGACCTGAAAAGCAGATCCATCAGAAGAAATTGGTCCAACAGAAGAGGAAGACTGATTCCAAGTCTAGAGACAAGAGATCGTCTTCTTCTGCCAGC
AGAGCAAAGTACCATAGGTCGGAGAGCGGCCCCAGCCGAAGCCGACCTTATGAACGGTACACACCAACCACCATCCCCATCTCCGAGATAATCACGAACATCGAG
GAGAGCGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCCCGAGGAGACCCAGAAAAGCGCAGCAAGGATAAATGGCTACTTTAAAAAATTCGTGGGCAAACCG
AGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGAGGTCCGAGC
GGGGGCCAGTCCGGAAACAAAAGGAAAGAACTAGCTCGCGAGGCCAGGCGCGAGGTATGCATTATCAGGGAGCAAAAACCTACTTGCTCCATCACTTTCGACGAT
GCCGACTTGGAGGGGGTCCACTTGCCCCATAACGATGCAATGGTGATCGCCCCTCTGATCGATCACGTCTTGGTCCTATTGGTTGATGGAGGCGCATCTGCCAAC
ATCTTGTCCCTCCCAACATATCTAGCATTGGGATGGACCAGACAAGACGCTACCCAAGTAACGCAGATGGCTGAGTTCGTGGTGATCGACGGCAAGTCGGCCTAC
AACGCCATCTTCGGGAGACCTATTATCCACTCATTCCGGGTCGTCCCCTCCACACTGCATCAGGTCCTGAAGTACTCAACCCCTAATGGAGTAAGCACAGTCCGA
GGTGAGCAAAAGACCTCACGAGAATGCTATGCATCCGCACTCAAAGGGTCGGCCGTATGCGCCCTGGAAGAGCAGACCAATCATGGCAAGCTGCAAGGGTCCGAG
ACAGACCTGCCCAAGGAAGGCAAAAAGCAGTTCTCCCCGCCAATAGAAGAGCTCGAGCTTGTTCCTTTGCTTAGCCCCGAAAAGCAAGTAAGCATAGGAACCAAG
CTGGGGGCCACTGAGAGGGAAGAACTGATCAATTTCCTCAGATCGCACCTCGCCCAGTTTGGGACTTACGAGGTGAGTCAAGTTCCAAGATCTGAGAACTTTAAT
GCAGATGCCTTAGCCAAATTGGCATCAGCATACGAGACCGACTTGGCTAGATCGGTCCCGGTCGAAATCTTGGACACTCCTTCAATCTTGGAGCCAGAAGTAATG
GAGGTTGATACTCCATCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAGGGAGCAAAAGAAGATGGCGCGAAGAGCA
GCTCGGTTCACACTCCGAGAAGGAATGTTGTACCGACGTGGCTTATCCCTGCCTCTCCTCAAGTGTGTGACTCCCGAAGAAGGCCTTTACATTCTTAGGGAAGTT
CGTGAAGGGGTGTGTGGAAACCACTCTGGTGCCAGGTCGTTGTCGGCCAAGGTGGTTCGACAAGGGGTAGAGCAGTACGAGCCAACAAAGAACGAGGAAGAGCTA
CTCCTTAACTTGGACTTATTGGAAGGGAAAAGGGAAATGGTTCAGCTGCACTTAGCAGGTTGGGAGGGACCGTTTGTAGTCAAAGGCATAGTCCGACCTGGAACT
TATATGCTGGCCGACCTGGAAGGAAGAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQVRAGQEGDLPRRSACHANQELRPAHPKPLKANRGRGGTSRKTSQRANQAADPEALSTLQRELDDMRHRLRT
MEEMYAEATRANRTASPSRAPDAPGEKGAPSIQPGDREPIPNDGVDYSLRDNDLRKQLTEKKKIASREPEDSPSYSREFSNSNLKAQSKYKPLIPEAVITRQEFD
LTKHRFGEQVEALKARCEKKKSSFDDDDLGELPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGNARLWYRRLPARS
ISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPVTFAEVLQKAKKVIDGQELLRT
KTGRPEKQIHQKKLVQQKRKTDSKSRDKRSSSSASRAKYHRSESGPSRSRPYERYTPTTIPISEIITNIEESGMEKLLKRPEKPEETQKSAARINGYFKKFVGKP
RSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEGVHLPHNDAMVIAPLIDHVLVLLVDGGASAN
ILSLPTYLALGWTRQDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVSTVRGEQKTSRECYASALKGSAVCALEEQTNHGKLQGSE
TDLPKEGKKQFSPPIEELELVPLLSPEKQVSIGTKLGATEREELINFLRSHLAQFGTYEVSQVPRSENFNADALAKLASAYETDLARSVPVEILDTPSILEPEVM
EVDTPSPTWMDPIVEFIKGNPPQDPREQKKMARRAARFTLREGMLYRRGLSLPLLKCVTPEEGLYILREVREGVCGNHSGARSLSAKVVRQGVEQYEPTKNEEEL
LLNLDLLEGKREMVQLHLAGWEGPFVVKGIVRPGTYMLADLEGRVLAHPWNAEHLKRYYP