; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g15930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g15930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:11980354..11984843
RNA-Seq ExpressionMoc02g15930
SyntenyMoc02g15930
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]9.1e-20372.83Show/hide
Query:  LKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ +LD QVEALKA+CE+KE   +DGDLGESPFT D+LEAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADET
        DAIKCRAF+IALTGS  +                                TATHL TIRQKEGETLREYVTRFQEEQLKVAHCSDDSAM YFLTGLADE 
Subjt:  DAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADET

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++ + AD K +DKGS SS  R EYRR E+GP+RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPS
        IEESGMEKLLKRPEKL+G PE+R+KDKYCRFHR+H HNT+  WELKRQIE LIQDGYFKKFVGKPR++S EKKE+RKRSRTPP R DRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFS
        GGQSG KRKELAR ARR                     E +HLPHNDALVIAPLIDHV+V RVLVDGG SANILSL TYLALGWT+SQLKKSPTPL GFS
Subjt:  GGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFS

Query:  RESVSLEGCIDLPVMIGQDATQVTQMAEFV
         ESV  EG IDLPV +GQD TQVTQMAEFV
Subjt:  RESVSLEGCIDLPVMIGQDATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.1e-18762.68Show/hide
Query:  NSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ KL+ QVEALKA+CE+KE   +DGDLGESPFT D+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSTPVTATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQEL
        AA+DAIKCRAFQIALTGS  +                      FQE+QLKVA  SDDSAM YFLTGLADE LTVKLG+EAPATFAEVLQKAKKVIDGQEL
Subjt:  AATDAIKCRAFQIALTGSTPVTATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQEL

Query:  LRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYC
        LRTKTGRPE+ ID+ + S + +KAD K +DKGS SS  R E+RR  +GP+RSRPYER+TPTTIPISEILTNIEESGMEKLLKRPEKL+G PE+RNKDKYC
Subjt:  LRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYC

Query:  RFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR------------
        RFHR+HDHNT+  WELKRQIE LIQD YFKKFVGKPR++S EKKE+RK SRTP  R DRPAVINTIFGGPSGGQSG+KRKELAR ARR            
Subjt:  RFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR------------

Query:  ---------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEF
                 E +HLPHNDALVIAPLIDHV+V+RVLVD G SANI+SL TYLALGWT+SQLKKS TPL GFSRESV  EGCIDLPV +G D TQVTQMAEF
Subjt:  ---------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEF

Query:  V---------------------------------------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLV
        V                                                   CYAS LKGS+VCALE   +R      + NLP   +R+F+ PTEE  LV
Subjt:  V---------------------------------------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLV

Query:  PLLSFEKQVSI
        PLL ++   +I
Subjt:  PLLSFEKQVSI

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.2e-20256.81Show/hide
Query:  MVHPANSANTTERRGVNADNGTQRDLDTRMVEDQVRTGPEGDLPCRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANRAADPETLSTLQREVPGAP
        MV PANS NT +RR + A++G QR++   +VE Q       +  CRSAR     LPPAHPKPS                                     
Subjt:  MVHPANSANTTERRGVNADNGTQRDLDTRMVEDQVRTGPEGDLPCRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANRAADPETLSTLQREVPGAP

Query:  GEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCE
                                                                     KA+S Y P+ P  VITREEFD +K K D QVEALKARCE
Subjt:  GEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCE

Query:  KKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSTPV----------------------
        KKE SFDDGDLGE  F+ DILEA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGS  +                      
Subjt:  KKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSTPV----------------------

Query:  ----------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQ
                  T THL TIRQKEGETLREYVTRF EEQLKVAHCSDDSAM YFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK 
Subjt:  ----------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQ

Query:  IDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTT
        IDQ +  ++K KADSK RDKG SSS+SR +YRR  S  ++SRPYE YTPTTIPI EILTNIEE+GMEKLLKRPEKL+GDPEKRN DKYCRFHRDH HNT+
Subjt:  IDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTT

Query:  SCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EG
        + WELKRQIE LIQDGYFKKFVGKPRSNSVEKKE+RKR RTPP RDDRPAVI             NK+KELAREARR                     EG
Subjt:  SCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EG

Query:  IHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV----------
        +HLPHNDALVIAPLID VLV+R+LVDGGASANILSL TYLALGWT+SQLKKSPTPL GFS ES+SLEGCIDLPV I QD TQVTQMAEFV          
Subjt:  IHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV----------

Query:  -----------------------------------------CYASTLKGSAVCALEKQANRGKL
                                                 CYAS  K S+VCALE+Q  R +L
Subjt:  -----------------------------------------CYASTLKGSAVCALEKQANRGKL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]9.4e-20061.83Show/hide
Query:  EVPGAPGEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEA
        E+PGAPGEKGAPSIQPG+REPIPN EGVDYSLRDNDLRKHLT+KKK+AS +PEDS SYSREFSNSNLKAQSKYKPL+PEAVI REEFDLMKH+ DEQVEA
Subjt:  EVPGAPGEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEA

Query:  LKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGST-----PVTATHLTTIRQK
        LKARCEKKE  FDD DLGESPFT DI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGS       + A  ++T  Q 
Subjt:  LKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGST-----PVTATHLTTIRQK

Query:  EGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKG
          E + ++  R  + +    H +        +    DETLTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ+K+K DSK +DKG
Subjt:  EGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKG

Query:  SSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKF
        SSSS SRTEYRR ESGPSRSRPYER                                                       CWELKRQIE LIQD YFKKF
Subjt:  SSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKF

Query:  VGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQ
        VGKPRSNSVEKKE+RKRSRTPP R+DRPAVINTIFGGPSGGQ  NKRKELA EARR                     EG+HLPHNDALVIAPLIDHVLV+
Subjt:  VGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQ

Query:  RVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV-------------------------------
        RVLVDGGASANILSL TYLAL  T+SQLKKSPTPL GFS ESVS EGCIDLPV IGQD+TQVTQMAEFV                               
Subjt:  RVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV-------------------------------

Query:  --------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQ
                            CYAS LK S+VCALE+Q        S+ +LP+E K           L P L+ + +
Subjt:  --------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQ

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.3e-17466.12Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFL
        MDFQAATDAIKCRAFQIALTGS  +                                TATHL TIRQKE ETLREYVTRFQEEQLKVAHCSDDSAM YFL
Subjt:  MDFQAATDAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFL

Query:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIP
        T LADETLTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEK+KADSK RDKGSSSSASRTEYRRLESGPSRSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVIN
        ISEILTNIEESGMEKLLKRPEKL+GD EKRNK+KYCRFHRDH HNTTSCWELKRQIE LIQDGYFKKFVGKPRSNSVEKKE+RKRSRTPP R+DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVIN

Query:  TIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSP
        TIFGGP+GGQSGNKRKELAREARR                     EG+HLPHNDALVIA LIDH LV+RVL+DG                          
Subjt:  TIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSP

Query:  TPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV---------------------------------------------------CYASTLKGSAVC
                      GCIDLPV IGQDATQVTQMAEFV                                                   CYAS LKGSAVC
Subjt:  TPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV---------------------------------------------------CYASTLKGSAVC

Query:  ALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQVS
        ALE+Q NRGKLQ SE +LPKEGKRQF PPTEE  LVPLLS E+Q +
Subjt:  ALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQVS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088134.4e-20372.83Show/hide
Query:  LKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ +LD QVEALKA+CE+KE   +DGDLGESPFT D+LEAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADET
        DAIKCRAF+IALTGS  +                                TATHL TIRQKEGETLREYVTRFQEEQLKVAHCSDDSAM YFLTGLADE 
Subjt:  DAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADET

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++ + AD K +DKGS SS  R EYRR E+GP+RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPS
        IEESGMEKLLKRPEKL+G PE+R+KDKYCRFHR+H HNT+  WELKRQIE LIQDGYFKKFVGKPR++S EKKE+RKRSRTPP R DRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFS
        GGQSG KRKELAR ARR                     E +HLPHNDALVIAPLIDHV+V RVLVDGG SANILSL TYLALGWT+SQLKKSPTPL GFS
Subjt:  GGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFS

Query:  RESVSLEGCIDLPVMIGQDATQVTQMAEFV
         ESV  EG IDLPV +GQD TQVTQMAEFV
Subjt:  RESVSLEGCIDLPVMIGQDATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.5e-18762.68Show/hide
Query:  NSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ KL+ QVEALKA+CE+KE   +DGDLGESPFT D+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSTPVTATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQEL
        AA+DAIKCRAFQIALTGS  +                      FQE+QLKVA  SDDSAM YFLTGLADE LTVKLG+EAPATFAEVLQKAKKVIDGQEL
Subjt:  AATDAIKCRAFQIALTGSTPVTATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQEL

Query:  LRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYC
        LRTKTGRPE+ ID+ + S + +KAD K +DKGS SS  R E+RR  +GP+RSRPYER+TPTTIPISEILTNIEESGMEKLLKRPEKL+G PE+RNKDKYC
Subjt:  LRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYC

Query:  RFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR------------
        RFHR+HDHNT+  WELKRQIE LIQD YFKKFVGKPR++S EKKE+RK SRTP  R DRPAVINTIFGGPSGGQSG+KRKELAR ARR            
Subjt:  RFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR------------

Query:  ---------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEF
                 E +HLPHNDALVIAPLIDHV+V+RVLVD G SANI+SL TYLALGWT+SQLKKS TPL GFSRESV  EGCIDLPV +G D TQVTQMAEF
Subjt:  ---------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEF

Query:  V---------------------------------------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLV
        V                                                   CYAS LKGS+VCALE   +R      + NLP   +R+F+ PTEE  LV
Subjt:  V---------------------------------------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLV

Query:  PLLSFEKQVSI
        PLL ++   +I
Subjt:  PLLSFEKQVSI

A0A6J1DHB3 uncharacterized protein LOC1110204795.8e-20356.81Show/hide
Query:  MVHPANSANTTERRGVNADNGTQRDLDTRMVEDQVRTGPEGDLPCRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANRAADPETLSTLQREVPGAP
        MV PANS NT +RR + A++G QR++   +VE Q       +  CRSAR     LPPAHPKPS                                     
Subjt:  MVHPANSANTTERRGVNADNGTQRDLDTRMVEDQVRTGPEGDLPCRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANRAADPETLSTLQREVPGAP

Query:  GEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCE
                                                                     KA+S Y P+ P  VITREEFD +K K D QVEALKARCE
Subjt:  GEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCE

Query:  KKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSTPV----------------------
        KKE SFDDGDLGE  F+ DILEA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGS  +                      
Subjt:  KKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSTPV----------------------

Query:  ----------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQ
                  T THL TIRQKEGETLREYVTRF EEQLKVAHCSDDSAM YFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK 
Subjt:  ----------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQ

Query:  IDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTT
        IDQ +  ++K KADSK RDKG SSS+SR +YRR  S  ++SRPYE YTPTTIPI EILTNIEE+GMEKLLKRPEKL+GDPEKRN DKYCRFHRDH HNT+
Subjt:  IDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTT

Query:  SCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EG
        + WELKRQIE LIQDGYFKKFVGKPRSNSVEKKE+RKR RTPP RDDRPAVI             NK+KELAREARR                     EG
Subjt:  SCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EG

Query:  IHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV----------
        +HLPHNDALVIAPLID VLV+R+LVDGGASANILSL TYLALGWT+SQLKKSPTPL GFS ES+SLEGCIDLPV I QD TQVTQMAEFV          
Subjt:  IHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV----------

Query:  -----------------------------------------CYASTLKGSAVCALEKQANRGKL
                                                 CYAS  K S+VCALE+Q  R +L
Subjt:  -----------------------------------------CYASTLKGSAVCALEKQANRGKL

A0A6J1DPC9 uncharacterized protein LOC1110222804.6e-20061.83Show/hide
Query:  EVPGAPGEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEA
        E+PGAPGEKGAPSIQPG+REPIPN EGVDYSLRDNDLRKHLT+KKK+AS +PEDS SYSREFSNSNLKAQSKYKPL+PEAVI REEFDLMKH+ DEQVEA
Subjt:  EVPGAPGEKGAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEA

Query:  LKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGST-----PVTATHLTTIRQK
        LKARCEKKE  FDD DLGESPFT DI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGS       + A  ++T  Q 
Subjt:  LKARCEKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGST-----PVTATHLTTIRQK

Query:  EGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKG
          E + ++  R  + +    H +        +    DETLTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ+K+K DSK +DKG
Subjt:  EGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKG

Query:  SSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKF
        SSSS SRTEYRR ESGPSRSRPYER                                                       CWELKRQIE LIQD YFKKF
Subjt:  SSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKF

Query:  VGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQ
        VGKPRSNSVEKKE+RKRSRTPP R+DRPAVINTIFGGPSGGQ  NKRKELA EARR                     EG+HLPHNDALVIAPLIDHVLV+
Subjt:  VGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQ

Query:  RVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV-------------------------------
        RVLVDGGASANILSL TYLAL  T+SQLKKSPTPL GFS ESVS EGCIDLPV IGQD+TQVTQMAEFV                               
Subjt:  RVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV-------------------------------

Query:  --------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQ
                            CYAS LK S+VCALE+Q        S+ +LP+E K           L P L+ + +
Subjt:  --------------------CYASTLKGSAVCALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQ

A0A6J1DZB9 uncharacterized protein LOC1110249041.1e-17466.12Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFL
        MDFQAATDAIKCRAFQIALTGS  +                                TATHL TIRQKE ETLREYVTRFQEEQLKVAHCSDDSAM YFL
Subjt:  MDFQAATDAIKCRAFQIALTGSTPV--------------------------------TATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFL

Query:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIP
        T LADETLTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEK+KADSK RDKGSSSSASRTEYRRLESGPSRSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVIN
        ISEILTNIEESGMEKLLKRPEKL+GD EKRNK+KYCRFHRDH HNTTSCWELKRQIE LIQDGYFKKFVGKPRSNSVEKKE+RKRSRTPP R+DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVIN

Query:  TIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSP
        TIFGGP+GGQSGNKRKELAREARR                     EG+HLPHNDALVIA LIDH LV+RVL+DG                          
Subjt:  TIFGGPSGGQSGNKRKELAREARR---------------------EGIHLPHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSP

Query:  TPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV---------------------------------------------------CYASTLKGSAVC
                      GCIDLPV IGQDATQVTQMAEFV                                                   CYAS LKGSAVC
Subjt:  TPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFV---------------------------------------------------CYASTLKGSAVC

Query:  ALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQVS
        ALE+Q NRGKLQ SE +LPKEGKRQF PPTEE  LVPLLS E+Q +
Subjt:  ALEKQANRGKLQGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACGGAGGGGTGTGAACGCTGATAATGGCACTCAGCGAGACCTCGACACAAGAATGGTCGAGGACCAGGTCCGAAC
AGGACCAGAGGGAGATCTGCCATGCAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAGGAGGGACCTCGA
GAAAAACCTCCCAAAGGGCCAACCGGGCAGCAGACCCTGAAACTTTGTCTACTCTCCAGCGCGAGGTCCCGGGCGCACCTGGTGAAAAGGGAGCCCCATCCATCCAACCT
GGCGACCGCGAGCCCATTCCCAACGTTGAAGGAGTGGATTATAGCTTGCGGGACAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGAAGCCGGA
AGACTCTCCTTCCTACTCCCGAGAGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACAAGCCTCTGATGCCAGAAGCTGTGATCACTAGAGAAGAGTTCGACCTAA
TGAAGCACAAGCTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAATGTTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCCCGGACATC
CTGGAGGCTCCAATCCCTCCGAAGTTCAAAACTCCCACAATGAAGCCTTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGGTCTTCGAAGGCCTCATGGACTTTCA
AGCGGCAACAGATGCAATCAAGTGCCGTGCCTTCCAGATCGCGCTCACCGGTAGCACGCCTGTGACAGCGACTCATCTCACCACTATCAGGCAGAAGGAGGGAGAGACGC
TGAGAGAGTATGTCACGAGGTTCCAGGAGGAGCAGTTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTTCTACTTCCTCACCGGCCTGGCCGATGAGACCTTAACA
GTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTCCTGCAGAAGGCGAAGAAGGTCATTGATGGGCAAGAGCTCCTCCGAACCAAAACTGGCCGACCTGAGAA
GCAGATCGACCAGAAGAAGTTGAGCCAAGAGAAGAAGAAGGCTGATTCCAAGTATAGAGATAAGGGATCTTCCTCATCCGCCAGCAGAACAGAGTACCGTAGGTTGGAGA
GCGGCCCCAGCCGGAGCCGACCTTATGAACGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAATATCGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGA
CCTGAGAAGCTCCAAGGAGACCCGGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCATCGCGATCACGACCATAATACGACAAGCTGCTGGGAATTGAAGCGCCAGAT
TGAAGGCCTCATTCAAGATGGCTACTTTAAAAAGTTCGTGGGCAAACCGAGGTCTAACTCGGTTGAAAAGAAGGAAAAGAGAAAGCGTTCAAGAACACCGCCTTGCCGAG
ATGATCGACCTGCGGTCATCAACACTATTTTCGGAGGTCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAGGCCAGGCGCGAGGGGATCCATTTG
CCCCATAATGACGCGCTCGTGATCGCCCCTCTCATTGATCACGTCCTGGTCCAAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCCCTCAAAACATATCT
AGCATTGGGATGGACCAAGTCACAATTGAAGAAGAGTCCAACACCCTTGGCTGGATTCTCTAGAGAATCGGTCTCCCTAGAAGGGTGCATTGACCTGCCGGTAATGATCG
GGCAAGATGCTACCCAAGTAACGCAGATGGCCGAGTTCGTGTGTTATGCATCCACGCTTAAAGGGTCAGCGGTATGCGCCCTGGAAAAGCAAGCCAATCGTGGCAAGCTG
CAAGGGTCCGAGACAAACCTACCCAAGGAAGGCAAAAGGCAGTTCTCCCCGCCAACAGAAGAGTTCGTGCTTGTTCCTTTACTTAGCTTTGAAAAACAAGTAAGCATAGG
AACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCCCAGGTCTAACTCGGACGTCTTCACATGGTCTCACGAGGACATGCCTGGCATCGACCCAAAGATTA
TGACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACACTCCTTCAATCTTGGAGCCAAATGTCATGAAGGTTGATACTCCATCACCCACTTGGATGGACCCAATC
GTGGATTTCATCAAAGGAAACCCACCGCAAGATCCGAAAGAGCAAAAGAAGATGGCGCGAAGACCAGTTTGTGCCGACGTGGCTTCTCCTTGCCTCTGCTCAAGTGTGTG
A
mRNA sequenceShow/hide mRNA sequence
ATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACGGAGGGGTGTGAACGCTGATAATGGCACTCAGCGAGACCTCGACACAAGAATGGTCGAGGACCAGGTCCGAAC
AGGACCAGAGGGAGATCTGCCATGCAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAGGAGGGACCTCGA
GAAAAACCTCCCAAAGGGCCAACCGGGCAGCAGACCCTGAAACTTTGTCTACTCTCCAGCGCGAGGTCCCGGGCGCACCTGGTGAAAAGGGAGCCCCATCCATCCAACCT
GGCGACCGCGAGCCCATTCCCAACGTTGAAGGAGTGGATTATAGCTTGCGGGACAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGAAGCCGGA
AGACTCTCCTTCCTACTCCCGAGAGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACAAGCCTCTGATGCCAGAAGCTGTGATCACTAGAGAAGAGTTCGACCTAA
TGAAGCACAAGCTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAATGTTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCCCGGACATC
CTGGAGGCTCCAATCCCTCCGAAGTTCAAAACTCCCACAATGAAGCCTTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGGTCTTCGAAGGCCTCATGGACTTTCA
AGCGGCAACAGATGCAATCAAGTGCCGTGCCTTCCAGATCGCGCTCACCGGTAGCACGCCTGTGACAGCGACTCATCTCACCACTATCAGGCAGAAGGAGGGAGAGACGC
TGAGAGAGTATGTCACGAGGTTCCAGGAGGAGCAGTTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTTCTACTTCCTCACCGGCCTGGCCGATGAGACCTTAACA
GTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTCCTGCAGAAGGCGAAGAAGGTCATTGATGGGCAAGAGCTCCTCCGAACCAAAACTGGCCGACCTGAGAA
GCAGATCGACCAGAAGAAGTTGAGCCAAGAGAAGAAGAAGGCTGATTCCAAGTATAGAGATAAGGGATCTTCCTCATCCGCCAGCAGAACAGAGTACCGTAGGTTGGAGA
GCGGCCCCAGCCGGAGCCGACCTTATGAACGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAATATCGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGA
CCTGAGAAGCTCCAAGGAGACCCGGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCATCGCGATCACGACCATAATACGACAAGCTGCTGGGAATTGAAGCGCCAGAT
TGAAGGCCTCATTCAAGATGGCTACTTTAAAAAGTTCGTGGGCAAACCGAGGTCTAACTCGGTTGAAAAGAAGGAAAAGAGAAAGCGTTCAAGAACACCGCCTTGCCGAG
ATGATCGACCTGCGGTCATCAACACTATTTTCGGAGGTCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAGGCCAGGCGCGAGGGGATCCATTTG
CCCCATAATGACGCGCTCGTGATCGCCCCTCTCATTGATCACGTCCTGGTCCAAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCCCTCAAAACATATCT
AGCATTGGGATGGACCAAGTCACAATTGAAGAAGAGTCCAACACCCTTGGCTGGATTCTCTAGAGAATCGGTCTCCCTAGAAGGGTGCATTGACCTGCCGGTAATGATCG
GGCAAGATGCTACCCAAGTAACGCAGATGGCCGAGTTCGTGTGTTATGCATCCACGCTTAAAGGGTCAGCGGTATGCGCCCTGGAAAAGCAAGCCAATCGTGGCAAGCTG
CAAGGGTCCGAGACAAACCTACCCAAGGAAGGCAAAAGGCAGTTCTCCCCGCCAACAGAAGAGTTCGTGCTTGTTCCTTTACTTAGCTTTGAAAAACAAGTAAGCATAGG
AACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCCCAGGTCTAACTCGGACGTCTTCACATGGTCTCACGAGGACATGCCTGGCATCGACCCAAAGATTA
TGACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACACTCCTTCAATCTTGGAGCCAAATGTCATGAAGGTTGATACTCCATCACCCACTTGGATGGACCCAATC
GTGGATTTCATCAAAGGAAACCCACCGCAAGATCCGAAAGAGCAAAAGAAGATGGCGCGAAGACCAGTTTGTGCCGACGTGGCTTCTCCTTGCCTCTGCTCAAGTGTGTG
A
Protein sequenceShow/hide protein sequence
MVHPANSANTTERRGVNADNGTQRDLDTRMVEDQVRTGPEGDLPCRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANRAADPETLSTLQREVPGAPGEKGAPSIQP
GDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASRKPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVEALKARCEKKECSFDDGDLGESPFTPDI
LEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSTPVTATHLTTIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMFYFLTGLADETLT
VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKKKADSKYRDKGSSSSASRTEYRRLESGPSRSRPYERYTPTTIPISEILTNIEESGMEKLLKR
PEKLQGDPEKRNKDKYCRFHRDHDHNTTSCWELKRQIEGLIQDGYFKKFVGKPRSNSVEKKEKRKRSRTPPCRDDRPAVINTIFGGPSGGQSGNKRKELAREARREGIHL
PHNDALVIAPLIDHVLVQRVLVDGGASANILSLKTYLALGWTKSQLKKSPTPLAGFSRESVSLEGCIDLPVMIGQDATQVTQMAEFVCYASTLKGSAVCALEKQANRGKL
QGSETNLPKEGKRQFSPPTEEFVLVPLLSFEKQVSIGTKLGATDREELINFPRSNSDVFTWSHEDMPGIDPKIMTDLARSVPVEILDTPSILEPNVMKVDTPSPTWMDPI
VDFIKGNPPQDPKEQKKMARRPVCADVASPCLCSSV