; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g19330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g19330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:13014394..13020439
RNA-Seq ExpressionMoc03g19330
SyntenyMoc03g19330
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.4e-23784.24Show/hide
Query:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD
        +AESS N    AG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADE L
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLREP-------------RRGAARTS-----------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFG
        +SGMEKLLKRPEKLR               R     TS                     +G   T+     S EKKEERKRSRTPPRRTDRPAVINTIFG
Subjt:  DSGMEKLLKRPEKLREP-------------RRGAARTS-----------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFG

Query:  GPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLV
        GPSGGQSG KRKELAR ARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPT LV
Subjt:  GPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV
        GFSGESVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]3.7e-18483.14Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADE LTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKT RP+RKI
Subjt:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLREP-------------RRGAARTS--
        GRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLR               R     TS  
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLREP-------------RRGAARTS--

Query:  ---------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGAD
                           +G   T+     S EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGAD
Subjt:  ---------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGAD

Query:  LEEVHLPHNDALVIAPLIDHVVVRRVL
         EEVHLPHNDA VIAPLIDHVVVRRVL
Subjt:  LEEVHLPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.5e-20965.47Show/hide
Query:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ
        SSNQQAESSHN A   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFL GLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLA

Query:  DEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DE LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKT RPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTATTPQDQLSR-----------------------EKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLR    RR   +           T+ + +L R                       EKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTATTPQDQLSR-----------------------EKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGF
        SGGQSGHKRKELAR ARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S T LVGF
Subjt:  SGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+D                                        GEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMEIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  PE + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMEIG

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]9.7e-20155.6Show/hide
Query:  RRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQIIAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNE
        RR LAA+  HQREVGA  VEGQGH+ L TEPL RSA+I  P LPPAHP+ SKA                                         E+ YN 
Subjt:  RRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQIIAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNE

Query:  MVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNLAGIITREEFDQLRG
        +                                                                                      G+ITREEFDQL+ 
Subjt:  MVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNLAGIITREEFDQLRG

Query:  ELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR
        + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR
Subjt:  ELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR

Query:  SISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDG
         ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFL GLADE LTVKL EEAPATFAEVLQK KKVIDG
Subjt:  SISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDG

Query:  QELLRTKTDRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR---EPRRGA
        QELLRTKT RPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLLKRPEKLR   E R   
Subjt:  QELLRTKTDRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR---EPRRGA

Query:  ARTSIAASIGSTATT-----------------------PQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQ
                 G   +                        P+   S EKKEERKR RTPPRR DRPAVIN             K+KELAREARREVCIIREQ
Subjt:  ARTSIAASIGSTATT-----------------------PQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPT LVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ

Query:  MAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALE--TLRD
        MAEFVV+D                                        GE   SRECYA+  K SSVCALE  T+RD
Subjt:  MAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALE--TLRD

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]5.7e-18564.94Show/hide
Query:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRTQMR+ME MYN+MV  AGA SRS ++V   DV EQ   H  P +EE              GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCR
        +N     G+ITREEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IPPKFK PT+K YDG KDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLG
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KTTTHLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFL GLADE  TVKLG
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLG

Query:  EEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGM
        EEA ATFAEVLQ  KK IDGQELLRTKTDRPE++I + +S +D+R AD KSKDKGS SS  R +Y R+                                
Subjt:  EEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGM

Query:  EKLLKRPEKLREPRRGAARTSIAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPI
                                             S EKKEERKRSRTPPR  DRPAVINTIFGGPSGGQSG+KRKELAREA REVCIIREQ PTC +
Subjt:  EKLLKRPEKLREPRRGAARTSIAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPI

Query:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCI
        TFD +DLE VHLP+NDALVIAPLIDHV+VRRVLVDGGASANILS    LALGWTRSQLK+SPT LVGFS ESV  +G +
Subjt:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCI

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.1e-23784.24Show/hide
Query:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD
        +AESS N    AG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADE L
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLREP-------------RRGAARTS-----------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFG
        +SGMEKLLKRPEKLR               R     TS                     +G   T+     S EKKEERKRSRTPPRRTDRPAVINTIFG
Subjt:  DSGMEKLLKRPEKLREP-------------RRGAARTS-----------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFG

Query:  GPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLV
        GPSGGQSG KRKELAR ARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPT LV
Subjt:  GPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV
        GFSGESVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.2e-20965.47Show/hide
Query:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ
        SSNQQAESSHN A   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFL GLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLA

Query:  DEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DE LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKT RPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTATTPQDQLSR-----------------------EKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLR    RR   +           T+ + +L R                       EKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTATTPQDQLSR-----------------------EKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGF
        SGGQSGHKRKELAR ARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S T LVGF
Subjt:  SGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+D                                        GEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMEIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  PE + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMEIG

A0A6J1D9W7 uncharacterized protein LOC1110187081.8e-18483.14Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADE LTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKT RP+RKI
Subjt:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLREP-------------RRGAARTS--
        GRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLR               R     TS  
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLREP-------------RRGAARTS--

Query:  ---------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGAD
                           +G   T+     S EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGAD
Subjt:  ---------------IAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGAD

Query:  LEEVHLPHNDALVIAPLIDHVVVRRVL
         EEVHLPHNDA VIAPLIDHVVVRRVL
Subjt:  LEEVHLPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204794.7e-20155.6Show/hide
Query:  RRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQIIAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNE
        RR LAA+  HQREVGA  VEGQGH+ L TEPL RSA+I  P LPPAHP+ SKA                                         E+ YN 
Subjt:  RRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQIIAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNE

Query:  MVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNLAGIITREEFDQLRG
        +                                                                                      G+ITREEFDQL+ 
Subjt:  MVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNLAGIITREEFDQLRG

Query:  ELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR
        + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR
Subjt:  ELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR

Query:  SISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDG
         ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFL GLADE LTVKL EEAPATFAEVLQK KKVIDG
Subjt:  SISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDG

Query:  QELLRTKTDRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR---EPRRGA
        QELLRTKT RPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLLKRPEKLR   E R   
Subjt:  QELLRTKTDRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR---EPRRGA

Query:  ARTSIAASIGSTATT-----------------------PQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQ
                 G   +                        P+   S EKKEERKR RTPPRR DRPAVIN             K+KELAREARREVCIIREQ
Subjt:  ARTSIAASIGSTATT-----------------------PQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPT LVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ

Query:  MAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALE--TLRD
        MAEFVV+D                                        GE   SRECYA+  K SSVCALE  T+RD
Subjt:  MAEFVVVD----------------------------------------GEQTASRECYAAALKGSSVCALE--TLRD

A0A6J1DPN4 uncharacterized protein LOC1110230602.8e-18564.94Show/hide
Query:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRTQMR+ME MYN+MV  AGA SRS ++V   DV EQ   H  P +EE              GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCR
        +N     G+ITREEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IPPKFK PT+K YDG KDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLG
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KTTTHLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFL GLADE  TVKLG
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLG

Query:  EEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGM
        EEA ATFAEVLQ  KK IDGQELLRTKTDRPE++I + +S +D+R AD KSKDKGS SS  R +Y R+                                
Subjt:  EEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGM

Query:  EKLLKRPEKLREPRRGAARTSIAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPI
                                             S EKKEERKRSRTPPR  DRPAVINTIFGGPSGGQSG+KRKELAREA REVCIIREQ PTC +
Subjt:  EKLLKRPEKLREPRRGAARTSIAASIGSTATTPQDQLSREKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPI

Query:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCI
        TFD +DLE VHLP+NDALVIAPLIDHV+VRRVLVDGGASANILS    LALGWTRSQLK+SPT LVGFS ESV  +G +
Subjt:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTSLVGFSGESVIPEGCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAGCCTGGCCCAGGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGGCCGAGTTCGAGCTACAATCAGGAACACACTGTTATCGAAGGAC
TCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACAGATCATCG
CGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAAC
TTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCTATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAA
TCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAG
ACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCTC
GCGGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGG
CGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGATGAAGGATCCCAAGGACT
ATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGA
CTGCCAGCCAGGTCGATCTCAACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAG
GCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCAACGGTC
TAGCCGACGAAGTCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACC
AAAACCGACCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGGAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTA
TCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAA
AACTACTCAAGCGTCCGGAGAAACTTCGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACCCCAGGACCAGCTCAGC
AGAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACA
TAAAAGAAAGGAGTTAGCCCGTGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACC
TGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTAC
CTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACTTCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCT
GGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCT
GCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCC
GAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCCATCTTAGAGCCAGAACTGATGGAGATCGGCGCTCC
AGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAGGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAG
ATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCCGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAG
GGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTA
TCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAGCCTGGCCCAGGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGGCCGAGTTCGAGCTACAATCAGGAACACACTGTTATCGAAGGAC
TCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACAGATCATCG
CGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAAC
TTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCTATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAA
TCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAG
ACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCTC
GCGGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGG
CGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGATGAAGGATCCCAAGGACT
ATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGA
CTGCCAGCCAGGTCGATCTCAACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAG
GCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCAACGGTC
TAGCCGACGAAGTCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACC
AAAACCGACCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGGAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTA
TCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAA
AACTACTCAAGCGTCCGGAGAAACTTCGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACCCCAGGACCAGCTCAGC
AGAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACA
TAAAAGAAAGGAGTTAGCCCGTGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACC
TGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTAC
CTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACTTCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCT
GGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCT
GCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCC
GAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCCATCTTAGAGCCAGAACTGATGGAGATCGGCGCTCC
AGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAGGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAG
ATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCCGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAG
GGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTA
TCCTTGA
Protein sequenceShow/hide protein sequence
MLSMRAEVSLAQVRPSVQVGPEAEFELQSGTHCYRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQIIAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSEN
FDALQREMEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNL
AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGMKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRR
LPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLNGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRT
KTDRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLREPRRGAARTSIAASIGSTATTPQDQLS
REKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY
LALGWTRSQLKRSPTSLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGEQTASRECYAAALKGSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSP
EKQLASAYETDLARSVPVEILDNPSILEPELMEIGAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLRCLTPEEGPRVQTHVGALDPAWE
GPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP