CuGenDBv2

Moc06g20530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g20530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:16014010..16020020
RNA-Seq Expression: Moc06g20530
Synteny: Moc06g20530
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
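
The genome location in this record follows the common `chromosome:start..end` pattern. As a small illustration, the sketch below parses such a string and reports the length of the span. The helper name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("chr6:16014010..16020020")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under a 0-based, half-open convention the `+ 1` would be dropped, so the reported length should be read with that caveat in mind.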


Homology
GenBank top hits
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 7.7e-252; %identity: 87.1)
Query:  AESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA
        AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY+G+KDPKDYVEVFE LMDFQAASDA
Subjt:  AESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT
        IKCRAF+IALT SARLWYRRLPA SI TYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRF+EEQLKVAHCSDDSA+CYFLTGLADEALT
Subjt:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT

Query:  VKLGEEAPTTFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKE
        VKLGEEAP TFAE                         IGRGRSGKD E ADPKSKDKGSFS+GRAE RRAE+GPTRSRPYERFTPTTI ISEILTNI+E
Subjt:  VKLGEEAPTTFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKE

Query:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQ
        SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKR IE+LIQDGYFKKFVG+PRTSSAEKKE+RKRSRTPPRRTDRPAVINTIF GPSGGQ
Subjt:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQ

Query:  SGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGES
        SG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAP IDHVVV RVLVDGG SANILSLPTYLAL WTRSQLKKS TPLVGFSGES
Subjt:  SGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGES

Query:  VIPEGCIDLPVTLGQDRTRVTQMAEFV
        VIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  VIPEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia] (e value: 3.3e-202; %identity: 87.91)
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPY+G+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALT SARLWYRRLPARSI TYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------I
        FSSR Y KKT THLATIRQKEG TLREYVTRF+EEQLKVAHCSDDSA+CYFLTGLADEALTVKLGE+APTTFAE                         I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------I

Query:  GRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFS+GRAE RRAESGPT+SRPYERFTPTTI ISEILTNI+ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKR IEDLIQDGYFKKFVG+PRTSSAEKKE+RKRSRTPPRRTDRPAVINTIF GPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPWIDHVVVRRVL
        LPHNDA VIAP IDHVVVRRVL
Subjt:  LPHNDALVIAPWIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 1.6e-252; %identity: 72.46)
Query:  AESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA
        AESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK Y+G+KDPKDYVEVFEGLMDFQAASDA
Subjt:  AESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT
        IKCRAFQIALT SARLW                                                     F+E+QLKVA  SDDSA+CYFLTGLADEALT
Subjt:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT

Query:  VKLGEEAPTTFAE-------------------------IGRGRSGKDERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKES
        VKLG+EAP TFAE                         I RGRSGKDE+AD KSKDKGSFS+GRAE RRA +GPTRSRPYERFTPTTI ISEILTNI+ES
Subjt:  VKLGEEAPTTFAE-------------------------IGRGRSGKDERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKES

Query:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQS
        GMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKR IEDLIQD YFKKFVG+PRTSSAEKKE+RK SRTP RR DRPAVINTIF GPSGGQS
Subjt:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQS

Query:  GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESV
        GHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAP IDHVVVRRVLVD G SANI+SL TYLAL WTRSQLKKS TPLVGFS ESV
Subjt:  GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESV

Query:  IPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRDGT
        IPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALK SSVCALETL  RDGT
Subjt:  IPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRDGT

Query:  LEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISE--PDLMEIGALES
        LEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+    E  P+ + +G   S
Subjt:  LEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISE--PDLMEIGALES

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] (e value: 8.7e-211; %identity: 87.47)
Query:  ICYFLTGLADEALTVKLGEEAPTTFAE------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISE
        +CYFLTGLADEALTVKL EEAP TFAE                  IG+GRSGKD E  DPKSKDKGSFSNGRAE RRAE+GPTRSRPYERFTPTTI ISE
Subjt:  ICYFLTGLADEALTVKLGEEAPTTFAE------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISE

Query:  ILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF
        ILTNI+ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK  IEDLIQDGYFKKFVG+PRTSSAEKKE+RKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF

Query:  CGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPL
         GPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAP IDHVVVRRVLVDGGASANILSLPTYLAL WTRSQLKKS TPL
Subjt:  CGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALE
        VGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LK +SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALE

Query:  TLTGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        TLT RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  TLTGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 2.3e-248; %identity: 63.76)
Query:  PSVGNDNLNRRTLAASGAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGAGGPAPAPTSENFDALQREMEAMRTQM
        P+   +  +RR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                       
Subjt:  PSVGNDNLNRRTLAASGAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGAGGPAPAPTSENFDALQREMEAMRTQM

Query:  RTMEEMYNEMMLAAGAGFRSENRVTRVDVREQRGSGPRRNVPKTTRARGTLARGETSAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSL
                                                                 AESS+NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S 
Subjt:  RTMEEMYNEMMLAAGAGFRSENRVTRVDVREQRGSGPRRNVPKTTRARGTLARGETSAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSL

Query:  NDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRH
        +DGDLGE  F+SD+LEA IPPKFK PT+KPY+G+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALT SARLWYRRLPAR I TYSQLR+EF++QFSSRH
Subjt:  NDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRH

Query:  YDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------IGRGRS
        YD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSA+CYFLTGLADE LTVKL EEAP TFAE                         I +GR+
Subjt:  YDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------IGRGRS

Query:  GKDE-RADPKSKDKG-SFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELK
        GKD+ +AD KS+DKG S S+ R + RR+ S   +SRPYE +TPTTI I EILTNI+E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELK
Subjt:  GKDE-RADPKSKDKG-SFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELK

Query:  RHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHN
        R IEDLIQDGYFKKFVG+PR++S EKKE+RKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQ PT  I F+ ADLE VHLPHN
Subjt:  RHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHN

Query:  DALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPI
        DALVIAP ID V+VRR+LVDGGASANILSL TYLAL WTRSQLKKS TPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPI
Subjt:  DALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPI

Query:  IHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRD
        IHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE  T RD
Subjt:  IHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRD

TrEMBL top hits
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 3.7e-252; %identity: 87.1)
Query:  AESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA
        AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY+G+KDPKDYVEVFE LMDFQAASDA
Subjt:  AESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT
        IKCRAF+IALT SARLWYRRLPA SI TYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRF+EEQLKVAHCSDDSA+CYFLTGLADEALT
Subjt:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT

Query:  VKLGEEAPTTFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKE
        VKLGEEAP TFAE                         IGRGRSGKD E ADPKSKDKGSFS+GRAE RRAE+GPTRSRPYERFTPTTI ISEILTNI+E
Subjt:  VKLGEEAPTTFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKE

Query:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQ
        SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKR IE+LIQDGYFKKFVG+PRTSSAEKKE+RKRSRTPPRRTDRPAVINTIF GPSGGQ
Subjt:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQ

Query:  SGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGES
        SG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAP IDHVVV RVLVDGG SANILSLPTYLAL WTRSQLKKS TPLVGFSGES
Subjt:  SGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGES

Query:  VIPEGCIDLPVTLGQDRTRVTQMAEFV
        VIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  VIPEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 7.5e-253; %identity: 72.46)
Query:  AESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA
        AESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK Y+G+KDPKDYVEVFEGLMDFQAASDA
Subjt:  AESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT
        IKCRAFQIALT SARLW                                                     F+E+QLKVA  SDDSA+CYFLTGLADEALT
Subjt:  IKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALT

Query:  VKLGEEAPTTFAE-------------------------IGRGRSGKDERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKES
        VKLG+EAP TFAE                         I RGRSGKDE+AD KSKDKGSFS+GRAE RRA +GPTRSRPYERFTPTTI ISEILTNI+ES
Subjt:  VKLGEEAPTTFAE-------------------------IGRGRSGKDERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKES

Query:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQS
        GMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKR IEDLIQD YFKKFVG+PRTSSAEKKE+RK SRTP RR DRPAVINTIF GPSGGQS
Subjt:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQS

Query:  GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESV
        GHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAP IDHVVVRRVLVD G SANI+SL TYLAL WTRSQLKKS TPLVGFS ESV
Subjt:  GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESV

Query:  IPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRDGT
        IPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALK SSVCALETL  RDGT
Subjt:  IPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRDGT

Query:  LEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISE--PDLMEIGALES
        LEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+    E  P+ + +G   S
Subjt:  LEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISE--PDLMEIGALES

A0A6J1D9W7 uncharacterized protein LOC111018708 (e value: 1.6e-202; %identity: 87.91)
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPY+G+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALT SARLWYRRLPARSI TYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------I
        FSSR Y KKT THLATIRQKEG TLREYVTRF+EEQLKVAHCSDDSA+CYFLTGLADEALTVKLGE+APTTFAE                         I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------I

Query:  GRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFS+GRAE RRAESGPT+SRPYERFTPTTI ISEILTNI+ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKR IEDLIQDGYFKKFVG+PRTSSAEKKE+RKRSRTPPRRTDRPAVINTIF GPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPWIDHVVVRRVL
        LPHNDA VIAP IDHVVVRRVL
Subjt:  LPHNDALVIAPWIDHVVVRRVL

A0A6J1DD03 uncharacterized protein LOC111019899 (e value: 4.2e-211; %identity: 87.47)
Query:  ICYFLTGLADEALTVKLGEEAPTTFAE------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISE
        +CYFLTGLADEALTVKL EEAP TFAE                  IG+GRSGKD E  DPKSKDKGSFSNGRAE RRAE+GPTRSRPYERFTPTTI ISE
Subjt:  ICYFLTGLADEALTVKLGEEAPTTFAE------------------IGRGRSGKD-ERADPKSKDKGSFSNGRAECRRAESGPTRSRPYERFTPTTILISE

Query:  ILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF
        ILTNI+ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK  IEDLIQDGYFKKFVG+PRTSSAEKKE+RKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIF

Query:  CGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPL
         GPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAP IDHVVVRRVLVDGGASANILSLPTYLAL WTRSQLKKS TPL
Subjt:  CGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALE
        VGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LK +SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALE

Query:  TLTGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        TLT RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  TLTGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 1.1e-248; %identity: 63.76)
Query:  PSVGNDNLNRRTLAASGAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGAGGPAPAPTSENFDALQREMEAMRTQM
        P+   +  +RR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                       
Subjt:  PSVGNDNLNRRTLAASGAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGAGGPAPAPTSENFDALQREMEAMRTQM

Query:  RTMEEMYNEMMLAAGAGFRSENRVTRVDVREQRGSGPRRNVPKTTRARGTLARGETSAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSL
                                                                 AESS+NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S 
Subjt:  RTMEEMYNEMMLAAGAGFRSENRVTRVDVREQRGSGPRRNVPKTTRARGTLARGETSAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSL

Query:  NDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRH
        +DGDLGE  F+SD+LEA IPPKFK PT+KPY+G+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALT SARLWYRRLPAR I TYSQLR+EF++QFSSRH
Subjt:  NDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSIWTYSQLRREFLAQFSSRH

Query:  YDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------IGRGRS
        YD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSA+CYFLTGLADE LTVKL EEAP TFAE                         I +GR+
Subjt:  YDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAE-------------------------IGRGRS

Query:  GKDE-RADPKSKDKG-SFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELK
        GKD+ +AD KS+DKG S S+ R + RR+ S   +SRPYE +TPTTI I EILTNI+E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELK
Subjt:  GKDE-RADPKSKDKG-SFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELK

Query:  RHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHN
        R IEDLIQDGYFKKFVG+PR++S EKKE+RKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQ PT  I F+ ADLE VHLPHN
Subjt:  RHIEDLIQDGYFKKFVGEPRTSSAEKKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHN

Query:  DALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPI
        DALVIAP ID V+VRR+LVDGGASANILSL TYLAL WTRSQLKKS TPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPI
Subjt:  DALVIAPWIDHVVVRRVLVDGGASANILSLPTYLALRWTRSQLKKSQTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPI

Query:  IHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRD
        IHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE  T RD
Subjt:  IHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSSVCALETLTGRD

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
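
The alignments in this section use a BLAST-style midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how those columns can be tallied, the hypothetical helper `midline_stats` below counts each kind for one midline block. It is an illustration only: the %identity values reported on this page cover each full alignment, and whitespace lost during text extraction could shift a per-block count.

```python
def midline_stats(midline: str) -> dict:
    """Tally identities, positives and remaining columns in a BLAST-style midline.

    Letters are identical residues, '+' marks conservative substitutions,
    and anything else (normally a space) is a mismatch or gap column.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positive = midline.count("+")
    total = len(midline)
    return {
        "identical": identical,
        "positive": positive,
        "other": total - identical - positive,
        "percent_identity": round(100.0 * identical / total, 2) if total else 0.0,
    }


# Last midline block of the first GenBank hit (XP_022137317.1), copied from this page.
stats = midline_stats("VIPEG IDLPVTLGQD+T+VTQMAEFV")
print(stats)
```

Summing the counts over every block of a hit, rather than using a single block, would be the natural way to compare against the reported per-hit figure.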

Sequences
CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGGAGCTCGGCAGGGGCCAATGTCCGCCCAAGTGTTCAGATTGGTCTGGAGGCCGAGTTCG
AGCTGCAATCAGAAATACACTGTTGTGCATATCCTTGCATAAACATTTGGCACCGTCTGTGGGGAACGACAATCTAAATCGAAGGACTCTAGCTGCCAGCGGTGCCCACC
AGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCAC
CCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAAAAGGGCGCCGGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGAT
GGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTTTCGATCTGAAAATCGAGTGACGCGCGTGGACGTAC
GCGAGCAGAGGGGTTCCGGGCCGAGGAGAAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCTAGAGGGGAGACCTCCGCTGAATCCTCTCACAATCCCGCAGGG
ATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTT
GGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCCACCGTGAAGCCTTATGAAGGGACGAAGGACCCCAAGGACTATGTTG
AGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGAGCCTTTCAGATCGCGCTTACTAGCAGCGCGCGATTGTGGTACCGAAGACTGCCA
GCAAGGTCGATCTGGACCTATTCTCAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAA
GGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCAAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATCTGCTATTTCCTCACCGGTCTAGCCG
ACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAG
GGATCCTTTTCCAACGGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACTAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCTAATTTCCGAGATCCT
AACAAACATCAAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCGGAAAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGATTCCATCGGGAGC
ACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCATATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAGAGCCCAGGACTAGCTCAGCAGAG
AAAAAGGAAAAGCGAAAACGTTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTTGTGGGCCAAGCGGGGGTCAGTCCGGACATAAAAG
AAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCC
ACAATGATGCACTTGTGATTGCTCCCTGGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTATCCTTACCGACTTACCTGGCC
TTAAGATGGACGAGGTCGCAATTGAAGAAAAGCCAGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCA
GGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATTATCCACTCATTTCGGGCCATTCCCT
CAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAAGCTCATCG
GTCTGCGCCCTCGAAACACTCACCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCT
GCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCTGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCAGAGCCAGATCTGATGGAGA
TCGGTGCTCTAGAATCCTCATGGATAGACCCGATCGTGGACTTCATTGGGGGCAACTCACCACGAAACCCCAAGGAGCGCAGGAAGTTGGCAAGGCGGGCAGCTCGGTTC
GTGGTCCGAGATGGGATCGACATGCCATCTGACAGAGTAGTGCATTACGAGCCTACGGCAAATGAGGAAGAGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGC
AATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCCTGCGTTCGACCTCGGACCTTTCAGGTCGAACATCTGGTCTTAAGAAGGGTCC
AAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGAGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCACG
CACTCATGGAACGCGGAGCACCTGAAGCGTTATTATCCCTGA
mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGGAGCTCGGCAGGGGCCAATGTCCGCCCAAGTGTTCAGATTGGTCTGGAGGCCGAGTTCG
AGCTGCAATCAGAAATACACTGTTGTGCATATCCTTGCATAAACATTTGGCACCGTCTGTGGGGAACGACAATCTAAATCGAAGGACTCTAGCTGCCAGCGGTGCCCACC
AGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCAC
CCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAAAAGGGCGCCGGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGAT
GGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTTTCGATCTGAAAATCGAGTGACGCGCGTGGACGTAC
GCGAGCAGAGGGGTTCCGGGCCGAGGAGAAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCTAGAGGGGAGACCTCCGCTGAATCCTCTCACAATCCCGCAGGG
ATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTT
GGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCCACCGTGAAGCCTTATGAAGGGACGAAGGACCCCAAGGACTATGTTG
AGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGAGCCTTTCAGATCGCGCTTACTAGCAGCGCGCGATTGTGGTACCGAAGACTGCCA
GCAAGGTCGATCTGGACCTATTCTCAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAA
GGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCAAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATCTGCTATTTCCTCACCGGTCTAGCCG
ACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAG
GGATCCTTTTCCAACGGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACTAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCTAATTTCCGAGATCCT
AACAAACATCAAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCGGAAAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGATTCCATCGGGAGC
ACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCATATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAGAGCCCAGGACTAGCTCAGCAGAG
AAAAAGGAAAAGCGAAAACGTTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTTGTGGGCCAAGCGGGGGTCAGTCCGGACATAAAAG
AAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCC
ACAATGATGCACTTGTGATTGCTCCCTGGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTATCCTTACCGACTTACCTGGCC
TTAAGATGGACGAGGTCGCAATTGAAGAAAAGCCAGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCA
GGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATTATCCACTCATTTCGGGCCATTCCCT
CAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAAGCTCATCG
GTCTGCGCCCTCGAAACACTCACCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCT
GCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCTGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCAGAGCCAGATCTGATGGAGA
TCGGTGCTCTAGAATCCTCATGGATAGACCCGATCGTGGACTTCATTGGGGGCAACTCACCACGAAACCCCAAGGAGCGCAGGAAGTTGGCAAGGCGGGCAGCTCGGTTC
GTGGTCCGAGATGGGATCGACATGCCATCTGACAGAGTAGTGCATTACGAGCCTACGGCAAATGAGGAAGAGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGC
AATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCCTGCGTTCGACCTCGGACCTTTCAGGTCGAACATCTGGTCTTAAGAAGGGTCC
AAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGAGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCACG
CACTCATGGAACGCGGAGCACCTGAAGCGTTATTATCCCTGA
Protein sequence
MLSMRAEVNLAEVRPTGELGRGQCPPKCSDWSGGRVRAAIRNTLLCISLHKHLAPSVGNDNLNRRTLAASGAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAH
PRTSKATRGRGGTSKKGAGGPAPAPTSENFDALQREMEAMRTQMRTMEEMYNEMMLAAGAGFRSENRVTRVDVREQRGSGPRRNVPKTTRARGTLARGETSAESSHNPAG
IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLP
ARSIWTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFKEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAPTTFAEIGRGRSGKDERADPKSKDK
GSFSNGRAECRRAESGPTRSRPYERFTPTTILISEILTNIKESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRHIEDLIQDGYFKKFVGEPRTSSAE
KKEKRKRSRTPPRRTDRPAVINTIFCGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPWIDHVVVRRVLVDGGASANILSLPTYLA
LRWTRSQLKKSQTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKSSS
VCALETLTGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIGALESSWIDPIVDFIGGNSPRNPKERRKLARRAARF
VVRDGIDMPSDRVVHYEPTANEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNACVRPRTFQVEHLVLRRVQTHVGALDPAWEGPFEVKGIVRPETYVLADLKGDVLT
HSWNAEHLKRYYP
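
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the function name `translate` is hypothetical. It is applied here only to the first and last few codons of the CDS shown above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS string; unknown codons become 'X' (hypothetical helper)."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop at the first stop codon
        protein.append(aa)
    return "".join(protein)


# First nine codons and last five codons of the CDS listed on this page.
print(translate("ATGCTAAGTATGAGGGCCGAGGTGAAC"))
print(translate("CGTTATTATCCCTGA"))
```

To check the whole record, the CDS lines would be joined into one string and the result compared with the joined protein lines.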