; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g29210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g29210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:21960976..21965549
RNA-Seq ExpressionMoc06g29210
SyntenyMoc06g29210
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.8e-26490.89Show/hide
Query:  AESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE--------------
        AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE              
Subjt:  AESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE--------------

Query:  -------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
               IALTGSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
Subjt:  -------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT

Query:  VKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEE
        VKLG+EAPATF EVLQKAKKVIDGQELLRTKT RPER+IGR RSGKD E AD KSKDKGSFSSGRAE+RRA NG TRSRPYERFTPTTIPISEILTNIEE
Subjt:  VKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEE

Query:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQ
        SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSA+KKEERKRSRTPPRRTDRPAVINTIFGGPS GQ
Subjt:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQ

Query:  SGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES
        SG KRKE ARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGES
Subjt:  SGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES

Query:  VIPEGCIDLPVTMGQDQTQVTQMAEFV
        VIPEG IDLPVT+GQDQTQVTQMAEFV
Subjt:  VIPEGCIDLPVTMGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.3e-26677.49Show/hide
Query:  SENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE-------
        S NQ+  AESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVE       
Subjt:  SENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE-------

Query:  --------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG
                      IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTG
Subjt:  --------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG

Query:  LADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEI
        LADEALTVKLG EAPATF EVLQKAKKVIDGQELLRTKT RPER I R RSGKDEKADLKSKDKGSFSSGRAEFRRAVNG TRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFG
        LTNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSA+KKEERK SRTP RR DRPAVINTIFG
Subjt:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFG

Query:  GPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV
        GPS GQSGHKRKE ARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLV
Subjt:  GPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV

Query:  GFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET
        GFS ESVIPEGCIDLPVT+G DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  GFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET

Query:  LAGRDGTLEFEADLPRREFAAPTEELE----LKEHFFRGLDHPTKMMLNNAAN
        L  RDGTLEF+A+LPRREFAAPTEELE    L+  +   +DH  ++   ++ N
Subjt:  LAGRDGTLEFEADLPRREFAAPTEELE----LKEHFFRGLDHPTKMMLNNAAN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.0e-21991.28Show/hide
Query:  MCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTP
        MCYFLTGLADEALTVKL +EAPATF EVLQKAKKVIDGQELLRTK       IG+ RSGKD E  D KSKDKGSFS+GRAE+RRA NG TRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSA+KKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPS GQSGHKRK+ ARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVT+GQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFEADLPRREFAAPTEELEL
        +SVCALETL  RDGTLEFEADLP REFAAP EELEL
Subjt:  SSVCALETLAGRDGTLEFEADLPRREFAAPTEELEL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]5.4e-26969.96Show/hide
Query:  MVQPANLTNTADRKTLAASDAHQREVGVAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALQREMEAMR
        MVQPAN TNTADR+ LAA+  HQREVG  VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANLTNTADRKTLAASDAHQREVGVAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPT
                                     AESS+NP TP GVITR EFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPT

Query:  VKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRF
        +KPYDGSKDPKDYVE                     IALTGSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF
Subjt:  VKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADLKSKDKG-SFSSGRAEFRR
         EEQLKVAHCSDDSAMCYFLTGLADE LTVKL +EAPATF EVLQK KKVIDGQELLRTKT RPE+ I + R+GKD+ KAD KS+DKG S SS R ++RR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADLKSKDKG-SFSSGRAEFRR

Query:  AVNGHTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKK
        + + H +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S +KK
Subjt:  AVNGHTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKK

Query:  EERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI
        EERKR RTPPRR DRPAVIN             K+KE AR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANI
Subjt:  EERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI

Query:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVR
        LSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVR
Subjt:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVR

Query:  GEQTASRECYASALKGSSVCALETLAGRD
        GE   SRECYAS  K SSVCALE    RD
Subjt:  GEQTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.0e-20674.81Show/hide
Query:  EIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDE
        +IALTGSARLWYRRLPARSISTY+QLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLG+E
Subjt:  EIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDE

Query:  APATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDR-SGKDEKADLKSKDKGSFSS-GRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEESGMEK
        AP TFVEVLQKAKKVIDGQELLRTKT RPE++I + + S +  KAD KS+DKGS SS  R E+RR  +G +RSRPYER+T +TIPISEILTNIEESGMEK
Subjt:  APATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDR-SGKDEKADLKSKDKGSFSS-GRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S +KKEERKRSRTPPRR DRPAVINTIFGGP+ GQSG+KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKR

Query:  KESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG
        KE AR ARREVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                                        G
Subjt:  KESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG

Query:  CIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFE
        CIDLPVT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VCALE    R    E E
Subjt:  CIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFE

Query:  ADLP---RREFAAPTEELEL
        ADLP   +R+F  PTEELEL
Subjt:  ADLP---RREFAAPTEELEL

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.3e-26490.89Show/hide
Query:  AESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE--------------
        AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE              
Subjt:  AESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE--------------

Query:  -------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
               IALTGSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
Subjt:  -------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT

Query:  VKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEE
        VKLG+EAPATF EVLQKAKKVIDGQELLRTKT RPER+IGR RSGKD E AD KSKDKGSFSSGRAE+RRA NG TRSRPYERFTPTTIPISEILTNIEE
Subjt:  VKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEE

Query:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQ
        SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSA+KKEERKRSRTPPRRTDRPAVINTIFGGPS GQ
Subjt:  SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQ

Query:  SGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES
        SG KRKE ARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGES
Subjt:  SGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES

Query:  VIPEGCIDLPVTMGQDQTQVTQMAEFV
        VIPEG IDLPVT+GQDQTQVTQMAEFV
Subjt:  VIPEGCIDLPVTMGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.5e-26677.49Show/hide
Query:  SENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE-------
        S NQ+  AESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVE       
Subjt:  SENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE-------

Query:  --------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG
                      IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTG
Subjt:  --------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG

Query:  LADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEI
        LADEALTVKLG EAPATF EVLQKAKKVIDGQELLRTKT RPER I R RSGKDEKADLKSKDKGSFSSGRAEFRRAVNG TRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFG
        LTNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSA+KKEERK SRTP RR DRPAVINTIFG
Subjt:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFG

Query:  GPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV
        GPS GQSGHKRKE ARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLV
Subjt:  GPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV

Query:  GFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET
        GFS ESVIPEGCIDLPVT+G DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  GFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET

Query:  LAGRDGTLEFEADLPRREFAAPTEELE----LKEHFFRGLDHPTKMMLNNAAN
        L  RDGTLEF+A+LPRREFAAPTEELE    L+  +   +DH  ++   ++ N
Subjt:  LAGRDGTLEFEADLPRREFAAPTEELE----LKEHFFRGLDHPTKMMLNNAAN

A0A6J1DD03 uncharacterized protein LOC1110198991.4e-21991.28Show/hide
Query:  MCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTP
        MCYFLTGLADEALTVKL +EAPATF EVLQKAKKVIDGQELLRTK       IG+ RSGKD E  D KSKDKGSFS+GRAE+RRA NG TRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSA+KKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPS GQSGHKRK+ ARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVT+GQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFEADLPRREFAAPTEELEL
        +SVCALETL  RDGTLEFEADLP REFAAP EELEL
Subjt:  SSVCALETLAGRDGTLEFEADLPRREFAAPTEELEL

A0A6J1DHB3 uncharacterized protein LOC1110204792.6e-26969.96Show/hide
Query:  MVQPANLTNTADRKTLAASDAHQREVGVAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALQREMEAMR
        MVQPAN TNTADR+ LAA+  HQREVG  VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANLTNTADRKTLAASDAHQREVGVAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPT
                                     AESS+NP TP GVITR EFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPT

Query:  VKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRF
        +KPYDGSKDPKDYVE                     IALTGSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF
Subjt:  VKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADLKSKDKG-SFSSGRAEFRR
         EEQLKVAHCSDDSAMCYFLTGLADE LTVKL +EAPATF EVLQK KKVIDGQELLRTKT RPE+ I + R+GKD+ KAD KS+DKG S SS R ++RR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADLKSKDKG-SFSSGRAEFRR

Query:  AVNGHTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKK
        + + H +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S +KK
Subjt:  AVNGHTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKK

Query:  EERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI
        EERKR RTPPRR DRPAVIN             K+KE AR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANI
Subjt:  EERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI

Query:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVR
        LSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVR
Subjt:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVR

Query:  GEQTASRECYASALKGSSVCALETLAGRD
        GE   SRECYAS  K SSVCALE    RD
Subjt:  GEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249044.8e-20774.81Show/hide
Query:  EIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDE
        +IALTGSARLWYRRLPARSISTY+QLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLG+E
Subjt:  EIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDE

Query:  APATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDR-SGKDEKADLKSKDKGSFSS-GRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEESGMEK
        AP TFVEVLQKAKKVIDGQELLRTKT RPE++I + + S +  KAD KS+DKGS SS  R E+RR  +G +RSRPYER+T +TIPISEILTNIEESGMEK
Subjt:  APATFVEVLQKAKKVIDGQELLRTKTSRPERRIGRDR-SGKDEKADLKSKDKGSFSS-GRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S +KKEERKRSRTPPRR DRPAVINTIFGGP+ GQSG+KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKR

Query:  KESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG
        KE AR ARREVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                                        G
Subjt:  KESARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG

Query:  CIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFE
        CIDLPVT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VCALE    R    E E
Subjt:  CIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFE

Query:  ADLP---RREFAAPTEELEL
        ADLP   +R+F  PTEELEL
Subjt:  ADLP---RREFAAPTEELEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTTGACCAATACGGCAGATCGAAAGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGTAGCAGTGGTAGAGGGGCAAGGTCACGA
TGGCCTAGCAACAGAACCGCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGAGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCCGCAGGCGCAGGGTCCCGATCTGAAAATCAGGAGACCTCCGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGA
GTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCA
CCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGATCGCGCTTACCGGC
AGCGCGCGTTTGTGGTATCGGAGACTGCCAGCTAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCCTCTCGGCACTACGACAAAAAGAC
AGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGCGGGAATATGTCACTAGATTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGG
CCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGACGAGGCCCCGGCCACTTTCGTCGAAGTGCTGCAGAAGGCGAAGAAAGTCATC
GATGGGCAGGAGCTCCTCCGAACCAAAACCAGCCGACCAGAACGAAGGATCGGCCGGGATAGAAGCGGAAAAGATGAAAAGGCGGATCTCAAGTCCAAGGACAAGGGATC
TTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACACACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGA
ACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGC
CACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAAAGAAAAA
GGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCAGGGGTCAGTCCGGACATAAAAGAAAGG
AGTCAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGTCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAAT
GATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGATGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGG
ATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGATGGGGCAGGACC
AAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACA
CTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTG
CGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTAAAGAACATTTCT
TTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGGCTTAGCTTCA
CACAACGAACTATGGTGTTCGCAAAGATTTAGGGTAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGATTAC
AATGAACCAGAGGCTAAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCA
ACAATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACATTAACCCA
TATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGAAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCC
CTATGTTCCACCTACACAGCAATACATCCCACCGCCGCAACAACAGTACAATCAAAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGG
AGTACATGGCCCGAACCGACGTAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACATGTT
CTTTTCCAGGCCATACTGAATTACCAAGACGAGAAGGGAAAGAACAGTGAAAGCTGTCACCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTTGACCAATACGGCAGATCGAAAGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGTAGCAGTGGTAGAGGGGCAAGGTCACGA
TGGCCTAGCAACAGAACCGCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGAGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCCGCAGGCGCAGGGTCCCGATCTGAAAATCAGGAGACCTCCGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGA
GTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCA
CCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGATCGCGCTTACCGGC
AGCGCGCGTTTGTGGTATCGGAGACTGCCAGCTAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCCTCTCGGCACTACGACAAAAAGAC
AGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGCGGGAATATGTCACTAGATTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGG
CCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGACGAGGCCCCGGCCACTTTCGTCGAAGTGCTGCAGAAGGCGAAGAAAGTCATC
GATGGGCAGGAGCTCCTCCGAACCAAAACCAGCCGACCAGAACGAAGGATCGGCCGGGATAGAAGCGGAAAAGATGAAAAGGCGGATCTCAAGTCCAAGGACAAGGGATC
TTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACACACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGA
ACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGC
CACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAAAGAAAAA
GGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCAGGGGTCAGTCCGGACATAAAAGAAAGG
AGTCAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGTCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAAT
GATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGATGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGG
ATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGATGGGGCAGGACC
AAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACA
CTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTG
CGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTAAAGAACATTTCT
TTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGGCTTAGCTTCA
CACAACGAACTATGGTGTTCGCAAAGATTTAGGGTAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGATTAC
AATGAACCAGAGGCTAAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCA
ACAATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACATTAACCCA
TATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGAAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCC
CTATGTTCCACCTACACAGCAATACATCCCACCGCCGCAACAACAGTACAATCAAAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGG
AGTACATGGCCCGAACCGACGTAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACATGTT
CTTTTCCAGGCCATACTGAATTACCAAGACGAGAAGGGAAAGAACAGTGAAAGCTGTCACCCTTAG
Protein sequenceShow/hide protein sequence
MVQPANLTNTADRKTLAASDAHQREVGVAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENQETSAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEIALTG
SARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPATFVEVLQKAKKVI
DGQELLRTKTSRPERRIGRDRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGHTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHG
HNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSRGQSGHKRKESARAARREVCIIREQRPTCPITFDGADLEEVHLPHN
DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTMGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPST
LHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELKEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNGLAS
HNELWCSQRFRVAPKKQDPAGVLALDIATSMQKEMITMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNNLICSFCSENHIYDNCPHNPASVFYVGHGNNRNINP
YSNTYNPGWRHHPNFSWGGQEGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMEYMARTDVVIQSQAASMRNFETQLGQLANELKNRPHV
LFQAILNYQDEKGKNSESCHP