CuGenDBv2

Moc08g40500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g40500
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr8:31067815..31073412
RNA-Seq Expression: Moc08g40500
Synteny: Moc08g40500
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
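
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the names `Locus` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'chr8:31067815..31073412'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("chr8:31067815..31073412")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the locus spans 5598 bp, which is longer than the 2733-nt CDS listed further down.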


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | e value: 2.5e-273 | % identity: 94.56
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTG
        VITR EFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE  MDFQAASDAIKCRAF+IALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFA
        SARLWYRRLPA  ISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATFA
Subjt:  SARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEK
        EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAEN PTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEK

Query:  LRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAA
        LRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP R DRPAVINTIFGGPS GQSG KRKELARAA
Subjt:  LRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAA

Query:  RREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVT
        RREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDH+VV RVL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGESVIPEG IDLPVT
Subjt:  RREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVT

Query:  LGQDQTQVTQMAEFV
        LGQDQTQVTQMAEFV
Subjt:  LGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | e value: 1.6e-272 | % identity: 81.08
Query:  NTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAA
        N   E   + +  D VITR EFDQLRGKL+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEG MDFQAA
Subjt:  NTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAA

Query:  SDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE
        SDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADE
Subjt:  SDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE

Query:  TLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTN
         LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA N PTRSRPYERFTPTTIPI EILTN
Subjt:  TLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTN

Query:  IEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPS
        IEESGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP  RIDRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPS

Query:  EGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFS
         GQSGHKRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDH+VVRRVL+D G SANI+SL TYLALGWTRSQL KS TPLVGFS
Subjt:  EGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFS

Query:  GESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAG
         ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL  
Subjt:  GESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAG

Query:  RDGTFEFEADLPRREFAAPTEELELVPLL
        RDGT EF+A+LPRREFAAPTEELELVPLL
Subjt:  RDGTFEFEADLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] | e value: 3.3e-225 | % identity: 90.58
Query:  MCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTP
        MCYFLTGLADE LTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAEN PTRSRPYERFTP
Subjt:  MCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTP

Query:  TTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRP
        TTIPI EILTNIEESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP R DRP
Subjt:  TTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRP

Query:  AVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPS GQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDH+VVRRVL+DGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQL

Query:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG
         KSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTFEFEADLPRREFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDGT EFEADLP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGTFEFEADLPRREFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | e value: 4.2e-281 | % identity: 71.7
Query:  MVQPANSTNTADRRTLAASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPPSSGSRSENRMTRIDI
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPP HP+ SKA                         S N +T    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPPSSGSRSENRMTRIDI

Query:  REQRGSHLGPVEEEHPEDNESEGHTRQRGDLVNTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPP
                                                       VITR EFDQL+ K DAQVEALKA+CE+KE S +DGDLGE  F+SD+LEA IPP
Subjt:  REQRGSHLGPVEEEHPEDNESEGHTRQRGDLVNTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPP

Query:  KFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLR
        KFK PT+KPYDGSKDPKDYVEVFE  MDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLR
Subjt:  KFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLR

Query:  EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSG
        EYVTRF EEQLKVAHCSDDSAMCYFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS 
Subjt:  EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSG

Query:  RAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRT
        R +YRR+ +   +SRPYE +TPTTIPIFEILTNIEE+GMEKLLKRPEKLRG  E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR+
Subjt:  RAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRT

Query:  SSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDG
        +S EKKEERKR RTPP R DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID ++VRR+L+DG
Subjt:  SSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDG

Query:  GASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPN
        GASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYST N
Subjt:  GASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPN

Query:  GVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        GVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  GVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] | e value: 8.1e-224 | % identity: 76.29
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTRSRPYERFTPTTIP
        T LADETLTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+ P+RSRPYER+T +TIP
Subjt:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTRSRPYERFTPTTIP

Query:  IFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVIN
        I EILTNIEESGMEKLLKRPEKLRG LE+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPP R DRPAVIN
Subjt:  IFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVIN

Query:  TIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+ GQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVLIDG                          
Subjt:  TIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTFEFEADLP---RREFAAPTEELELVPLLSPEKQ
        ALE    R    E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGTFEFEADLP---RREFAAPTEELELVPLLSPEKQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 | e value: 1.2e-273 | % identity: 94.56
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTG
        VITR EFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE  MDFQAASDAIKCRAF+IALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFA
        SARLWYRRLPA  ISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATFA
Subjt:  SARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEK
        EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAEN PTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEK

Query:  LRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAA
        LRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP R DRPAVINTIFGGPS GQSG KRKELARAA
Subjt:  LRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAA

Query:  RREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVT
        RREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDH+VV RVL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGESVIPEG IDLPVT
Subjt:  RREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVT

Query:  LGQDQTQVTQMAEFV
        LGQDQTQVTQMAEFV
Subjt:  LGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 | e value: 7.8e-273 | % identity: 81.08
Query:  NTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAA
        N   E   + +  D VITR EFDQLRGKL+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEG MDFQAA
Subjt:  NTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAA

Query:  SDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE
        SDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADE
Subjt:  SDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE

Query:  TLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTN
         LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA N PTRSRPYERFTPTTIPI EILTN
Subjt:  TLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTPTTIPIFEILTN

Query:  IEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPS
        IEESGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP  RIDRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVINTIFGGPS

Query:  EGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFS
         GQSGHKRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDH+VVRRVL+D G SANI+SL TYLALGWTRSQL KS TPLVGFS
Subjt:  EGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFS

Query:  GESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAG
         ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL  
Subjt:  GESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAG

Query:  RDGTFEFEADLPRREFAAPTEELELVPLL
        RDGT EF+A+LPRREFAAPTEELELVPLL
Subjt:  RDGTFEFEADLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899 | e value: 1.6e-225 | % identity: 90.58
Query:  MCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTP
        MCYFLTGLADE LTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAEN PTRSRPYERFTP
Subjt:  MCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFTP

Query:  TTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRP
        TTIPI EILTNIEESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP R DRP
Subjt:  TTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRP

Query:  AVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPS GQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDH+VVRRVL+DGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQL

Query:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG
         KSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTFEFEADLPRREFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDGT EFEADLP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGTFEFEADLPRREFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC111020479 | e value: 2.0e-281 | % identity: 71.7
Query:  MVQPANSTNTADRRTLAASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPPSSGSRSENRMTRIDI
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPP HP+ SKA                         S N +T    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPPSSGSRSENRMTRIDI

Query:  REQRGSHLGPVEEEHPEDNESEGHTRQRGDLVNTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPP
                                                       VITR EFDQL+ K DAQVEALKA+CE+KE S +DGDLGE  F+SD+LEA IPP
Subjt:  REQRGSHLGPVEEEHPEDNESEGHTRQRGDLVNTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPP

Query:  KFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLR
        KFK PT+KPYDGSKDPKDYVEVFE  MDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLR
Subjt:  KFKAPTVKPYDGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLR

Query:  EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSG
        EYVTRF EEQLKVAHCSDDSAMCYFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS 
Subjt:  EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSG

Query:  RAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRT
        R +YRR+ +   +SRPYE +TPTTIPIFEILTNIEE+GMEKLLKRPEKLRG  E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR+
Subjt:  RAEYRRAENRPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRT

Query:  SSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDG
        +S EKKEERKR RTPP R DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID ++VRR+L+DG
Subjt:  SSAEKKEERKRSRTPPLRIDRPAVINTIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDG

Query:  GASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPN
        GASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYST N
Subjt:  GASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPN

Query:  GVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        GVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  GVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC111024904 | e value: 3.9e-224 | % identity: 76.29
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTRSRPYERFTPTTIP
        T LADETLTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+ P+RSRPYER+T +TIP
Subjt:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTRSRPYERFTPTTIP

Query:  IFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVIN
        I EILTNIEESGMEKLLKRPEKLRG LE+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPP R DRPAVIN
Subjt:  IFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVIN

Query:  TIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+ GQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVLIDG                          
Subjt:  TIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTFEFEADLP---RREFAAPTEELELVPLLSPEKQ
        ALE    R    E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGTFEFEADLP---RREFAAPTEELELVPLLSPEKQ
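
In the alignment blocks above, the unlabeled middle line repeats a residue where Query and Subjt are identical, shows `+` for a similar substitution, and is blank at mismatches or gaps. The sketch below counts identities in one such block; the function names are hypothetical, and the result applies only to the block passed in, so it need not reproduce the % identity reported for a whole hit.

```python
def count_identities(midline: str) -> int:
    """Count identical positions: letters in the alignment midline."""
    return sum(1 for ch in midline if ch.isalpha())


def percent_identity(query: str, midline: str) -> float:
    """Percent identity over one alignment block.

    Assumes both strings have had the 'Query:' prefix and leading
    indentation removed. The midline is padded on the right in case
    trailing blanks were lost.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    midline = midline.ljust(len(query))
    return 100.0 * count_identities(midline) / len(query)


# Last block of the XP_022150760.1 alignment shown above.
query = "RDGTFEFEADLPRREFAAPTEELELVPLL"
midline = "RDGT EF+A+LPRREFAAPTEELELVPLL"
print(count_identities(midline), len(query))
print(f"{percent_identity(query, midline):.2f}")
```

Here 26 of the 29 aligned positions are identical; the two `+` positions and the one blank are not counted.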

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAACAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGTGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGTGCCCGGGGTCCAGCCCCGGCTCCACCAAGCTCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGT
TCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACTTAGTGAACACTTCAACAGAAAGAGAGGCT
CATATCTCCGAAAAGGACAGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGTAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAA
GAAGGTTCACTGAACGATGGTGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTAT
GATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGTTTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTT
ACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTGGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTAT
GACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACAC
TGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGATGAAACCCTCACGGTAAAACTTGGAGAGGAGGCCCCAGCCACCTTCGCCGAGGTGCTT
CAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAG
GCAGATCCCAAGTCCAAGGACAAGGGATCATTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACCGACCTACCAGGAGCCGACCTTACGAACGCTTCACC
CCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCTGGAAAGGCGC
AGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGTCAAATTGAGGATCTAATTCAAGATGGCTACTTC
AAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCTGCGCATTGACCGACCTGCGGTCATCAAT
ACCATTTTCGGAGGGCCAAGCGAGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACC
TGCCCAATCACCTTTGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCATTGATTGATCATATGGTGGTCAGGAGGGTG
CTGATAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGG
TTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATTGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGAC
GGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCATTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGC
GTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGG
ACGTTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCG
TACGAGACCGACCTGGCCAGGCCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGAGCTCCAGAGTCCTCATGGATG
GACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAAACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTTGTGGTCCGAGGTGGAGCATTA
TACCGACGCGACTTTTCCCTGCCTCTATTGAGATGTCTAACCCCTGAAGAGGGCCTTGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTT
GAGGTCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCT
TGA
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAACAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGTGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGTGCCCGGGGTCCAGCCCCGGCTCCACCAAGCTCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGT
TCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACTTAGTGAACACTTCAACAGAAAGAGAGGCT
CATATCTCCGAAAAGGACAGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGTAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAA
GAAGGTTCACTGAACGATGGTGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTAT
GATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGTTTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTT
ACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTGGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTAT
GACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACAC
TGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGATGAAACCCTCACGGTAAAACTTGGAGAGGAGGCCCCAGCCACCTTCGCCGAGGTGCTT
CAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAG
GCAGATCCCAAGTCCAAGGACAAGGGATCATTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACCGACCTACCAGGAGCCGACCTTACGAACGCTTCACC
CCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCTGGAAAGGCGC
AGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGTCAAATTGAGGATCTAATTCAAGATGGCTACTTC
AAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCTGCGCATTGACCGACCTGCGGTCATCAAT
ACCATTTTCGGAGGGCCAAGCGAGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACC
TGCCCAATCACCTTTGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCATTGATTGATCATATGGTGGTCAGGAGGGTG
CTGATAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGG
TTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATTGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGAC
GGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCATTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGC
GTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGG
ACGTTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCG
TACGAGACCGACCTGGCCAGGCCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGAGCTCCAGAGTCCTCATGGATG
GACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAAACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTTGTGGTCCGAGGTGGAGCATTA
TACCGACGCGACTTTTCCCTGCCTCTATTGAGATGTCTAACCCCTGAAGAGGGCCTTGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTT
GAGGTCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCT
TGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPPSSGSRSENRMTRIDIREQRG
SHLGPVEEEHPEDNESEGHTRQRGDLVNTSTEREAHISEKDRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY
DGSKDPKDYVEVFEGFMDFQAASDAIKCRAFQIALTGSARLWYRRLPARWISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH
CSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTRSRPYERFT
PTTIPIFEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPLRIDRPAVIN
TIFGGPSEGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHMVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDG
TFEFEADLPRREFAAPTEELELVPLLSPEKQLASAYETDLARPVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQNPKERRKLARRAARFVVRGGAL
YRRDFSLPLLRCLTPEEGLVQTHVGALDPTWEGPFEVKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP
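
The protein sequence can be checked against the CDS by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption (the page does not say which translation table was used); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last five codons of the CDS listed above.
print(translate("ATGGTTCAACCAGCAAAC"))
print(translate("CGTTATTATCCTTGA"))
```

The two fragments translate to `MVQPAN` and `RYYP*`, matching the start and end of the protein sequence, with the terminal `TGA` as the stop codon. Passing the full CDS, line breaks included, to `translate` allows the same comparison over the whole sequence.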