; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g24340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g24340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:18286038..18291637
RNA-Seq ExpressionMoc09g24340
SyntenyMoc09g24340
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.7e-27293.99Show/hide
Query:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT
        G ITREEFDQLRGQLD QVEALK KCEQKEGPLND DLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+I+LT
Subjt:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT

Query:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF
        GSARLWYRRLPA SIST SQLRREFL  FSSRHYDK TATHLATIRQKEGETLR+YVTRFQEEQLKV H SDDSAMCYF TGLADEAL VKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF

Query:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE
         EVLQKAKKVIDGQ+LLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPISEILTNIEESGMEKLLKRPE
Subjt:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE LIQDGYFKKFVGKPRT+SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA

Query:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV
        ARREVCIIREQRPTC ITFDGADLEEVHLPHNDALVI PLIDHVVV RVLVDGG SANILSL TYLALGWTRSQLKKSPTPLVGFSGESVIPEG+IDLPV
Subjt:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV

Query:  TLGQDQTQVTQMVEFV
        TLGQDQTQVTQM EFV
Subjt:  TLGQDQTQVTQMVEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.4e-26976.95Show/hide
Query:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT
        G ITREEFDQLRG+L+ QVEALK KCEQKEGPLND DLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI+LT
Subjt:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT

Query:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF
        GSARLW                                                     FQE+QLKV  SSDDSAMCYF TGLADEAL VKLG+EAPATF
Subjt:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF

Query:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE
         EVLQKAKKVIDGQ+LLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPT IPISEILTNIEESGMEKLLKRPE
Subjt:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA
        KLRGAPERR+KDKYCRFHREH HNTSD WELKRQIE LIQD YFKKFVGKPRT+SAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSG KRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA

Query:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV
        ARREVCIIREQRPTC ITFD ADLEEVHLPHNDALVI PLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEG IDLPV
Subjt:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV

Query:  TLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKANLPRR
        TLG DQTQVTQM EFVVI+GRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL SRDGTLEFKANLPRR
Subjt:  TLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKANLPRR

Query:  EFAAPTEELELVPLLSPEKQLASAYETDLARSVPIEILDNPSISE--PDLMEIGAPEFSWIDPIADFI
        EFAAPTEELELVPLL  +      +E +L     +  +D+    E  P+ + +G   ++ ID ++  I
Subjt:  EFAAPTEELELVPLLSPEKQLASAYETDLARSVPIEILDNPSISE--PDLMEIGAPEFSWIDPIADFI

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]8.3e-22290.13Show/hide
Query:  MCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYF TGLADEAL VKL EEAPATF EVLQKAKKVIDGQ+LLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRP
        T IPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIE LIQDGYFKKFVGKPRT+SAEKKEERKRSRTPPRRTDRP
Subjt:  TKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQL
        AVINTIFGGPSGGQSG KRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVI PLIDHVVVRRVLVDGGASANILSL TYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEG IDLPVTLGQDQT+VTQM EFVV++GRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLASRDGTLEFKANLPRREFAAPTEELELVPLLSPEKQL
        +SVCALETL SRDGTLEF+A+LP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLASRDGTLEFKANLPRREFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]7.6e-26867.81Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARIIAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPTPTSENLDVLQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARI  PVLPPAH P+ SKA                                  
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARIIAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPTPTSENLDVLQREMEAM

Query:  RTKMRSMEEVYNEMILAAGPGSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGRITREEFDQLRGQLDTQVEALKPKCEQ
               E  YN +                                                          G ITREEFDQL+ + D QVEALK +CE+
Subjt:  RTKMRSMEEVYNEMILAAGPGSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGRITREEFDQLRGQLDTQVEALKPKCEQ

Query:  KEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQ
        KE   +D DLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQI+LTGSARLWYRRLPAR IST SQLR+EF++Q
Subjt:  KEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQ

Query:  FSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKI
        FSSRHYD+ T THLATIRQKEGETLR+YVTRF EEQLKV H SDDSAMCYF TGLADE L VKL EEAPATF EVLQK KKVIDGQ+LLRTKTGRPE+ I
Subjt:  FSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKI

Query:  GRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD
         +GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+
Subjt:  GRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD

Query:  CWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEV
         WELKRQIE LIQDGYFKKFVGKPR+NS EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQRPT SI F+ ADLE V
Subjt:  CWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEV

Query:  HLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAI
        HLPHNDALVI PLID V+VRR+LVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGES+  EG IDLPV++ QD TQVTQM EFVVI+GRSAYNAI
Subjt:  HLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAI

Query:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        FGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.0e-21674.45Show/hide
Query:  MDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFF
        MDFQAA+DAIKCRAFQI+LTGSARLWYRRLPARSIST SQLR+EF++QFSS HYD+ TATHLATIRQKE ETLR+YVTRFQEEQLKV H SDDSAMCYF 
Subjt:  MDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFF

Query:  TGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTKIP
        T LADE L VKLGEEAP TFVEVLQKAKKVIDGQ+LLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T + IP
Subjt:  TGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTKIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIE LIQDGYFKKFVGKPR+NS EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSP
        TIFGGP+GGQSG KRKELAR ARREVCIIRE +PTCSITF  ADLE VHLPHNDALVI  LIDH +VRRVL+DGG                         
Subjt:  TIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                        IDLPVT+GQD TQVTQM EFVVI+GRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLASRDGTLEFKANLP---RREFAAPTEELELVPLLSPEKQ
        ALE   +R    E +A+LP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLASRDGTLEFKANLP---RREFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.2e-27293.99Show/hide
Query:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT
        G ITREEFDQLRGQLD QVEALK KCEQKEGPLND DLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+I+LT
Subjt:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT

Query:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF
        GSARLWYRRLPA SIST SQLRREFL  FSSRHYDK TATHLATIRQKEGETLR+YVTRFQEEQLKV H SDDSAMCYF TGLADEAL VKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF

Query:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE
         EVLQKAKKVIDGQ+LLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPISEILTNIEESGMEKLLKRPE
Subjt:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE LIQDGYFKKFVGKPRT+SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA

Query:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV
        ARREVCIIREQRPTC ITFDGADLEEVHLPHNDALVI PLIDHVVV RVLVDGG SANILSL TYLALGWTRSQLKKSPTPLVGFSGESVIPEG+IDLPV
Subjt:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV

Query:  TLGQDQTQVTQMVEFV
        TLGQDQTQVTQM EFV
Subjt:  TLGQDQTQVTQMVEFV

A0A6J1D9E1 uncharacterized protein LOC1110188236.7e-27076.95Show/hide
Query:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT
        G ITREEFDQLRG+L+ QVEALK KCEQKEGPLND DLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI+LT
Subjt:  GRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLT

Query:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF
        GSARLW                                                     FQE+QLKV  SSDDSAMCYF TGLADEAL VKLG+EAPATF
Subjt:  GSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATF

Query:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE
         EVLQKAKKVIDGQ+LLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPT IPISEILTNIEESGMEKLLKRPE
Subjt:  VEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA
        KLRGAPERR+KDKYCRFHREH HNTSD WELKRQIE LIQD YFKKFVGKPRT+SAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSG KRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARA

Query:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV
        ARREVCIIREQRPTC ITFD ADLEEVHLPHNDALVI PLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEG IDLPV
Subjt:  ARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPV

Query:  TLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKANLPRR
        TLG DQTQVTQM EFVVI+GRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL SRDGTLEFKANLPRR
Subjt:  TLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKANLPRR

Query:  EFAAPTEELELVPLLSPEKQLASAYETDLARSVPIEILDNPSISE--PDLMEIGAPEFSWIDPIADFI
        EFAAPTEELELVPLL  +      +E +L     +  +D+    E  P+ + +G   ++ ID ++  I
Subjt:  EFAAPTEELELVPLLSPEKQLASAYETDLARSVPIEILDNPSISE--PDLMEIGAPEFSWIDPIADFI

A0A6J1DD03 uncharacterized protein LOC1110198994.0e-22290.13Show/hide
Query:  MCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYF TGLADEAL VKL EEAPATF EVLQKAKKVIDGQ+LLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRP
        T IPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIE LIQDGYFKKFVGKPRT+SAEKKEERKRSRTPPRRTDRP
Subjt:  TKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQL
        AVINTIFGGPSGGQSG KRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVI PLIDHVVVRRVLVDGGASANILSL TYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEG IDLPVTLGQDQT+VTQM EFVV++GRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLASRDGTLEFKANLPRREFAAPTEELELVPLLSPEKQL
        +SVCALETL SRDGTLEF+A+LP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLASRDGTLEFKANLPRREFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204793.7e-26867.81Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARIIAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPTPTSENLDVLQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARI  PVLPPAH P+ SKA                                  
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARIIAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPTPTSENLDVLQREMEAM

Query:  RTKMRSMEEVYNEMILAAGPGSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGRITREEFDQLRGQLDTQVEALKPKCEQ
               E  YN +                                                          G ITREEFDQL+ + D QVEALK +CE+
Subjt:  RTKMRSMEEVYNEMILAAGPGSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGRITREEFDQLRGQLDTQVEALKPKCEQ

Query:  KEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQ
        KE   +D DLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQI+LTGSARLWYRRLPAR IST SQLR+EF++Q
Subjt:  KEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQ

Query:  FSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKI
        FSSRHYD+ T THLATIRQKEGETLR+YVTRF EEQLKV H SDDSAMCYF TGLADE L VKL EEAPATF EVLQK KKVIDGQ+LLRTKTGRPE+ I
Subjt:  FSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKI

Query:  GRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD
         +GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+
Subjt:  GRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD

Query:  CWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEV
         WELKRQIE LIQDGYFKKFVGKPR+NS EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQRPT SI F+ ADLE V
Subjt:  CWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEV

Query:  HLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAI
        HLPHNDALVI PLID V+VRR+LVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGES+  EG IDLPV++ QD TQVTQM EFVVI+GRSAYNAI
Subjt:  HLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAI

Query:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        FGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

A0A6J1DZB9 uncharacterized protein LOC1110249045.1e-21774.45Show/hide
Query:  MDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFF
        MDFQAA+DAIKCRAFQI+LTGSARLWYRRLPARSIST SQLR+EF++QFSS HYD+ TATHLATIRQKE ETLR+YVTRFQEEQLKV H SDDSAMCYF 
Subjt:  MDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVTRFQEEQLKVTHSSDDSAMCYFF

Query:  TGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTKIP
        T LADE L VKLGEEAP TFVEVLQKAKKVIDGQ+LLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T + IP
Subjt:  TGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTKIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIE LIQDGYFKKFVGKPR+NS EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSP
        TIFGGP+GGQSG KRKELAR ARREVCIIRE +PTCSITF  ADLE VHLPHNDALVI  LIDH +VRRVL+DGG                         
Subjt:  TIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                        IDLPVT+GQD TQVTQM EFVVI+GRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLASRDGTLEFKANLP---RREFAAPTEELELVPLLSPEKQ
        ALE   +R    E +A+LP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLASRDGTLEFKANLP---RREFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCATCGCGCCTGTTCTACCACCTGCACACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGACCCCGACAAGTGAGAACTTGGACGTACTCCAGAGAGAAATGGAGGCAATGCGCACAAAAATGCGGTCCATGGAGGAAGTG
TATAACGAAATGATATTAGCTGCAGGCCCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGCTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGAAGGATTACAAGGGAGGAGTTCGACCAGCTGA
GGGGCCAGCTCGACACTCAGGTGGAGGCCTTAAAGCCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTG
GAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGC
GGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCTCGCTTACTGGTAGCGCGCGATTGTGGTATCGGAGATTGCCAGCTAGGTCGATCTCGACCAATTCTCAGCTGA
GAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCATTATGACAAAATGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGACAATATGTCACC
AGATTCCAAGAGGAGCAATTGAAGGTCACACACTCCTCCGATGACTCGGCCATGTGCTATTTTTTCACCGGTCTAGCCGACGAAGCCCTCATGGTGAAACTTGGAGAGGA
GGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAAGATCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCA
GAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGG
CCTTACGAACGCTTCACCCCGACCAAGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGC
ACCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGTATCTAATTCAAGATG
GCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAACTCGGCAGAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGTACTGACCGACCTGCGGTCATC
AATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACGTAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTG
CTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCCCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAG
ACGGAGGCGCATCTGCTAACATCCTGTCCTTATCGACCTACCTCGCCCTGGGATGGACGAGGTCACAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAA
TCGGTCATCCCAGAGGGTTACATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGTCGAGTTCGTAGTAATTGAAGGTAGATCGGCCTATAA
CGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAAC
AGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCAAAGCCAACCTACCG
AGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCAT
AGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTTCTCATGGATCGACCCGATTGCGGACTTCATTAGGGGCAATTCACCAC
AAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGAAGCGGCTTTTCCTTGCCTCTATTGAGATGCCTAACC
CCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTACAGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGC
GGAATATCAGGGCAGAATGGCCAGACATTATAACGCCCGCGTTCGACCTCGGACCTTCCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTG
ACCCGACCTGGGAGGGGCCGTTTGAGGTCAAGGGCATAGTCCGACCTGTGACGTACTTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCAGAGCAC
CTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCATCGCGCCTGTTCTACCACCTGCACACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGACCCCGACAAGTGAGAACTTGGACGTACTCCAGAGAGAAATGGAGGCAATGCGCACAAAAATGCGGTCCATGGAGGAAGTG
TATAACGAAATGATATTAGCTGCAGGCCCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGCTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGAAGGATTACAAGGGAGGAGTTCGACCAGCTGA
GGGGCCAGCTCGACACTCAGGTGGAGGCCTTAAAGCCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTG
GAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGC
GGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCTCGCTTACTGGTAGCGCGCGATTGTGGTATCGGAGATTGCCAGCTAGGTCGATCTCGACCAATTCTCAGCTGA
GAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCATTATGACAAAATGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGACAATATGTCACC
AGATTCCAAGAGGAGCAATTGAAGGTCACACACTCCTCCGATGACTCGGCCATGTGCTATTTTTTCACCGGTCTAGCCGACGAAGCCCTCATGGTGAAACTTGGAGAGGA
GGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAAGATCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCA
GAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGG
CCTTACGAACGCTTCACCCCGACCAAGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGC
ACCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGTATCTAATTCAAGATG
GCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAACTCGGCAGAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGTACTGACCGACCTGCGGTCATC
AATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACGTAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTG
CTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCCCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAG
ACGGAGGCGCATCTGCTAACATCCTGTCCTTATCGACCTACCTCGCCCTGGGATGGACGAGGTCACAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAA
TCGGTCATCCCAGAGGGTTACATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGTCGAGTTCGTAGTAATTGAAGGTAGATCGGCCTATAA
CGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAAC
AGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCAAAGCCAACCTACCG
AGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCAT
AGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTTCTCATGGATCGACCCGATTGCGGACTTCATTAGGGGCAATTCACCAC
AAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGAAGCGGCTTTTCCTTGCCTCTATTGAGATGCCTAACC
CCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTACAGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGC
GGAATATCAGGGCAGAATGGCCAGACATTATAACGCCCGCGTTCGACCTCGGACCTTCCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTG
ACCCGACCTGGGAGGGGCCGTTTGAGGTCAAGGGCATAGTCCGACCTGTGACGTACTTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCAGAGCAC
CTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARIIAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPTPTSENLDVLQREMEAMRTKMRSMEEV
YNEMILAAGPGSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGRITREEFDQLRGQLDTQVEALKPKCEQKEGPLNDDDLGESPFTSDVL
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQISLTGSARLWYRRLPARSISTNSQLRREFLTQFSSRHYDKMTATHLATIRQKEGETLRQYVT
RFQEEQLKVTHSSDDSAMCYFFTGLADEALMVKLGEEAPATFVEVLQKAKKVIDGQDLLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSR
PYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEYLIQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRTDRPAVI
NTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIPPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGE
SVIPEGYIDLPVTLGQDQTQVTQMVEFVVIEGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKANLP
RREFAAPTEELELVPLLSPEKQLASAYETDLARSVPIEILDNPSISEPDLMEIGAPEFSWIDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALYRSGFSLPLLRCLT
PEEGLVEHYEPTANEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRTFQVGHLVLRRVQTHVGALDPTWEGPFEVKGIVRPVTYLLADLKGDVLAHPWNAEH
LKRYYP