; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g24780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g24780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:17936682..17942285
RNA-Seq ExpressionMoc04g24780
SyntenyMoc04g24780
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.6e-27893.94Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCE+KEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSS+HYDKKTATHLATIRQ+EGETLREYVTRFQEEQLKVAHCS DSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTS+ WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEE KRSRTPPRRTDRPAVINTIF GPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMTEFV
        SVIPEG IDLPVTLGQDQTQVTQM EFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMTEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.7e-28080.18Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCE+KEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  S DSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEIL
        DEALTVKLG+EAPATF EVLQKAKKVIDGQELLRTKTGRPER IDRGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPT IPISEIL
Subjt:  DEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREHDHNTS+ WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEE K SRTP RR DRPAVINTIF G
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVL+D G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQM EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQLASTYETDLARSVPVEILDN
          RDGTLEF+A+L R+EFAAPTEELELVPLL  +      +E +L     +  +D+
Subjt:  AGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQLASTYETDLARSVPVEILDN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]5.2e-22490.36Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQKAKKVIDGQELLRTK G       +GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TMIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRP
        T IPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTS+ WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEE KRSRTPPRRTDRP
Subjt:  TMIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRP

Query:  AVINTIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQL
        AVINTIF GPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVL+DGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQL

Query:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG
         KSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQM EFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDGTLEFEADL  +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.9e-27566.75Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLHRSARITAPVLPPTHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREV A VVEGQGH+ L TEPL RSARIT PVLPP HP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLHRSARITAPVLPPTHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNKMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNKMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+CEKKE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSS+HYD+KT THLATIRQ+EGETLREYVTRF EEQLKVAHCS DSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIEESGMEKLLK
        TF EVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPI EILTNIEE+GMEKLLK
Subjt:  TFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+H HNTSN WELKRQIEDLIQDGYFKKFVGKPR++S EKKEE KR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+L+DGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        LPV++ QD TQVTQM EFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]4.1e-22175.55Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQ+E ETLREYVTRFQEEQLKVAHCS DSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTMIP
        T LADE LTVKLGEEAP TFVEVLQKAKKVIDGQELLRTKTGRPE++ID+ +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T + IP
Subjt:  TGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTMIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+H HNT++CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEE KRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVIN

Query:  TIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIF GP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVLIDG                          
Subjt:  TIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQM EFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADL---SRKEFAAPTEELELVPLLSPEKQ
        ALE    R    E EADL    +++F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGTLEFEADL---SRKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.8e-27893.94Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCE+KEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSS+HYDKKTATHLATIRQ+EGETLREYVTRFQEEQLKVAHCS DSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTS+ WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEE KRSRTPPRRTDRPAVINTIF GPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMTEFV
        SVIPEG IDLPVTLGQDQTQVTQM EFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMTEFV

A0A6J1D9E1 uncharacterized protein LOC1110188238.4e-28180.18Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCE+KEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  S DSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEIL
        DEALTVKLG+EAPATF EVLQKAKKVIDGQELLRTKTGRPER IDRGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPT IPISEIL
Subjt:  DEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREHDHNTS+ WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEE K SRTP RR DRPAVINTIF G
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVL+D G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQM EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQLASTYETDLARSVPVEILDN
          RDGTLEF+A+L R+EFAAPTEELELVPLL  +      +E +L     +  +D+
Subjt:  AGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQLASTYETDLARSVPVEILDN

A0A6J1DD03 uncharacterized protein LOC1110198992.5e-22490.36Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQKAKKVIDGQELLRTK G       +GRSGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TMIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRP
        T IPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTS+ WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEE KRSRTPPRRTDRP
Subjt:  TMIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRP

Query:  AVINTIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQL
        AVINTIF GPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVL+DGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQL

Query:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG
         KSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQM EFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDGTLEFEADL  +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204791.4e-27566.75Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLHRSARITAPVLPPTHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREV A VVEGQGH+ L TEPL RSARIT PVLPP HP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLHRSARITAPVLPPTHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNKMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNKMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+CEKKE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSS+HYD+KT THLATIRQ+EGETLREYVTRF EEQLKVAHCS DSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIEESGMEKLLK
        TF EVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPI EILTNIEE+GMEKLLK
Subjt:  TFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+H HNTSN WELKRQIEDLIQDGYFKKFVGKPR++S EKKEE KR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+L+DGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        LPV++ QD TQVTQM EFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249042.0e-22175.55Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQ+E ETLREYVTRFQEEQLKVAHCS DSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTMIP
        T LADE LTVKLGEEAP TFVEVLQKAKKVIDGQELLRTKTGRPE++ID+ +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T + IP
Subjt:  TGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTMIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+H HNT++CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEE KRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEEWKRSRTPPRRTDRPAVIN

Query:  TIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIF GP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVLIDG                          
Subjt:  TIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQM EFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADL---SRKEFAAPTEELELVPLLSPEKQ
        ALE    R    E EADL    +++F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGTLEFEADL---SRKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGAAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCACAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTACGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACAAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAATGAGAGCGAGGGACACACTCGCCAGAAAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGAAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCACGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCAGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAGGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCCATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGTCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGATATAGAA
AAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCC
GACCATGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGGCGCAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGACCATAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTG
GGAAAGCCTAGGACCAGCTCGGCAGAGAAAAAGGAAGAGTGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACTATTTTCGAAGGGCC
AAGCGGGGGTCAGTCTGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTA
TAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCATTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAAC
ATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTG
CATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGACCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGTAGACCCA
TCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGT
TATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGTCGAGGAAGGAGTTTGCCGCACC
CACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGACGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCT
CGATCTCAAAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCTCCACAAGACCCCAAGGAGCGCAGA
AAGTTGGCAAGGCGAGCCGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGGTGCCTAACCCCTGAAGAGGGCCTAATGGC
CAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGAACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCAT
TTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCAGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGAAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCACAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTACGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACAAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAATGAGAGCGAGGGACACACTCGCCAGAAAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGAAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCACGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCAGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAGGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCCATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGTCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGATATAGAA
AAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCC
GACCATGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGGCGCAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGACCATAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTG
GGAAAGCCTAGGACCAGCTCGGCAGAGAAAAAGGAAGAGTGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACTATTTTCGAAGGGCC
AAGCGGGGGTCAGTCTGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTA
TAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCATTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAAC
ATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTG
CATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGACCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGTAGACCCA
TCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGT
TATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGTCGAGGAAGGAGTTTGCCGCACC
CACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGACGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCT
CGATCTCAAAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCTCCACAAGACCCCAAGGAGCGCAGA
AAGTTGGCAAGGCGAGCCGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGGTGCCTAACCCCTGAAGAGGGCCTAATGGC
CAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGAACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCAT
TTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCAGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLHRSARITAPVLPPTHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMEEMY
NKMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQ
VEALKAKCEKKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
FSSQHYDKKTATHLATIRQREGETLREYVTRFQEEQLKVAHCSHDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIE
KADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTMIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSNCWELKRQIEDLIQDGYFKKFV
GKPRTSSAEKKEEWKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSIDLEEVHLPHNDALVIAPLIDHVVVRRVLIDGGASAN
ILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMTEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASREC
YASALKGSSVCALETLAGRDGTLEFEADLSRKEFAAPTEELELVPLLSPEKQLASTYETDLARSVPVEILDNPSISKPDLMEIDAPESSWMDPIADFIRGNSPQDPKERR
KLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLMARHYNARVRPRAFQVEHLVLRRVQTHVGALDPTWEGPFEVKGIVRPGTYVLADLKGDVLAHQWNAEHLKRYYP