; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g25430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g25430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:18277982..18283500
RNA-Seq ExpressionMoc03g25430
SyntenyMoc03g25430
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.1e-28696.21Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA +ISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATF EVLQK KKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGS SSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGS
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSG 
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGS

Query:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG+KRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.5e-28079.58Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATF EVLQK KKVIDGQELLRTKTGRPER I RGRSGKD EKAD KSKDKGS SSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSG QSG KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRS YN IFGRPIIHSFR IPSTLHQVLKYSTPNG G VRGEQ  SRECYASALKGSSVCA+ETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIETL

Query:  ASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL-MEIDAPEPSWMDPIVDFIRGNSPQDP
         S DGTLEF+A+LPRREFAAPTEELELVPLL  +   ++  E +  E S ++ I D I      +P
Subjt:  ASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL-MEIDAPEPSWMDPIVDFIRGNSPQDP

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]4.5e-22490.4Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQK KKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGS S+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSG QSG KRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRS YN IFGRPIIHSFR IPSTLHQVLKYSTPNG GTVRGEQT SRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKG

Query:  SSVCAIETLASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL
        +SVCA+ETL S DGTLEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  SSVCAIETLASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.2e-26380.34Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NP TP GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI
        TVKL EEAPATF EVLQKTKKVIDGQELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSG
        EE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN        
Subjt:  EESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSG

Query:  SQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
             K+KELAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  SQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIE
        ES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRS YN IFGRPIIHSFR +PSTLHQVLKYST NG GTVRGE   SRECYAS  K SSVCA+E
Subjt:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.7e-22173.75Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPAR+ISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TFVEVLQK KKVIDGQELLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+G QSG KRKELAR ARREVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRS YN IFGRPIIHSFR +PSTLHQVLKYSTPN  G VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVC

Query:  AIETLASGDGTLEFEADLP---RREFAAPTEELELVPLLSPEKQPD------LMEIDAPE
        A+E   +     E EADLP   +R+F  PTEELELVPLLSPE+Q +      ++E++AP+
Subjt:  AIETLASGDGTLEFEADLP---RREFAAPTEELELVPLLSPEKQPD------LMEIDAPE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.0e-28696.21Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA +ISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATF EVLQK KKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGS SSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGS
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSG 
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGS

Query:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG+KRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.6e-28079.58Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATF EVLQK KKVIDGQELLRTKTGRPER I RGRSGKD EKAD KSKDKGS SSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSG QSG KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRS YN IFGRPIIHSFR IPSTLHQVLKYSTPNG G VRGEQ  SRECYASALKGSSVCA+ETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIETL

Query:  ASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL-MEIDAPEPSWMDPIVDFIRGNSPQDP
         S DGTLEF+A+LPRREFAAPTEELELVPLL  +   ++  E +  E S ++ I D I      +P
Subjt:  ASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL-MEIDAPEPSWMDPIVDFIRGNSPQDP

A0A6J1DD03 uncharacterized protein LOC1110198992.2e-22490.4Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQK KKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGS S+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSG QSG KRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRS YN IFGRPIIHSFR IPSTLHQVLKYSTPNG GTVRGEQT SRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKG

Query:  SSVCAIETLASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL
        +SVCA+ETL S DGTLEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  SSVCAIETLASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDL

A0A6J1DHB3 uncharacterized protein LOC1110204791.1e-26380.34Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NP TP GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI
        TVKL EEAPATF EVLQKTKKVIDGQELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSG
        EE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN        
Subjt:  EESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSG

Query:  SQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
             K+KELAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  SQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIE
        ES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRS YN IFGRPIIHSFR +PSTLHQVLKYST NG GTVRGE   SRECYAS  K SSVCA+E
Subjt:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIE

A0A6J1DZB9 uncharacterized protein LOC1110249041.3e-22173.75Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPAR+ISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TFVEVLQK KKVIDGQELLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+G QSG KRKELAR ARREVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGSQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRS YN IFGRPIIHSFR +PSTLHQVLKYSTPN  G VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSVYNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVC

Query:  AIETLASGDGTLEFEADLP---RREFAAPTEELELVPLLSPEKQPD------LMEIDAPE
        A+E   +     E EADLP   +R+F  PTEELELVPLLSPE+Q +      ++E++AP+
Subjt:  AIETLASGDGTLEFEADLP---RREFAAPTEELELVPLLSPEKQPD------LMEIDAPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGTACGGGCGACGTACATTGCCGAGTCAAGTGAGAATTCCGCTGTAAGTGAAGCTCGGTCGAGGAGGAAAGAACCTCGCCTAAGGTTTGACGAAGCCACAAAGGG
GAGTGAAAAGTGGATTGGAGCTAGAGTTCCTCGTAAGGCTGGCCATGGCGCGGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCC
ATCTCGGCCCAGCCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTC
CGAAAAGGACAGTCACCATCCCGCTCAAACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCA
GCTGAGAGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACG
TTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGATTTC
CAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGACGATCTCGACCTACTCTCA
GCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACATTATGATAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATG
TCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGA
GAGGAGGCCCCGGCCACCTTCGTCGAGGTGCTTCAAAAGACGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCG
GGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCCTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGAA
GCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGG
GGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACAGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCA
AGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGG
TTATCAATACCATTTTCGGAGGGCCAAGCGGGAGTCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCG
ACCTGCCCAATCACCTTCGACAGTGCAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGTT
GGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTG
GAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGTC
TATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGG
AGAACAGACCGATTCGAGGGAGTGTTATGCCTCCGCGCTCAAAGGCTCATCTGTCTGCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACC
TGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAACCAGATCTGATGGAGATCGATGCTCCAGAGCCCTCATGG
ATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGGAAGTTGGCAAGGCAAGCAGCTCGGTTCGTGGTCCGAGTCGGACATCTGGT
CTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTGAGGTCAACGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAG
GAGACGTCCTCGCGCACCCGTGGAACCCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACGTACGGGCGACGTACATTGCCGAGTCAAGTGAGAATTCCGCTGTAAGTGAAGCTCGGTCGAGGAGGAAAGAACCTCGCCTAAGGTTTGACGAAGCCACAAAGGG
GAGTGAAAAGTGGATTGGAGCTAGAGTTCCTCGTAAGGCTGGCCATGGCGCGGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCC
ATCTCGGCCCAGCCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTC
CGAAAAGGACAGTCACCATCCCGCTCAAACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCA
GCTGAGAGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACG
TTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGATTTC
CAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGACGATCTCGACCTACTCTCA
GCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACATTATGATAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATG
TCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGA
GAGGAGGCCCCGGCCACCTTCGTCGAGGTGCTTCAAAAGACGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCG
GGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCCTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGAA
GCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGG
GGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACAGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCA
AGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGG
TTATCAATACCATTTTCGGAGGGCCAAGCGGGAGTCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCG
ACCTGCCCAATCACCTTCGACAGTGCAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGTT
GGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTG
GAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGTC
TATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGG
AGAACAGACCGATTCGAGGGAGTGTTATGCCTCCGCGCTCAAAGGCTCATCTGTCTGCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACC
TGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAACCAGATCTGATGGAGATCGATGCTCCAGAGCCCTCATGG
ATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGGAAGTTGGCAAGGCAAGCAGCTCGGTTCGTGGTCCGAGTCGGACATCTGGT
CTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTGAGGTCAACGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAG
GAGACGTCCTCGCGCACCCGTGGAACCCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MHVRATYIAESSENSAVSEARSRRKEPRLRFDEATKGSEKWIGARVPRKAGHGAGSRSENRVTRVGIREQRGSHLGPAEEEHPEDNESEGHTRQRGDLREHLNRKRGSSL
RKGQSPSRSNRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF
QAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG
EEAPATFVEVLQKTKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSLSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLR
GAPERRSKDKYCRFHREHSHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGSQSGQKRKELARAARREVCIIREQRP
TCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSV
YNTIFGRPIIHSFRVIPSTLHQVLKYSTPNGEGTVRGEQTDSRECYASALKGSSVCAIETLASGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQPDLMEIDAPEPSW
MDPIVDFIRGNSPQDPKERRKLARQAARFVVRVGHLVLRRVQTHVGALDPTWEGPFEVNGIVRPGTYILADLKGDVLAHPWNPEHLKRYYP