; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g34720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g34720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:24581867..24587447
RNA-Seq ExpressionMoc01g34720
SyntenyMoc01g34720
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.4e-23984.66Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAK EQK+  LNDGDLGESP TSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-------------

Query:  ----RAIVVSETAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
            RA  ++ T S             STYSQLR EFLA FSSRHYDKKTAT+LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  ----RAIVVSETAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGR E RRAE+GPT+SRPYERFTPTTIPISE LTNIE
Subjt:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP+TSSAEKKEERK SRTPP+RTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE
        QSG KRKELARAA+REVCIIREQ PTCPIT D A LEEVHLP+ND LVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFS E
Subjt:  QSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE

Query:  SVIPEGCIDLPVTLGQDRTRVSQMAEFV
        SVIPEG IDLPVTLGQD+T+V+QMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVSQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.2e-25179.27Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVERAIVVSETA
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAK EQK+  LNDGDLGESP TSDVLE        APTVK YDG+KDPKDYVE  +      
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVERAIVVSETA

Query:  SQVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVI
         Q  S   + R+  +A   S                            FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATF EVLQKAKKVI
Subjt:  SQVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVI

Query:  DGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKD
        DGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGR E RRA +GPT+SRPYERFTPTTIPISE LTNIEESGMEKLLKRPEKLRGAPERR+KD
Subjt:  DGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQG
        KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP+TSSAEKKEERK SRTP +R DRPAVINTIFGGPSGGQSGHKRKELARAA+REVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQG

Query:  PTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQM
        PTCPIT D A LEEVHLP+ND LVIAPLIDHVVV+RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFSRESVIPEGCIDLPVTLG D+T+V+QM
Subjt:  PTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQM

Query:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELV
        AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY T NGVG VRGEQ ASRECYASALKGSSVCALETL  RDG LEF+A+LPR+EFAAPTEELELV
Subjt:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELV

Query:  PLL
        PLL
Subjt:  PLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.3e-21788.34Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GR E RRAE+GPT+SRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTP

Query:  TTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRP
        TTIPISE LTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP+TSSAEKKEERK SRTPP+RTDRP
Subjt:  TTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAA+REVCIIREQ PTCPIT D A L EVHLP+ND LVIAPLIDHVVV+RVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKG
        K+SPTPLVGFS ESV+PEGCIDLPVTLGQD+TRV+QMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY T NGVGTVRGEQTASRECYAS LKG
Subjt:  KRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.3e-23459.87Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVAAAAGEGQGHDGLATEPLRRTARITAPALPLVHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREV A   EGQGH+ L TEPL R+ARIT P LP  HP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVAAAAGEGQGHDGLATEPLRRTARITAPALPLVHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEIYNKMMLAAGAGSRSENRVTRVDVREQRCSHLGPAEEERPEDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRTMEEIYNKMMLAAGAGSRSENRVTRVDVREQRCSHLGPAEEERPEDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-----------------RAIVVSE
         G+ITREEFDQL+ + DAQVEALKA+ E+K+ S +DGDLGE   +SD+LEA IPPKFK PT+KPYDG+KDPKDYVE                  A  ++ 
Subjt:  AGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-----------------RAIVVSE

Query:  TAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        T S          ++ STYSQLR EF++QFSSRHYD+KT T+LATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKR
        F EVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R + RR+ S   +SRPYE +TPTTIPI E LTNIEE+GMEKLLKR
Subjt:  FVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP+++S EKKEERK  RTPP+R DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL
        R A+REVCIIREQ PT  I  + A LE VHLP+ND LVIAPLID V+V+R+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFS ES+  EGCIDL
Subjt:  RAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL

Query:  PVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        PV++ QD T+V+QMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TLNGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]5.5e-19672.89Show/hide
Query:  STYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQE
        STYSQLR EF++QFSS HYD+KTAT+LATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TFVEVLQKAKKVIDGQE
Subjt:  STYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQE

Query:  LLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKY
        LLRTKTGRPE++I + +  +++R AD KS+DKGS SS  RTE RR ESGP++SRPYER+T +TIPISE LTNIEESGMEKLLKRPEKLRG  E+R+K+KY
Subjt:  LLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKY

Query:  CRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPT
        CRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP+++S EKKEERK SRTPP+R DRPAVINTIFGGP+GGQSG+KRKELAR A+REVCIIRE  PT
Subjt:  CRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPT

Query:  CPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAE
        C IT  DA LE VHLP+ND LVIA LIDH +V+RVL+DG                                        GCIDLPVT+GQD T+V+QMAE
Subjt:  CPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAE

Query:  FVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLP---RKEFAAPTEELEL
        FVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T N VG VRGEQ  SRECYASALKGS+VCALE    R    E EADLP   +++F  PTEELEL
Subjt:  FVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLP---RKEFAAPTEELEL

Query:  VPLLSPEKQ
        VPLLSPE+Q
Subjt:  VPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.6e-24084.66Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAK EQK+  LNDGDLGESP TSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-------------

Query:  ----RAIVVSETAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
            RA  ++ T S             STYSQLR EFLA FSSRHYDKKTAT+LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  ----RAIVVSETAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGR E RRAE+GPT+SRPYERFTPTTIPISE LTNIE
Subjt:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP+TSSAEKKEERK SRTPP+RTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE
        QSG KRKELARAA+REVCIIREQ PTCPIT D A LEEVHLP+ND LVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFS E
Subjt:  QSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE

Query:  SVIPEGCIDLPVTLGQDRTRVSQMAEFV
        SVIPEG IDLPVTLGQD+T+V+QMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVSQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.8e-25279.27Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVERAIVVSETA
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAK EQK+  LNDGDLGESP TSDVLE        APTVK YDG+KDPKDYVE  +      
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVERAIVVSETA

Query:  SQVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVI
         Q  S   + R+  +A   S                            FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATF EVLQKAKKVI
Subjt:  SQVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVI

Query:  DGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKD
        DGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGR E RRA +GPT+SRPYERFTPTTIPISE LTNIEESGMEKLLKRPEKLRGAPERR+KD
Subjt:  DGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQG
        KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP+TSSAEKKEERK SRTP +R DRPAVINTIFGGPSGGQSGHKRKELARAA+REVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQG

Query:  PTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQM
        PTCPIT D A LEEVHLP+ND LVIAPLIDHVVV+RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFSRESVIPEGCIDLPVTLG D+T+V+QM
Subjt:  PTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQM

Query:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELV
        AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY T NGVG VRGEQ ASRECYASALKGSSVCALETL  RDG LEF+A+LPR+EFAAPTEELELV
Subjt:  AEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELV

Query:  PLL
        PLL
Subjt:  PLL

A0A6J1DD03 uncharacterized protein LOC1110198991.6e-21788.34Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GR E RRAE+GPT+SRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRTECRRAESGPTKSRPYERFTP

Query:  TTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRP
        TTIPISE LTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP+TSSAEKKEERK SRTPP+RTDRP
Subjt:  TTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAA+REVCIIREQ PTCPIT D A L EVHLP+ND LVIAPLIDHVVV+RVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKG
        K+SPTPLVGFS ESV+PEGCIDLPVTLGQD+TRV+QMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY T NGVGTVRGEQTASRECYAS LKG
Subjt:  KRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204796.4e-23559.87Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVAAAAGEGQGHDGLATEPLRRTARITAPALPLVHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREV A   EGQGH+ L TEPL R+ARIT P LP  HP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVAAAAGEGQGHDGLATEPLRRTARITAPALPLVHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEIYNKMMLAAGAGSRSENRVTRVDVREQRCSHLGPAEEERPEDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRTMEEIYNKMMLAAGAGSRSENRVTRVDVREQRCSHLGPAEEERPEDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-----------------RAIVVSE
         G+ITREEFDQL+ + DAQVEALKA+ E+K+ S +DGDLGE   +SD+LEA IPPKFK PT+KPYDG+KDPKDYVE                  A  ++ 
Subjt:  AGIITREEFDQLRGELDAQVEALKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVE-----------------RAIVVSE

Query:  TAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        T S          ++ STYSQLR EF++QFSSRHYD+KT T+LATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TAS----------QVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKR
        F EVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R + RR+ S   +SRPYE +TPTTIPI E LTNIEE+GMEKLLKR
Subjt:  FVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP+++S EKKEERK  RTPP+R DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL
        R A+REVCIIREQ PT  I  + A LE VHLP+ND LVIAPLID V+V+R+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFS ES+  EGCIDL
Subjt:  RAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL

Query:  PVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        PV++ QD T+V+QMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TLNGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249042.6e-19672.89Show/hide
Query:  STYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQE
        STYSQLR EF++QFSS HYD+KTAT+LATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TFVEVLQKAKKVIDGQE
Subjt:  STYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQE

Query:  LLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKY
        LLRTKTGRPE++I + +  +++R AD KS+DKGS SS  RTE RR ESGP++SRPYER+T +TIPISE LTNIEESGMEKLLKRPEKLRG  E+R+K+KY
Subjt:  LLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRTECRRAESGPTKSRPYERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKY

Query:  CRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPT
        CRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP+++S EKKEERK SRTPP+R DRPAVINTIFGGP+GGQSG+KRKELAR A+REVCIIRE  PT
Subjt:  CRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQGPT

Query:  CPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAE
        C IT  DA LE VHLP+ND LVIA LIDH +V+RVL+DG                                        GCIDLPVT+GQD T+V+QMAE
Subjt:  CPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVSQMAE

Query:  FVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLP---RKEFAAPTEELEL
        FVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T N VG VRGEQ  SRECYASALKGS+VCALE    R    E EADLP   +++F  PTEELEL
Subjt:  FVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLP---RKEFAAPTEELEL

Query:  VPLLSPEKQ
        VPLLSPE+Q
Subjt:  VPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGCAGCAGCAGCGGGAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGACGGCACGGATCACCGCGCCTGCCCTACCGCTTGTGCACCCGAGGACGTCCAAGGCCATCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAAAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATATAT
AACAAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCAGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGATGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCACCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAAAGTGAGCAGAAAGACGATTCACTAAACGATGGCGACTTGGGAGAATCGCCTTTAACCTCTGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGCGTGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATTCGACCTACTCTCAGCTGA
GAAGCGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCTATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACC
AGATTCCAGGAGGAGCAATTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAAGA
GGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGTCGGGGCA
GAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAACTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAAGAGCCGACCT
TACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGACCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCC
GGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCT
ACTTCAAGAAGTTTGTGGGAAAACCCAAGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCATTCAAGGACGCCACCCCAGCGCACCGACCGACCTGCGGTCATCAAC
ACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAAGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCC
AATTACCATCGACGATGCACACTTAGAGGAGGTCCACCTGCCCAACAATGATACACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAAGAGAGTGCTGGTAGACG
GGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTAGAGAATCG
GTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCTCTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGC
CATCTTTGGGAGACCCATCATCCACTCATTTCGAGCCATTCCCTCAACACTGCATCAAGTTTTGAAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGA
CCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAGG
AAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGA
GATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATTGCGGACTTCATTAGGGGCAACTCACCACACGACCCCGAGG
AGCACAGAAAGTTGGCACGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGCCCTTGATCCGGCCTGGGAGGGCCTGTTTGAGATCAAGGGCATA
GTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGCAGCAGCAGCGGGAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGACGGCACGGATCACCGCGCCTGCCCTACCGCTTGTGCACCCGAGGACGTCCAAGGCCATCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAAAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATATAT
AACAAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCAGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGATGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCACCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAAAGTGAGCAGAAAGACGATTCACTAAACGATGGCGACTTGGGAGAATCGCCTTTAACCTCTGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGCGTGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATTCGACCTACTCTCAGCTGA
GAAGCGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCTATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACC
AGATTCCAGGAGGAGCAATTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAAGA
GGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGTCGGGGCA
GAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAACTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAAGAGCCGACCT
TACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGACCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCC
GGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCT
ACTTCAAGAAGTTTGTGGGAAAACCCAAGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCATTCAAGGACGCCACCCCAGCGCACCGACCGACCTGCGGTCATCAAC
ACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAAGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCC
AATTACCATCGACGATGCACACTTAGAGGAGGTCCACCTGCCCAACAATGATACACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAAGAGAGTGCTGGTAGACG
GGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTAGAGAATCG
GTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCTCTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGC
CATCTTTGGGAGACCCATCATCCACTCATTTCGAGCCATTCCCTCAACACTGCATCAAGTTTTGAAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGA
CCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAGG
AAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGA
GATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATTGCGGACTTCATTAGGGGCAACTCACCACACGACCCCGAGG
AGCACAGAAAGTTGGCACGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGCCCTTGATCCGGCCTGGGAGGGCCTGTTTGAGATCAAGGGCATA
GTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCATGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVAAAAGEGQGHDGLATEPLRRTARITAPALPLVHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRTMEEIY
NKMMLAAGAGSRSENRVTRVDVREQRCSHLGPAEEERPEDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKSEQKDDSLNDGDLGESPLTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVERAIVVSETASQVDSTYSQLRSEFLAQFSSRHYDKKTATYLATIRQKEGETLREYVT
RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRTECRRAESGPTKSRP
YERFTPTTIPISETLTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKHSRTPPQRTDRPAVIN
TIFGGPSGGQSGHKRKELARAAKREVCIIREQGPTCPITIDDAHLEEVHLPNNDTLVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRES
VIPEGCIDLPVTLGQDRTRVSQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPR
KEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWIADFIRGNSPHDPEEHRKLARRAARFVVRDGALYRRALDPAWEGLFEIKGI
VRPGTYILADLKGDVLAHPWNAEHLKRYYP