; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g29330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g29330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr11:21368943..21374977
RNA-Seq ExpressionMoc11g29330
SyntenyMoc11g29330
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.1e-27192.42Show/hide
Query:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSD
        +AESS N   P G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQA SD
Subjt:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSD

Query:  AIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEAL
        AIKCRAF+IA+TGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATI QKEGETLREYVTRFQEEQLK A+CSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSS RAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAAR EVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLV+GG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.2e-27380.95Show/hide
Query:  SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNP    G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  ATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLA
        A SDAIKCRAFQIA+TGSARLW                                                     FQE+QLK A  SDDSAMCYFLTGLA
Subjt:  ATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSS RAE RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGA ERR+KDKYCRF+REH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAAR EVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLV+ G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLA
        S ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG+G VRGEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLA

Query:  GRDGALEFEADLPRKEFAAPTEELELVPLL
         RDG LEF+A+LPR+EFAAPTEELELVPLL
Subjt:  GRDGALEFEADLPRKEFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]7.5e-22390.81Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+ RAE RRAE+GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAAR EVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLV+GGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG+GTVRGEQTASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]7.9e-25777.3Show/hide
Query:  PSRRSSNQQAESSHNPV--GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLM
        P       +AESS+NP+  G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LM
Subjt:  PSRRSSNQQAESSHNPV--GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLM

Query:  DFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLT
        DFQA +DAIKC AFQIA+TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATI QKEGETLREYVTRF EEQLK A+CSDDSAMCYFLT
Subjt:  DFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLT

Query:  GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSSRAECRRAESGPTRSRPYERFTPTTIPI
        GLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SSSR + RR+ S   +SRPYE +TPTTIPI
Subjt:  GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSSRAECRRAESGPTRSRPYERFTPTTIPI

Query:  SEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT
         EILTNIEE+GMEKLLKRPEKLRG  E+R+ DKYCRF+R+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN 
Subjt:  SEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT

Query:  IFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPT
                    K+KELAR AR EVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LV+GGASANILSL TYLALGWTRSQLK+SPT
Subjt:  IFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPT

Query:  PLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCA
        PLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NG+GTVRGE   SRECYAS  K SSVCA
Subjt:  PLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCA

Query:  LETLAGRD
        LE    RD
Subjt:  LETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]7.7e-22075.37Show/hide
Query:  MDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFL
        MDFQA +DAIKCRAFQIA+TGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATI QKE ETLREYVTRFQEEQLK A+CSDDSAMCYFL
Subjt:  MDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-SRAECRRAESGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R AD KS+DKGS SS SR E RR ESGP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-SRAECRRAESGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG LE+R+K+KYCRF+R+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL++G                          
Subjt:  TIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN +G VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGALEFEADLP---RKEFAAPTEELELVPLLSPEKQ
        ALE    R    E EADLP   +++F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGALEFEADLP---RKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.5e-27292.42Show/hide
Query:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSD
        +AESS N   P G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQA SD
Subjt:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSD

Query:  AIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEAL
        AIKCRAF+IA+TGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATI QKEGETLREYVTRFQEEQLK A+CSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSS RAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAAR EVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLV+GG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.9e-27480.95Show/hide
Query:  SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNP    G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  ATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLA
        A SDAIKCRAFQIA+TGSARLW                                                     FQE+QLK A  SDDSAMCYFLTGLA
Subjt:  ATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSS RAE RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGA ERR+KDKYCRF+REH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAAR EVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLV+ G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLA
        S ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG+G VRGEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLA

Query:  GRDGALEFEADLPRKEFAAPTEELELVPLL
         RDG LEF+A+LPR+EFAAPTEELELVPLL
Subjt:  GRDGALEFEADLPRKEFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198993.6e-22390.81Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+ RAE RRAE+GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAAR EVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLV+GGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG+GTVRGEQTASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204793.8e-25777.3Show/hide
Query:  PSRRSSNQQAESSHNPV--GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLM
        P       +AESS+NP+  G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LM
Subjt:  PSRRSSNQQAESSHNPV--GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLM

Query:  DFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLT
        DFQA +DAIKC AFQIA+TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATI QKEGETLREYVTRF EEQLK A+CSDDSAMCYFLT
Subjt:  DFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLT

Query:  GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSSRAECRRAESGPTRSRPYERFTPTTIPI
        GLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SSSR + RR+ S   +SRPYE +TPTTIPI
Subjt:  GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSSRAECRRAESGPTRSRPYERFTPTTIPI

Query:  SEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT
         EILTNIEE+GMEKLLKRPEKLRG  E+R+ DKYCRF+R+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN 
Subjt:  SEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT

Query:  IFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPT
                    K+KELAR AR EVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LV+GGASANILSL TYLALGWTRSQLK+SPT
Subjt:  IFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPT

Query:  PLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCA
        PLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NG+GTVRGE   SRECYAS  K SSVCA
Subjt:  PLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCA

Query:  LETLAGRD
        LE    RD
Subjt:  LETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249043.7e-22075.37Show/hide
Query:  MDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFL
        MDFQA +DAIKCRAFQIA+TGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATI QKE ETLREYVTRFQEEQLK A+CSDDSAMCYFL
Subjt:  MDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-SRAECRRAESGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R AD KS+DKGS SS SR E RR ESGP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-SRAECRRAESGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG LE+R+K+KYCRF+R+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL++G                          
Subjt:  TIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN +G VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGALEFEADLP---RKEFAAPTEELELVPLLSPEKQ
        ALE    R    E EADLP   +++F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDGALEFEADLP---RKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGCGCAAGTGTTCAGATCGGCCCGGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTC
TTGCATAAACATTTGGCGCTGTCTGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTG
ATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGA
GTGACGCGCGTGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCT
CAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGG
AGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTC
ACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCT
CATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGATTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGA
CCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGCCAGAAGGAGGGTGAAACGCTG
CGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGCTGCATACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCTCTCACGGT
GAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAA
AGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCT
ACCAGGAGTCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAA
GCTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATC
TAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGA
CCTGCGGTCATCAACACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCA
GGAGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGA
GAGTGCTGGTAAACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCACAATTGAAGAGAAGCCCGACACCGCTGGTTGGG
TTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAG
ATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGG
TCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAG
GCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACTGACCTGGCCAG
GTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGG
GCAACTCACCACAAGACCCCAAGGTGCGCAGAAAGTTGGCACGGCGGGCAGCTCGAGTAGAGCATTTCGAACCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGAC
TTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACA
TCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATC
TGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCAGAAATGCAAAAAGGATTTTCAATGAATCTGTAAAGACTGTTCCAAAAGAA
TTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGCGCAAGTGTTCAGATCGGCCCGGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTC
TTGCATAAACATTTGGCGCTGTCTGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTG
ATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGA
GTGACGCGCGTGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCT
CAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGG
AGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTC
ACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCT
CATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGATTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGA
CCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGCCAGAAGGAGGGTGAAACGCTG
CGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGCTGCATACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCTCTCACGGT
GAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAA
AGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCT
ACCAGGAGTCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAA
GCTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATC
TAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGA
CCTGCGGTCATCAACACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCA
GGAGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGA
GAGTGCTGGTAAACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCACAATTGAAGAGAAGCCCGACACCGCTGGTTGGG
TTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAG
ATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGG
TCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAG
GCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACTGACCTGGCCAG
GTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGG
GCAACTCACCACAAGACCCCAAGGTGCGCAGAAAGTTGGCACGGCGGGCAGCTCGAGTAGAGCATTTCGAACCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGAC
TTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACA
TCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATC
TGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCAGAAATGCAAAAAGGATTTTCAATGAATCTGTAAAGACTGTTCCAAAAGAA
TTATGA
Protein sequenceShow/hide protein sequence
MLSMRAEVNLAQVRASVQIGPEAEFELQSEIRCCAYSCINIWRCLTSKATRGRGGTSKKGARGPTPTPTSENFDALQREMEAMRTQMRTMEEMYNEMMLAAGAGSRSENR
VTRVQRGSHLGPVEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRRSSNQQAESSHNPVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPF
TSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETL
REYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGP
TRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDR
PAVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVG
FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFE
ADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKVRRKLARRAARVEHFEPTANEEELLLNLD
LLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYPRNAKRIFNESVKTVPKE
L