; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g23860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g23860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr5:17018289..17023848
RNA-Seq ExpressionMoc05g23860
SyntenyMoc05g23860
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.9e-25087.12Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVT+FQEEQLKVAHCSDDSAMCYFLT LADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGV
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTI  G +G 
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGV

Query:  NLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGE
            K +                           +LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKK PTPLVGFSGE
Subjt:  NLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]4.5e-24972.5Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLT LA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQ
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTI  G 
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQ

Query:  AGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGF
        +G     K +                           +LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKK  TPLVGF
Subjt:  AGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYA+A K SSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMKIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  PE + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMKIG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.4e-19783.63Show/hide
Query:  MCYFLTSLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLT LADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTSLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTILEGQAGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTI  G +G     K +                           +L EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTILEGQAGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKS
        KK PTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGEQTASRECYA+  K 
Subjt:  KKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKS

Query:  SSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]7.9e-26264.86Show/hide
Query:  MVQPANSTNTTDRRALAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSKNFDALQREMEAMR
        MVQPANSTNT DRRALAA+  HQREVGA  VEGQGH+ L TEPL RSARITTP LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRALAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSKNFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEECPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEECPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT+F EEQLKVAHCSDDSAMCYFLT LADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPIFEILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN--------------TILEG
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN               I E 
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN--------------TILEG

Query:  QAGVNLDIKERNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ
        +   ++     +LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKK PTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  QAGVNLDIKERNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ

Query:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALE--TLRD
        MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALE--TLRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]2.3e-20895.47Show/hide
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKT THLATIRQKEGETLREYVT+FQEEQLKVAHCSDDSAMCYFLT LADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGVNLDIKERNL
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTI  G +G  L  K + L
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGVNLDIKERNL

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.3e-25087.12Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVT+FQEEQLKVAHCSDDSAMCYFLT LADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGV
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTI  G +G 
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGV

Query:  NLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGE
            K +                           +LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKK PTPLVGFSGE
Subjt:  NLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.2e-24972.5Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLT LA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQ
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTI  G 
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQ

Query:  AGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGF
        +G     K +                           +LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKK  TPLVGF
Subjt:  AGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYA+A K SSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMKIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  PE + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PELMKIG

A0A6J1DD03 uncharacterized protein LOC1110198996.6e-19883.63Show/hide
Query:  MCYFLTSLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLT LADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTSLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTILEGQAGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTI  G +G     K +                           +L EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTILEGQAGVNLDIKER---------------------------NLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKS
        KK PTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGEQTASRECYA+  K 
Subjt:  KKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKS

Query:  SSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204793.8e-26264.86Show/hide
Query:  MVQPANSTNTTDRRALAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSKNFDALQREMEAMR
        MVQPANSTNT DRRALAA+  HQREVGA  VEGQGH+ L TEPL RSARITTP LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRALAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSKNFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEECPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEECPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT+F EEQLKVAHCSDDSAMCYFLT LADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPIFEILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN--------------TILEG
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN               I E 
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN--------------TILEG

Query:  QAGVNLDIKERNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ
        +   ++     +LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKK PTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  QAGVNLDIKERNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ

Query:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALE--TLRD
        MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYAAAFKSSSVCALE--TLRD

A0A6J1DS95 uncharacterized protein LOC1110234211.1e-20895.47Show/hide
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKT THLATIRQKEGETLREYVT+FQEEQLKVAHCSDDSAMCYFLT LADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGVNLDIKERNL
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTI  G +G  L  K + L
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGVNLDIKERNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGGCTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCACGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACTTCTAAGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAACAAGCAAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGATGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAAGAATGTCCTGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGACAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGAGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGTGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAGTATGTCACCAAATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCAGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACC
GGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCGGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTAT
CGAAGGGCGGAGAACGGACCTACTAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGATTCTGGAATG
GAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGAC
TGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGA
AAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTGGAGGGCCAAGCGGGGGTCAATCTGGACATAAAAGAAAGGAACTTG
GAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGCGGCGCATCCGCTAACATC
CTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAAATGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGT
TGCATCGACTTGCCTGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGG
AGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGGAGAACAGACCGCT
TCGAGGGAGTGCTATGCCGCCGCATTCAAAAGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAG
ATCCTAGATAATCCCTCCATCTTAGAGCCAGAACTGATGAAGATCGGCGCTTCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACCCACCA
CAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTG
GAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGTGGAATATCAGGGCAAAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACAT
CTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGGGACGTACATATTGACCGATCTGAAAGGAGACGTCCTCGCG
CACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGGCTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCACGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACTTCTAAGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAACAAGCAAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGATGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAAGAATGTCCTGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGACAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGAGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGTGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAGTATGTCACCAAATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCAGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACC
GGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCGGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTAT
CGAAGGGCGGAGAACGGACCTACTAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGATTCTGGAATG
GAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGAC
TGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGA
AAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTGGAGGGCCAAGCGGGGGTCAATCTGGACATAAAAGAAAGGAACTTG
GAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGCGGCGCATCCGCTAACATC
CTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAAATGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGT
TGCATCGACTTGCCTGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGG
AGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGGAGAACAGACCGCT
TCGAGGGAGTGCTATGCCGCCGCATTCAAAAGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAG
ATCCTAGATAATCCCTCCATCTTAGAGCCAGAACTGATGAAGATCGGCGCTTCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACCCACCA
CAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTG
GAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGTGGAATATCAGGGCAAAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACAT
CTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGGGACGTACATATTGACCGATCTGAAAGGAGACGTCCTCGCG
CACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRALAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSKNFDALQREMEAMRTQMRS
MEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEECPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSIS
TYSQLRREFLAQFSSRHYDKKTVTHLATIRQKEGETLREYVTKFQEEQLKVAHCSDDSAMCYFLTSLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT
GRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD
CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTILEGQAGVNLDIKERNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI
LSLPTYLALGWTRSQLKKCPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTA
SRECYAAAFKSSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPELMKIGASESSWMDPIADFIRGNPP
QDPKERRKLARRAARVEHYEPTTNEDGLLLNLDLLEERRAMAQLRLVEYQGKMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFGTYILTDLKGDVLA
HPWNAEHLKRYYP