CuGenDBv2

Moc04g26850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g26850
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:19730803..19736379
RNA-Seq Expression: Moc04g26850
Synteny: Moc04g26850
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
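The genome location above is given as a `chromosome:start..end` string. As a minimal sketch, the snippet below parses such a string and computes the length of the genomic span. The helper names `parse_location` and `span_length` are hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("chr4:19730803..19736379")
print(loc.chrom, span_length(loc))
```

If the coordinates were instead 0-based and half-open, the `+ 1` would have to be dropped.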


Homology
GenBank top hits: e value, % identity, alignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 1.9e-247; % identity: 86.74)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LN+GDL ESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKD E  D KSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRF+REHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT+FGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGG

Query:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIRE                                     RVLVD G SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 1.9e-247; % identity: 75.4)
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LN+GDL ESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I R RSGKDE+ DLKSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRF+REH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINT+FGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGP

Query:  SGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIRE                                     RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYN IFGRPIIHSFR IPSTLHQ LKY TPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -RDGTPEFEADLPRKEFAAPTEELELVPLL
         RDGT EF+A+LPR+EFAAPTEELELVPLL
Subjt:  -RDGTPEFEADLPRKEFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 6.7e-245; % identity: 61.27)
Query:  MVPPANSTNTTDRRTLATSDAHQREVGAAAVEGQSHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR
        MV PANSTNT DRR LA +  HQREVGA  VEGQ H+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVPPANSTNTTDRRTLATSDAHQREVGAAAVEGQSHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR

Query:  AQMRSMEAMYDEMVLAAGAGSRSENRVTRMDVREQRGFHLGPAEEERPKDNESEGYTRQRRDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  AQMRSMEAMYDEMVLAAGAGSRSENRVTRMDVREQRGFHLGPAEEERPKDNESEGYTRQRRDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S ++GDL E  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDE-RVDLKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I + R+GKD+ + D KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDE-RVDLKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRF+R+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELA

Query:  RAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIRE                                     R+LVD GASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD
        PV++ QD T+VTQMAEFVV+DGRSAYN IFGRPIIHSFR +PSTLHQ LKY T NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  PVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] (e value: 1.9e-215; % identity: 96.54)
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLN+GDL ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKD ER D KSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRF+REHGHNTSD WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT+FGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia] (e value: 8.0e-206; % identity: 86.91)
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTK
        + KRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLN+GDL E PFTSDVLEAPIPPKFKAPTVKPYDGTK
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTK

Query:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
        DPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA

Query:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSR
        HCSDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKD ER D KSKDKGSFSSGRAEYRRAENGPT  R
Subjt:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSR

Query:  PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNIE+SGMEKLLKR EKLRGAPERR KDKYCRF+REHGHNTS+CWELKRQIEDLIQDGYFKKFVG PRTSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTVFGGPSGGQSGHKRKELARAARREVCIIRER
        PRRTDRPAVINT+FGGPSGGQS HKRK+LARAARREVCIIRE+
Subjt:  PRRTDRPAVINTVFGGPSGGQSGHKRKELARAARREVCIIRER

TrEMBL top hits: e value, % identity, alignment
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 9.1e-248; % identity: 86.74)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LN+GDL ESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKD E  D KSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRF+REHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT+FGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGG

Query:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIRE                                     RVLVD G SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 9.1e-248; % identity: 75.4)
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LN+GDL ESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I R RSGKDE+ DLKSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRF+REH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINT+FGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGP

Query:  SGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIRE                                     RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYN IFGRPIIHSFR IPSTLHQ LKY TPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -RDGTPEFEADLPRKEFAAPTEELELVPLL
         RDGT EF+A+LPR+EFAAPTEELELVPLL
Subjt:  -RDGTPEFEADLPRKEFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 3.2e-245; % identity: 61.27)
Query:  MVPPANSTNTTDRRTLATSDAHQREVGAAAVEGQSHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR
        MV PANSTNT DRR LA +  HQREVGA  VEGQ H+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVPPANSTNTTDRRTLATSDAHQREVGAAAVEGQSHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR

Query:  AQMRSMEAMYDEMVLAAGAGSRSENRVTRMDVREQRGFHLGPAEEERPKDNESEGYTRQRRDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  AQMRSMEAMYDEMVLAAGAGSRSENRVTRMDVREQRGFHLGPAEEERPKDNESEGYTRQRRDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S ++GDL E  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDE-RVDLKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I + R+GKD+ + D KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDE-RVDLKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRF+R+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELA

Query:  RAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIRE                                     R+LVD GASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIRE-------------------------------------RVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD
        PV++ QD T+VTQMAEFVV+DGRSAYN IFGRPIIHSFR +PSTLHQ LKY T NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  PVTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD

A0A6J1DS95 uncharacterized protein LOC111023421 (e value: 9.2e-216; % identity: 96.54)
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLN+GDL ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKD ER D KSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRF+REHGHNTSD WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINT+FGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

A0A6J1DXR9 uncharacterized protein LOC111025109 (e value: 3.9e-206; % identity: 86.91)
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTK
        + KRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLN+GDL E PFTSDVLEAPIPPKFKAPTVKPYDGTK
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTK

Query:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
        DPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA

Query:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSR
        HCSDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKD ER D KSKDKGSFSSGRAEYRRAENGPT  R
Subjt:  HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKD-ERVDLKSKDKGSFSSGRAEYRRAENGPTRSR

Query:  PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNIE+SGMEKLLKR EKLRGAPERR KDKYCRF+REHGHNTS+CWELKRQIEDLIQDGYFKKFVG PRTSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTVFGGPSGGQSGHKRKELARAARREVCIIRER
        PRRTDRPAVINT+FGGPSGGQS HKRK+LARAARREVCIIRE+
Subjt:  PRRTDRPAVINTVFGGPSGGQSGHKRKELARAARREVCIIRER

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGGTTCCACCAGCGAATTCAACCAATACGACAGATCGAAGGACTCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAAGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGTCCGGGGTCCAGCCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCGCACAAATGCGTTCCATGGAGGCAATGTAT
GACGAAATGGTGCTAGCTGCAGGTGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGTGAGCAAAGGGGTTTCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCAAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGAGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGAGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACAACGGCGACTTGGTAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCAGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCAATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGTTGCGAGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAAAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGCGCAGAAGTGGAAAAGATGAAAGGGTAGATCTC
AAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCC
AATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAATTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCC
GCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGG
ACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCGTTTTTGGAGGGCCAAGCGGGGGTCA
ATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATTAGGGAAAGAGTGCTGGTAGACAGGGGCGCATCCGCTAACATCCTGTCCTTAC
CGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCG
GTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCTTATAACGTCATCTTTGGGAGACCCATCATCCACTCATT
TCGGGTCATTCCCTCAACACTTCATCAATTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCAGCAC
TCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCCCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTT
GTTCCTCTGCTTAGTCCCGAGAAACAGCCAGATCTGATGGAGATCGGCACTCCAGAATCCTCATGGATGGACCCGATCACAGACTTCATTAGGGGCAACTCACCACAAGA
TCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAAATGTCTAACCCCTG
AAGAGGGCCTAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGCTACTCCTCAACCTCAACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGTGGAA
TATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCC
GGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAGAGGAGACGTCCTCGCGCACCCGTGGAACGTGGAGCACCTGA
AGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCCACCAGCGAATTCAACCAATACGACAGATCGAAGGACTCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAAGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGTCCGGGGTCCAGCCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCGCACAAATGCGTTCCATGGAGGCAATGTAT
GACGAAATGGTGCTAGCTGCAGGTGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGTGAGCAAAGGGGTTTCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCAAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGAGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGAGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACAACGGCGACTTGGTAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCAGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCAATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGTTGCGAGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAAAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGCGCAGAAGTGGAAAAGATGAAAGGGTAGATCTC
AAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCC
AATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAATTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCC
GCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGG
ACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCGTTTTTGGAGGGCCAAGCGGGGGTCA
ATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATTAGGGAAAGAGTGCTGGTAGACAGGGGCGCATCCGCTAACATCCTGTCCTTAC
CGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCG
GTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCTTATAACGTCATCTTTGGGAGACCCATCATCCACTCATT
TCGGGTCATTCCCTCAACACTTCATCAATTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCAGCAC
TCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCCCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTT
GTTCCTCTGCTTAGTCCCGAGAAACAGCCAGATCTGATGGAGATCGGCACTCCAGAATCCTCATGGATGGACCCGATCACAGACTTCATTAGGGGCAACTCACCACAAGA
TCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAAATGTCTAACCCCTG
AAGAGGGCCTAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGCTACTCCTCAACCTCAACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGTGGAA
TATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCC
GGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAGAGGAGACGTCCTCGCGCACCCGTGGAACGTGGAGCACCTGA
AGCGTTATTATCCTTGA
Protein sequence
MVPPANSTNTTDRRTLATSDAHQREVGAAAVEGQSHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMRAQMRSMEAMY
DEMVLAAGAGSRSENRVTRMDVREQRGFHLGPAEEERPKDNESEGYTRQRRDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNNGDLVESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRRRSGKDERVDL
KSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPR
TSSAEKKEERKRSRTPPRRTDRPAVINTVFGGPSGGQSGHKRKELARAARREVCIIRERVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLP
VTLGQDQTRVTQMAEFVVVDGRSAYNVIFGRPIIHSFRVIPSTLHQFLKYPTPNGVGTVRGEQTASRECYAAALKGSSVCALETLRDGTPEFEADLPRKEFAAPTEELEL
VPLLSPEKQPDLMEIGTPESSWMDPITDFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLKCLTPEEGLVEHYEPTTNEDGLLLNLNLLEERRAMAQLRLVE
YQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLRGDVLAHPWNVEHLKRYYP