; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:7285766..7291230
RNA-Seq ExpressionMoc07g09470
SyntenyMoc07g09470
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.1e-25587.88Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEV E LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASD

Query:  AIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEAL
        AIKCRAF+IALT SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLAT+RQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADEAL
Subjt:  AIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEAL

Query:  TVKLGEEAPATFAE-------------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                         I RGRSGKD E ADPKSKDKGSFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPI FDGADLEEVHLPHNDA VIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPT LVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]5.1e-20588.86Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEV EGLMDF AASDAIKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAE-------------------------I
        FSSR Y KKT THLAT+RQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADEALTVKLGE+AP TFAE                         I
Subjt:  FSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAE-------------------------I

Query:  SRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNC
         RGRSGKD ERADPKSKDKGSFSSGRAEYRRAESGPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+C
Subjt:  SRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNC

Query:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPI FDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVH

Query:  LPHNDAFVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDAFVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]9.7e-25774.66Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEV EGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQ

Query:  AASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLA
        AASDAIKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFL GLA
Subjt:  AASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLA

Query:  DEALTVKLGEEAPATFAE-------------------------ISRGRSGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAE                         I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAE-------------------------ISRGRSGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTS+ WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGF
        SGGQSGHKRKELARAARREVCIIREQ PTCPI FD ADLEEVHLPHNDA VIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S T LVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQLASAYETDLARSVSVEILDN
         +DGTLEF+A+LPR+EF APTEELELVPLL  +      +E +L    S+  +D+
Subjt:  -QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQLASAYETDLARSVSVEILDN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.6e-21187.93Show/hide
Query:  MCYFLIGLADEALTVKLGEEAPATFAE------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISE
        MCYFL GLADEALTVKL EEAPATFAE                  I +GRSGKD E  DPKSKDKGSFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISE
Subjt:  MCYFLIGLADEALTVKLGEEAPATFAE------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISE

Query:  ILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIF
        ILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELK QIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLL
        GGPSGGQSGHKRK+LARAARREVCIIREQ PTCPI FD ADL EVHLPHNDA VIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPT L
Subjt:  GGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE

Query:  TL--QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQL
        TL  +DGTLEFEADLP +EF AP EELELVPLLS EKQ+
Subjt:  TL--QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.7e-25462.91Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARIIAPTLPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARI  P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARIIAPTLPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEWYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEWYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEV E LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIAL

Query:  TDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPAT
        T SARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLAT+RQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFL GLADE LTVKL EEAPAT
Subjt:  TDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPAT

Query:  FAE-------------------------ISRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
        FAE                         I +GR+GKD+ +AD KS+DKG S SS R +YRR+ S   +SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  FAE-------------------------ISRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTSN WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGESVIPEGCIDL
        R ARREVCIIREQ PT  IAF+ ADLE VHLPHNDA VIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPT LVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD
        PV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCALE  T++D
Subjt:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.5e-25587.88Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEV E LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASD

Query:  AIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEAL
        AIKCRAF+IALT SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLAT+RQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADEAL
Subjt:  AIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEAL

Query:  TVKLGEEAPATFAE-------------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                         I RGRSGKD E ADPKSKDKGSFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPI FDGADLEEVHLPHNDA VIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPT LVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188234.7e-25774.66Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEV EGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQ

Query:  AASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLA
        AASDAIKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFL GLA
Subjt:  AASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLA

Query:  DEALTVKLGEEAPATFAE-------------------------ISRGRSGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAE                         I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAE-------------------------ISRGRSGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTS+ WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGF
        SGGQSGHKRKELARAARREVCIIREQ PTCPI FD ADLEEVHLPHNDA VIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S T LVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQLASAYETDLARSVSVEILDN
         +DGTLEF+A+LPR+EF APTEELELVPLL  +      +E +L    S+  +D+
Subjt:  -QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQLASAYETDLARSVSVEILDN

A0A6J1D9W7 uncharacterized protein LOC1110187082.5e-20588.86Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEV EGLMDF AASDAIKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAE-------------------------I
        FSSR Y KKT THLAT+RQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFL GLADEALTVKLGE+AP TFAE                         I
Subjt:  FSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAE-------------------------I

Query:  SRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNC
         RGRSGKD ERADPKSKDKGSFSSGRAEYRRAESGPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+C
Subjt:  SRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNC

Query:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPI FDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVH

Query:  LPHNDAFVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDAFVIAPLIDHVVVRRVL

A0A6J1DD03 uncharacterized protein LOC1110198997.9e-21287.93Show/hide
Query:  MCYFLIGLADEALTVKLGEEAPATFAE------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISE
        MCYFL GLADEALTVKL EEAPATFAE                  I +GRSGKD E  DPKSKDKGSFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISE
Subjt:  MCYFLIGLADEALTVKLGEEAPATFAE------------------ISRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISE

Query:  ILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIF
        ILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELK QIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLL
        GGPSGGQSGHKRK+LARAARREVCIIREQ PTCPI FD ADL EVHLPHNDA VIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPT L
Subjt:  GGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE

Query:  TL--QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQL
        TL  +DGTLEFEADLP +EF AP EELELVPLLS EKQ+
Subjt:  TL--QDGTLEFEADLPRKEFGAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204791.3e-25462.91Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARIIAPTLPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARI  P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARIIAPTLPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEWYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEWYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEV E LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIAL

Query:  TDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPAT
        T SARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLAT+RQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFL GLADE LTVKL EEAPAT
Subjt:  TDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPAT

Query:  FAE-------------------------ISRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
        FAE                         I +GR+GKD+ +AD KS+DKG S SS R +YRR+ S   +SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  FAE-------------------------ISRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTSN WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGESVIPEGCIDL
        R ARREVCIIREQ PT  IAF+ ADLE VHLPHNDA VIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPT LVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD
        PV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCALE  T++D
Subjt:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCATCGCACCTACCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGAGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGTGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTTGAGGCC
TTAAAGGCCAAATGTGAGCAAAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCATTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCGTCAGGCAGAAGGAGGGTGAGACACTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCATCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATCAGCC
GGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGC
CGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGG
AGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAAG
ACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTC
ATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGAC
CTGTCCAATCGCCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCTTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTTCTGG
TAGACGGGGGCGCATCCGCTAACATCCTATCCTTACCGACCTACCTCGCCTTGGGTTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCTGCTGGTTGGGTTCTCTGGG
GAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTA
TAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAG
AACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCAGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGG
AAGGAGTTTGGCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCTCCGTCGA
GATCCTAGATAATCTCTCGATCTTAGAGCTAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAAGGGTAACTCACAACAAG
ATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGATCCGAGATGGTGCATTAGTAGAACATTACGAGCCTACGACGAATGAGGATGGGCTGCTCCTC
AACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCTCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCA
GGTCGGACATCTGGTCTTAAGGAGGGTCCAAATGCATGTGGGTGCCCTTGATCCGGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCATCGCACCTACCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGAGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGTGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTTGAGGCC
TTAAAGGCCAAATGTGAGCAAAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCATTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCGTCAGGCAGAAGGAGGGTGAGACACTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCATCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATCAGCC
GGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGC
CGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGG
AGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAAG
ACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTC
ATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGAC
CTGTCCAATCGCCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCTTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTTCTGG
TAGACGGGGGCGCATCCGCTAACATCCTATCCTTACCGACCTACCTCGCCTTGGGTTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCTGCTGGTTGGGTTCTCTGGG
GAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTA
TAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAG
AACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCAGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGG
AAGGAGTTTGGCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCTCCGTCGA
GATCCTAGATAATCTCTCGATCTTAGAGCTAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAAGGGTAACTCACAACAAG
ATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGATCCGAGATGGTGCATTAGTAGAACATTACGAGCCTACGACGAATGAGGATGGGCTGCTCCTC
AACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCTCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCA
GGTCGGACATCTGGTCTTAAGGAGGGTCCAAATGCATGTGGGTGCCCTTGATCCGGCCTAG
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARIIAPTLPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMY
NEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEWYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVIEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTTTHLATVRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEISRGRSGKDERADPKSKDKGSFSSGRAEYRRAESGPTRS
RPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAV
INTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPIAFDGADLEEVHLPHNDAFVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSG
ESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETLQDGTLEFEADLPR
KEFGAPTEELELVPLLSPEKQLASAYETDLARSVSVEILDNLSILELDLMEIGAPESSWMDPITDFIKGNSQQDPKERRKLARRAARFVIRDGALVEHYEPTTNEDGLLL
NLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQMHVGALDPA