CuGenDBv2

Moc04g22660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g22660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:16487101..16492287
RNA-Seq Expression: Moc04g22660
Synteny: Moc04g22660
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
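
As a minimal sketch of how the genome location field above might be read programmatically, the following Python snippet splits a `chrom:start..end` string into its parts. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not stated on the page).
    return end - start + 1

chrom, start, end = parse_location("chr4:16487101..16492287")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene span is 5187 bp; with 0-based half-open coordinates it would be one base shorter.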


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 2.2e-278; % identity: 93.75)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTD PAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYL LGWTRSQLK SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGE

Query:  SVIPEGCINLPVTLGHDRTQVTQMAAFV
        SVIPEG I+LPVTLG D+TQVTQMA FV
Subjt:  SVIPEGCINLPVTLGHDRTQVTQMAAFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia] (e value: 6.2e-228; % identity: 95.97)
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPI EILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTD PAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 2.4e-248; % identity: 75.91)
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP+TSSAEKKEERK SRTP RR D PAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYL LGWTRSQLK S TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVG

Query:  FSGESVIPEGCINLPVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCI+LPVTLGHD+TQVTQMA FVVID                                        GEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCINLPVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL
          RDGTLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 6.1e-244; % identity: 61.27)
Query:  MVQPANSTNTTDQRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKTTRGRGGTSKKGARGPTPTPTSENFDALKKEMEAMR
        MVQPANSTNT D+R LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDQRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKTTRGRGGTSKKGARGPTPTPTSENFDALKKEMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVMRVDVREQGGSHLGPAEEERPEDNEREGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVMRVDVREQGGSHLGPAEEERPEDNEREGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPIFEILTNIEE+GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP+++S EKKEERKR RTPPRR D PAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGESVIPEGCINL
        R ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYL LGWTRSQLK SPTPLVGFSGES+  EGCI+L
Subjt:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGESVIPEGCINL

Query:  PVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETLAGRD
        PV++  D TQVTQMA FVVID                                        GE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETLAGRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] (e value: 1.0e-222; % identity: 98.52)
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTD PAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 1.1e-278; % identity: 93.75)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTD PAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYL LGWTRSQLK SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGE

Query:  SVIPEGCINLPVTLGHDRTQVTQMAAFV
        SVIPEG I+LPVTLG D+TQVTQMA FV
Subjt:  SVIPEGCINLPVTLGHDRTQVTQMAAFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 1.2e-248; % identity: 75.91)
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP+TSSAEKKEERK SRTP RR D PAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYL LGWTRSQLK S TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVG

Query:  FSGESVIPEGCINLPVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCI+LPVTLGHD+TQVTQMA FVVID                                        GEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCINLPVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL
          RDGTLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL

A0A6J1D9W7 uncharacterized protein LOC111018708 (e value: 3.0e-228; % identity: 95.97)
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPI EILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTD PAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 3.0e-244; % identity: 61.27)
Query:  MVQPANSTNTTDQRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKTTRGRGGTSKKGARGPTPTPTSENFDALKKEMEAMR
        MVQPANSTNT D+R LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDQRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKTTRGRGGTSKKGARGPTPTPTSENFDALKKEMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVMRVDVREQGGSHLGPAEEERPEDNEREGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVMRVDVREQGGSHLGPAEEERPEDNEREGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPIFEILTNIEE+GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP+++S EKKEERKR RTPPRR D PAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGESVIPEGCINL
        R ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYL LGWTRSQLK SPTPLVGFSGES+  EGCI+L
Subjt:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGESVIPEGCINL

Query:  PVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETLAGRD
        PV++  D TQVTQMA FVVID                                        GE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGHDRTQVTQMAAFVVID----------------------------------------GEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DS95 uncharacterized protein LOC111023421 (e value: 4.9e-223; % identity: 98.52)
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTD PAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
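
The alignment blocks in this section carry a midline between the Query and Subjt rows, in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below shows one way such a midline could be summarised. The function name `midline_stats` is hypothetical, and the result need not reproduce the % identity values listed for each hit, since the alignments shown on the page may be partial.

```python
def midline_stats(query, midline):
    """Return (identities, positives, percent_identity) for one aligned block.

    Letters in the midline count as identities; '+' counts as a positive
    (conservative substitution). Alignment length includes gap columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent_identity = 100.0 * identities / len(query)
    return identities, positives, percent_identity

# Illustrative 14-column fragment typed in by hand, not the full alignment.
query = "QAESSHN---PAGI"
midline = "+AESS N   PAG+"
print(midline_stats(query, midline))
```

For a hit spanning several blocks, the per-block counts would be summed before dividing by the total alignment length.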

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCAAAGGACTCTAGCTGCCAGTGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGTAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGACCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCAAGAAAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGATGCGTGTGGACGTACGCGAGCAAGGGGGTTCCCACCTC
GGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGAGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCGAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCGCTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAAGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGATTGCCAGCCAGGTCAATCTCA
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTATGACAAAAAGACAGCGACCCACCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAAAAAGTCATCGATGGACAAGAGCTTCTCCGAACCAAAACC
GGCCGACCGGAGCGAAAGATCGGCCGGGGCAGAAGTGGGAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAG
TATCGAAGGGCGGAGAACGGGCCTACCAGGAGCCGACCTTATGAGCGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACAAACATCGAGGAATCTGGG
ATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCA
GACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAAGACCAGCTCAGCAGAGAAAAAGGAAGAG
CGAAAGCGTTCAAGGACGCCGCCCCGGCGCACTGACCTACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCTGGACATAAAAGAAAGGAG
TTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCTCAC
AATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTC
CCCTTGGGATGGACGAGGTCGCAATTGAAGACAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCAACTTGCCGGTCACG
CTGGGGCATGACCGAACTCAGGTCACTCAAATGGCCGCGTTCGTGGTAATTGACGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCA
TCGGTCTGCGCCCTTGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTACCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTT
GTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCTAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCAATCTCAGAGCCA
GATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCGCAAGACCCCAAGGAGCGCAGAAAGTTGGCA
CGACGGGCAGCTCGGTTCGTGGTTCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTC
AGAGAGATCCACGAAGGAGTGTGCGGCAATCACTTAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCC
AAGAAGGCCAGACCAAGTTCGCTGTGGTTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTC
GACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGTGGAATATCAGGGCAGAATGACCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAG
GTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATCGTCCGACTTGGGACGTAC
ATATTGGCCGATCTGAAAGGAGACGTCCTCGTGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCAAAGGACTCTAGCTGCCAGTGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGTAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGACCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCAAGAAAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGATGCGTGTGGACGTACGCGAGCAAGGGGGTTCCCACCTC
GGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGAGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCGAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCGCTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAAGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGATTGCCAGCCAGGTCAATCTCA
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTATGACAAAAAGACAGCGACCCACCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAAAAAGTCATCGATGGACAAGAGCTTCTCCGAACCAAAACC
GGCCGACCGGAGCGAAAGATCGGCCGGGGCAGAAGTGGGAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAG
TATCGAAGGGCGGAGAACGGGCCTACCAGGAGCCGACCTTATGAGCGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACAAACATCGAGGAATCTGGG
ATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCA
GACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAAGACCAGCTCAGCAGAGAAAAAGGAAGAG
CGAAAGCGTTCAAGGACGCCGCCCCGGCGCACTGACCTACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCTGGACATAAAAGAAAGGAG
TTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCTCAC
AATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTC
CCCTTGGGATGGACGAGGTCGCAATTGAAGACAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCAACTTGCCGGTCACG
CTGGGGCATGACCGAACTCAGGTCACTCAAATGGCCGCGTTCGTGGTAATTGACGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCA
TCGGTCTGCGCCCTTGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTACCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTT
GTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCTAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCAATCTCAGAGCCA
GATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCGCAAGACCCCAAGGAGCGCAGAAAGTTGGCA
CGACGGGCAGCTCGGTTCGTGGTTCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTC
AGAGAGATCCACGAAGGAGTGTGCGGCAATCACTTAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCC
AAGAAGGCCAGACCAAGTTCGCTGTGGTTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTC
GACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGTGGAATATCAGGGCAGAATGACCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAG
GTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATCGTCCGACTTGGGACGTAC
ATATTGGCCGATCTGAAAGGAGACGTCCTCGTGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDQRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKTTRGRGGTSKKGARGPTPTPTSENFDALKKEMEAMRTQMRS
MEEMYNEMMLAAGAGSRSENRVMRVDVREQGGSHLGPAEEERPEDNEREGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSIS
TYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT
GRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS
DCWELKRQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDLPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPH
NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLPLGWTRSQLKTSPTPLVGFSGESVIPEGCINLPVTLGHDRTQVTQMAAFVVIDGEQTASRECYASALKGS
SVCALETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKERRKLA
RRAARFVVRDGALYRRGFSLPLLRCLTPEEGLYVLREIHEGVCGNHLGARSLSAKVIRQGYYWPTLSQDAKKARPSSLWFEIGMPSDRVEHYEPTTNEEELLLNL
DLLEERRAMAQLRLVEYQGRMTRHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRLGTYILADLKGDVLVHPWNAEHLKRYYP
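
The CDS and protein sequences above are related by translation. The following Python sketch translates a coding sequence codon by codon and stops at the first stop codon. It assumes the standard nuclear genetic code applies to this gene, and the names `CODON_TABLE` and `translate` are hypothetical helpers rather than anything provided by CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed applicable here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS sequence listed above.
print(translate("ATGGTTCAACCAGCGAACTCGACC"))  # prints MVQPANST
```

The output matches the first eight residues of the protein sequence; applying `translate` to the last 15 bases of the CDS (`CGTTATTATCCTTGA`) gives `RYYP`, the final residues of the protein followed by the stop codon.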