; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20500
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:16001106..16006688
RNA-Seq ExpressionMoc06g20500
SyntenyMoc06g20500
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]7.7e-26291.47Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCE+K+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KD KDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRR+FLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIV
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG+GRSGKD E ADPKSKDKGSFSSGRAEYRRAE+  TRSRPYERFTPTTIPISEILTNI 
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIV

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFH+EHGHNTSD WELKRQIE+LIQD YFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIF GPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGG

Query:  QSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAAR EVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDH+VV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  TVIPEGCIDLPVTLGR
        +VIPEG IDLPVTLG+
Subjt:  TVIPEGCIDLPVTLGR

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]3.4e-21793.36Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KD KDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRR+FLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQ

Query:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDC
        G+GRSGKD ERADPKSKDKGSFSSGRAEYRRAES  T+SRPYERFTPTTIPISEILTNI +SGMEKLLKRPEKLRGAPERRSKDKYCRFH+EHGHNTSDC
Subjt:  GQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDC

Query:  WELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIF GPSGGQSGHKRKELARAAR EVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHMVVRRVL
        LPHNDA VIAPLIDH+VVRRVL
Subjt:  LPHNDALVIAPLIDHMVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.5e-24173.97Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCE+K+  LNDGDLGESPFTSDVLE        APTVK YDG+KD KDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I +GRSGKDE+AD KSKDKGSFSSGRAE+RRA +  TRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILT

Query:  NIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGP
        NI +SGMEKLLKRPEKLRGAPERR+KDKYCRFH+EH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIF GP
Subjt:  NIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGP

Query:  SGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAAR EVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDH+VVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGETVIPEGCIDLPVTLGRTK--------------------------------LGSLKWPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S E+VIPEGCIDLPVTLG  +                                + S    +LKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGETVIPEGCIDLPVTLGRTK--------------------------------LGSLKWPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -QDGTLKFEADLPRKEFAAPTEELELVPLL
         +DGTL+F+A+LPR+EFAAPTEELELVPLL
Subjt:  -QDGTLKFEADLPRKEFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.1e-23960.63Show/hide
Query:  MVQPANSTNTTDRRTLATSDAHQREVGAAAVEGQGHDGLATEPLSRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LA +  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLATSDAHQREVGAAAVEGQGHDGLATEPLSRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KD KDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR++F++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I QGR+GKD+ +AD KS+DKG S SS R +YRR+ S   +SRPYE +TPTTIPI EILTNI ++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFH++HGHNTS+ WELKRQIEDLIQD YFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELA

Query:  RAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGETVIPEGCIDL
        R AR EVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID ++VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGE++  EGCIDL
Subjt:  RAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGETVIPEGCIDL

Query:  PVTL--------------------------GRTKLGSLK------WPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD
        PV++                          GR  + S +        +LKYST NGVGTVRGE   SRECYA+  K SSVCALE  T++D
Subjt:  PVTL--------------------------GRTKLGSLK------WPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia]1.8e-21086.03Show/hide
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTK
        + KRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCE+KDDSLNDGDLGE PFTSDVLEAPIPPKFKAPTVKPYDGTK
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTK

Query:  DRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA
        D KDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA

Query:  HYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSR
        H SDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIG+GRSGKD ERADPKSKDKGSFSSGRAEYRRAE+  T  R
Subjt:  HYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSR

Query:  PYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNI +SGMEKLLKR EKLRGAPERR KDKYCRFH+EHGHNTS+CWELKRQIEDLIQD YFKKFVG P TSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEE
        PRRTDRPAVINTIF GPSGGQS HKRK+LARAAR EVCIIREQGPTCPITFD ADLEE
Subjt:  PRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.7e-26291.47Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCE+K+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KD KDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRR+FLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIV
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG+GRSGKD E ADPKSKDKGSFSSGRAEYRRAE+  TRSRPYERFTPTTIPISEILTNI 
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIV

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFH+EHGHNTSD WELKRQIE+LIQD YFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIF GPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGG

Query:  QSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAAR EVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDH+VV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  TVIPEGCIDLPVTLGR
        +VIPEG IDLPVTLG+
Subjt:  TVIPEGCIDLPVTLGR

A0A6J1D9E1 uncharacterized protein LOC1110188237.3e-24273.97Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCE+K+  LNDGDLGESPFTSDVLE        APTVK YDG+KD KDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I +GRSGKDE+AD KSKDKGSFSSGRAE+RRA +  TRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILT

Query:  NIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGP
        NI +SGMEKLLKRPEKLRGAPERR+KDKYCRFH+EH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIF GP
Subjt:  NIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGP

Query:  SGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAAR EVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDH+VVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGETVIPEGCIDLPVTLGRTK--------------------------------LGSLKWPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S E+VIPEGCIDLPVTLG  +                                + S    +LKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGETVIPEGCIDLPVTLGRTK--------------------------------LGSLKWPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -QDGTLKFEADLPRKEFAAPTEELELVPLL
         +DGTL+F+A+LPR+EFAAPTEELELVPLL
Subjt:  -QDGTLKFEADLPRKEFAAPTEELELVPLL

A0A6J1D9W7 uncharacterized protein LOC1110187081.6e-21793.36Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KD KDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRR+FLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQ

Query:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDC
        G+GRSGKD ERADPKSKDKGSFSSGRAEYRRAES  T+SRPYERFTPTTIPISEILTNI +SGMEKLLKRPEKLRGAPERRSKDKYCRFH+EHGHNTSDC
Subjt:  GQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDC

Query:  WELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIF GPSGGQSGHKRKELARAAR EVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHMVVRRVL
        LPHNDA VIAPLIDH+VVRRVL
Subjt:  LPHNDALVIAPLIDHMVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204792.0e-23960.63Show/hide
Query:  MVQPANSTNTTDRRTLATSDAHQREVGAAAVEGQGHDGLATEPLSRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LA +  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLATSDAHQREVGAAAVEGQGHDGLATEPLSRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KD KDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR++F++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I QGR+GKD+ +AD KS+DKG S SS R +YRR+ S   +SRPYE +TPTTIPI EILTNI ++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFH++HGHNTS+ WELKRQIEDLIQD YFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELA

Query:  RAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGETVIPEGCIDL
        R AR EVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID ++VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGE++  EGCIDL
Subjt:  RAARCEVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGETVIPEGCIDL

Query:  PVTL--------------------------GRTKLGSLK------WPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD
        PV++                          GR  + S +        +LKYST NGVGTVRGE   SRECYA+  K SSVCALE  T++D
Subjt:  PVTL--------------------------GRTKLGSLK------WPILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLQD

A0A6J1DXR9 uncharacterized protein LOC1110251098.7e-21186.03Show/hide
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTK
        + KRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCE+KDDSLNDGDLGE PFTSDVLEAPIPPKFKAPTVKPYDGTK
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTK

Query:  DRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA
        D KDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA

Query:  HYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSR
        H SDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIG+GRSGKD ERADPKSKDKGSFSSGRAEYRRAE+  T  R
Subjt:  HYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKD-ERADPKSKDKGSFSSGRAEYRRAESELTRSR

Query:  PYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNI +SGMEKLLKR EKLRGAPERR KDKYCRFH+EHGHNTS+CWELKRQIEDLIQD YFKKFVG P TSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSDCWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEE
        PRRTDRPAVINTIF GPSGGQS HKRK+LARAAR EVCIIREQGPTCPITFD ADLEE
Subjt:  PRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCAGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCGAAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCGCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAATGCCGTGCCTTTCAGATCGCGCTTACTGGCAGCGCTCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGAAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTACTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACC
GGCCGACCTGAACGAAAGATCGGCCAGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTAT
CGAAGGGCGGAGAGCGAACTTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGTGGATTCTGGAATG
GAAAAACTACTCAAGCGTCCGGAAAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCAGGAGCACGGCCACAACACGTCGGAT
TGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGTCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGA
AAGCGTTCAAGGACGCCACCACGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGAAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTA
GCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAAT
GATGCCCTTGTGATTGCTCCCTTGATTGATCATATGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCC
TTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAAACGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTG
GGCAGGACAAAACTCGGGTCACTCAAATGGCCGATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTAT
GCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCAGGATGGGACGCTCAAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACT
GAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGAACCGATCACGGACTTCATT
AAGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAACATTACGAGCCTACGACGAATGAGGATGGGCTGCTCCTC
AACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCTTGGCGGAATATCAGGGCAGAATGACCAGATATTATAATGCCCGCGTTCGACCTCGGGCC
TTTCAGGTCGGACATCTGGTCTTAAGGAGGGTTGAAACGCATGTGGGTGCCCTTGATCCGGCTTGGGAGGGCCCGTTTGAGTTCAAGGGCATAGTCCGACCTGGG
ACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATCATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCAGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCGAAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCGCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAATGCCGTGCCTTTCAGATCGCGCTTACTGGCAGCGCTCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGAAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTACTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACC
GGCCGACCTGAACGAAAGATCGGCCAGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTAT
CGAAGGGCGGAGAGCGAACTTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGTGGATTCTGGAATG
GAAAAACTACTCAAGCGTCCGGAAAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCAGGAGCACGGCCACAACACGTCGGAT
TGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGTCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGA
AAGCGTTCAAGGACGCCACCACGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGAAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTA
GCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAAT
GATGCCCTTGTGATTGCTCCCTTGATTGATCATATGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCC
TTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAAACGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTG
GGCAGGACAAAACTCGGGTCACTCAAATGGCCGATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTAT
GCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCAGGATGGGACGCTCAAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACT
GAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGAACCGATCACGGACTTCATT
AAGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAACATTACGAGCCTACGACGAATGAGGATGGGCTGCTCCTC
AACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCTTGGCGGAATATCAGGGCAGAATGACCAGATATTATAATGCCCGCGTTCGACCTCGGGCC
TTTCAGGTCGGACATCTGGTCTTAAGGAGGGTTGAAACGCATGTGGGTGCCCTTGATCCGGCTTGGGAGGGCCCGTTTGAGTTCAAGGGCATAGTCCGACCTGGG
ACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATCATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLATSDAHQREVGAAAVEGQGHDGLATEPLSRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRS
MEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQL
RGELDAQVEALKAKCERKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDRKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSIS
TYSQLRRKFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT
GRPERKIGQGRSGKDERADPKSKDKGSFSSGRAEYRRAESELTRSRPYERFTPTTIPISEILTNIVDSGMEKLLKRPEKLRGAPERRSKDKYCRFHQEHGHNTSD
CWELKRQIEDLIQDVYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFEGPSGGQSGHKRKELARAARCEVCIIREQGPTCPITFDGADLEEVHLPHN
DALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGETVIPEGCIDLPVTLGRTKLGSLKWPILKYSTPNGVGTVRGEQTASRECY
AAALKGSSVCALETLQDGTLKFEADLPRKEFAAPTEELELVPLLSPEKQPDLMEIGAPESSWMEPITDFIKGNSPQDPKERRKLARRAARVEHYEPTTNEDGLLL
NLDLLEERRAMAQLRLAEYQGRMTRYYNARVRPRAFQVGHLVLRRVETHVGALDPAWEGPFEFKGIVRPGTYMLADLKGDVLAHPWNAEHLKRYHP