; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:18717658..18723244
RNA-Seq ExpressionMoc08g26080
SyntenyMoc08g26080
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.6e-20676.14Show/hide
Query:  QAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+I REEFDQLRG+LDAQVEALKAKCEQK+  LNDGD GESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEA--------------------------RPPSP--SGKKWKD-ERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIE
        TVKLGEEA                          RP      G+  KD E  DPKSKDKGSFSSGR EYRR ENGP+RSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEA--------------------------RPPSP--SGKKWKD-ERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSK----------------------EVRGKAQD----------QLSREKGRAEAFKDATQARRP--TAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSK                      ++    QD          + S  + + E  +  T  RR    AVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSK----------------------EVRGKAQD----------QLSREKGRAEAFKDATQARRP--TAVINTIFGGPSGG

Query:  QSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QS  KRKELARA RREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMVEFV
        SVIPEG IDLPVTLGQDQT+VTQM EFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMVEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]9.1e-21064.28Show/hide
Query:  SSNQQAESSHNPA---GIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+I REEFDQLRG+L+AQVEALKAKCEQK+  LNDGD GESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEA--------------------------RPPS--PSGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILT
        DEALTVKLG+EA                          RP      G+  KDE+ D KSKDKGSFSSGR E+RR  NGP+RSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEA--------------------------RPPS--PSGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSK------------------EVRGKAQDQLSRE----------KGRAEAFKDATQARRP------TAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+K                  E++ + +D +  +             AE  ++   +R P       AVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSK------------------EVRGKAQDQLSRE----------KGRAEAFKDATQARRP------TAVINTIFGGP

Query:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQS HKRKELARA RREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQM EFVV+DGRSAYNAIFGRPIIHSFR IPSTLHQVLKYSTP+GVG VRGEQ ASRECYA ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  P+ + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.8e-20555.47Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVRAAAVEGQGHDGLAAEPLRRSARITAPTLPPAHPRTSRATRGRGGTSNKGARGPAPAPTSENFGALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREV A  VEGQGH+ L  EPL RSARIT P LPPAHP+ S+                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVRAAAVEGQGHDGLAAEPLRRSARITAPTLPPAHPRTSRATRGRGGTSNKGARGPAPAPTSENFGALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNP--

Query:  AGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+I REEFDQL+ + DAQVEALKA+CE+K+ S +DGD GE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA---
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEA   
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA---

Query:  -----------------------RPPS--PSGKKWKDE-RTDPKSKDKG-SFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
                               RP      G+  KD+ + D KS+DKG S SS R +YRR  +  ++SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  -----------------------RPPS--PSGKKWKDE-RTDPKSKDKG-SFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSE---------------------HKRKELARATRREVCIIREQ
        PEKLRG PE+R+ +   +               K   +            G P     E                     +K+KELAR  RREVCIIREQ
Subjt:  PEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSE---------------------HKRKELARATRREVCIIREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ

Query:  MVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALE--TLRD
        M EFVV+DGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST +GVGTVRGE   SRECYA   K SSVCALE  T+RD
Subjt:  MVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALE--TLRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]6.8e-17360.2Show/hide
Query:  EQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQ---SPSRSHKSSNQQAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DLR+HL  K+  +  + +   S SR   +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQ---SPSRSHKSSNQQAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+   +D D GESPFTSD++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETL-----REYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEARPPSPSGKKWKDERTDPKSKDKGSFSS
        FS RHYD+KTATHLATIRQKE ETL      E    F E         D   +    T   ++ +  K          S KK KD   D KSKDKGS SS
Subjt:  FSSRHYDKKTATHLATIRQKEGETL-----REYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEARPPSPSGKKWKDERTDPKSKDKGSFSS

Query:  G-RPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGP
        G R EYRR E+GPSRSRPYER       I ++   I+DS  +K + +P         RS  V  K + + SR   R E         RP AVINTIFGGP
Subjt:  G-RPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGP

Query:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQ E+KRKELA   RR+V IIREQ PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLAL  TRSQLK+SPTPLVGF
Subjt:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLR
        S ESV PEGCIDLPVT+GQD T+VTQM EFVV+DGR AYNAIF RPIIHSF+ +PS LHQVLKYSTP+GVGTVRGEQ  SRECYA ALK SSVCALE   
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLR

Query:  DGTLEFEADLPRK
              + DLPR+
Subjt:  DGTLEFEADLPRK

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]7.1e-17081.3Show/hide
Query:  LADEALTVKLGEEA--------------------------RPPSP--SGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEI
        +ADEALTVKLGEEA                          RP      G+  KDER DPKSKDKGSFSSGR EYRR ENGP+RSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGEEA--------------------------RPPSP--SGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEI

Query:  LTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSEHKRKELARATRREVCIIREQGPTCPI
        LTNIEDSGMEKLLKRPEKLRGAPERRSK+    A+ +  R++ R        +  RP AVINTIFGGPSGGQS HKRKELAR  RREVCIIREQGPTCPI
Subjt:  LTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSEHKRKELARATRREVCIIREQGPTCPI

Query:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMVEFVV
        TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQM EFVV
Subjt:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMVEFVV

Query:  VDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEK
        VDGRS YNAIFGRPIIHSFR IPSTLHQVLKYSTP+GVGTVRGEQT SRECYA ALKGSSVCALETLRDGTLE EADLPRKEFAAPTEELELVPLLSPEK
Subjt:  VDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEK

Query:  Q
        Q
Subjt:  Q

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.7e-20676.14Show/hide
Query:  QAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+I REEFDQLRG+LDAQVEALKAKCEQK+  LNDGD GESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEA--------------------------RPPSP--SGKKWKD-ERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIE
        TVKLGEEA                          RP      G+  KD E  DPKSKDKGSFSSGR EYRR ENGP+RSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEA--------------------------RPPSP--SGKKWKD-ERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSK----------------------EVRGKAQD----------QLSREKGRAEAFKDATQARRP--TAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSK                      ++    QD          + S  + + E  +  T  RR    AVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSK----------------------EVRGKAQD----------QLSREKGRAEAFKDATQARRP--TAVINTIFGGPSGG

Query:  QSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QS  KRKELARA RREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMVEFV
        SVIPEG IDLPVTLGQDQT+VTQM EFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMVEFV

A0A6J1D9E1 uncharacterized protein LOC1110188234.4e-21064.28Show/hide
Query:  SSNQQAESSHNPA---GIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+I REEFDQLRG+L+AQVEALKAKCEQK+  LNDGD GESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEA--------------------------RPPS--PSGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILT
        DEALTVKLG+EA                          RP      G+  KDE+ D KSKDKGSFSSGR E+RR  NGP+RSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEA--------------------------RPPS--PSGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSK------------------EVRGKAQDQLSRE----------KGRAEAFKDATQARRP------TAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+K                  E++ + +D +  +             AE  ++   +R P       AVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSK------------------EVRGKAQDQLSRE----------KGRAEAFKDATQARRP------TAVINTIFGGP

Query:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQS HKRKELARA RREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQM EFVV+DGRSAYNAIFGRPIIHSFR IPSTLHQVLKYSTP+GVG VRGEQ ASRECYA ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  P+ + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG

A0A6J1DHB3 uncharacterized protein LOC1110204798.6e-20655.47Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVRAAAVEGQGHDGLAAEPLRRSARITAPTLPPAHPRTSRATRGRGGTSNKGARGPAPAPTSENFGALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREV A  VEGQGH+ L  EPL RSARIT P LPPAHP+ S+                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVRAAAVEGQGHDGLAAEPLRRSARITAPTLPPAHPRTSRATRGRGGTSNKGARGPAPAPTSENFGALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNP--

Query:  AGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+I REEFDQL+ + DAQVEALKA+CE+K+ S +DGD GE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIIAREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA---
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEA   
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA---

Query:  -----------------------RPPS--PSGKKWKDE-RTDPKSKDKG-SFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
                               RP      G+  KD+ + D KS+DKG S SS R +YRR  +  ++SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  -----------------------RPPS--PSGKKWKDE-RTDPKSKDKG-SFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSE---------------------HKRKELARATRREVCIIREQ
        PEKLRG PE+R+ +   +               K   +            G P     E                     +K+KELAR  RREVCIIREQ
Subjt:  PEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSE---------------------HKRKELARATRREVCIIREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQ

Query:  MVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALE--TLRD
        M EFVV+DGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST +GVGTVRGE   SRECYA   K SSVCALE  T+RD
Subjt:  MVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALE--TLRD

A0A6J1DPC9 uncharacterized protein LOC1110222803.3e-17360.2Show/hide
Query:  EQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQ---SPSRSHKSSNQQAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DLR+HL  K+  +  + +   S SR   +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQ---SPSRSHKSSNQQAESSHN---PAGIIAREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+   +D D GESPFTSD++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETL-----REYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEARPPSPSGKKWKDERTDPKSKDKGSFSS
        FS RHYD+KTATHLATIRQKE ETL      E    F E         D   +    T   ++ +  K          S KK KD   D KSKDKGS SS
Subjt:  FSSRHYDKKTATHLATIRQKEGETL-----REYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEARPPSPSGKKWKDERTDPKSKDKGSFSS

Query:  G-RPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGP
        G R EYRR E+GPSRSRPYER       I ++   I+DS  +K + +P         RS  V  K + + SR   R E         RP AVINTIFGGP
Subjt:  G-RPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGP

Query:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQ E+KRKELA   RR+V IIREQ PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLAL  TRSQLK+SPTPLVGF
Subjt:  SGGQSEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLR
        S ESV PEGCIDLPVT+GQD T+VTQM EFVV+DGR AYNAIF RPIIHSF+ +PS LHQVLKYSTP+GVGTVRGEQ  SRECYA ALK SSVCALE   
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLR

Query:  DGTLEFEADLPRK
              + DLPR+
Subjt:  DGTLEFEADLPRK

A0A6J1DYW5 uncharacterized protein LOC1110243323.4e-17081.3Show/hide
Query:  LADEALTVKLGEEA--------------------------RPPSP--SGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEI
        +ADEALTVKLGEEA                          RP      G+  KDER DPKSKDKGSFSSGR EYRR ENGP+RSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGEEA--------------------------RPPSP--SGKKWKDERTDPKSKDKGSFSSGRPEYRRVENGPSRSRPYERFTPTTIPISEI

Query:  LTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSEHKRKELARATRREVCIIREQGPTCPI
        LTNIEDSGMEKLLKRPEKLRGAPERRSK+    A+ +  R++ R        +  RP AVINTIFGGPSGGQS HKRKELAR  RREVCIIREQGPTCPI
Subjt:  LTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQSEHKRKELARATRREVCIIREQGPTCPI

Query:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMVEFVV
        TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQM EFVV
Subjt:  TFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMVEFVV

Query:  VDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEK
        VDGRS YNAIFGRPIIHSFR IPSTLHQVLKYSTP+GVGTVRGEQT SRECYA ALKGSSVCALETLRDGTLE EADLPRKEFAAPTEELELVPLLSPEK
Subjt:  VDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEK

Query:  Q
        Q
Subjt:  Q

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGGCGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCGGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCTCTCCGCAGGTCGGCACGGATCACCGCGCCTACCCTACCGCCTGCGCACCCGAGGACGTCCAGGGCCACCCGTGGCCGAGGT
GGGACCTCTAATAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGGTGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAAGAACGTCCCAAAGACAACGAGAGTGAGGGGTACACTCACCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
AGAAAAGGGCAGTCACCATCCCGCTCCCACAAAAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCGCAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTCGGAGAATCTCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGATCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCTTCCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGATTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAGCTTGGAGAGGAGGCCCGGCCACCTTCGCCGAGTGGAAAGAAGTGGAAAGATGAAAGGACAGATCCCAAGTCCAAGGACAAGGGATCCTTC
TCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAACGGACCTAGCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACG
AACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGAAGTTCGTGGGAAAGCCCAGGACCAG
CTCAGCAGAGAAAAAGGAAGAGCGGAAGCGTTCAAGGACGCCACCCAGGCGCGCCGACCGACTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAA
TCCGAACATAAAAGAAAGGAGTTAGCCCGTGCAACCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTG
GAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATC
CTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGT
TGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGTCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGG
AGACCCATCATCCACTCATTTCGGGTCATCCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCTAGTGGCGTAGGCACGGTCCGAGGAGAACAGACCGCT
TCGAGGGAGTGCTATGCCGTCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGACGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACTGACCTCGCCAGGTCGGTCCCCGTCGAG
ATCCTAGATAATCCCTCGATCTTAGAGCCGGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGTAACTCACCA
CAAGATCCCAAGGAGCGCAGAAAGTTGGCACGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGCGGCTTTTCCCTGCCTCTGTTGAAATGC
CTAACCCCTGAAGAGGGCATAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAG
CTACGCTTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACG
CATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGTCGATCTGAAAGGAGACGTCCTCGCG
CACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGGCGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCGGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCTCTCCGCAGGTCGGCACGGATCACCGCGCCTACCCTACCGCCTGCGCACCCGAGGACGTCCAGGGCCACCCGTGGCCGAGGT
GGGACCTCTAATAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGGTGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAAGAACGTCCCAAAGACAACGAGAGTGAGGGGTACACTCACCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
AGAAAAGGGCAGTCACCATCCCGCTCCCACAAAAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCGCAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTCGGAGAATCTCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGATCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCTTCCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCG
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGATTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAGCTTGGAGAGGAGGCCCGGCCACCTTCGCCGAGTGGAAAGAAGTGGAAAGATGAAAGGACAGATCCCAAGTCCAAGGACAAGGGATCCTTC
TCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAACGGACCTAGCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACG
AACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGAAGTTCGTGGGAAAGCCCAGGACCAG
CTCAGCAGAGAAAAAGGAAGAGCGGAAGCGTTCAAGGACGCCACCCAGGCGCGCCGACCGACTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAA
TCCGAACATAAAAGAAAGGAGTTAGCCCGTGCAACCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTG
GAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATC
CTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGT
TGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGTCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGG
AGACCCATCATCCACTCATTTCGGGTCATCCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCTAGTGGCGTAGGCACGGTCCGAGGAGAACAGACCGCT
TCGAGGGAGTGCTATGCCGTCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGACGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACTGACCTCGCCAGGTCGGTCCCCGTCGAG
ATCCTAGATAATCCCTCGATCTTAGAGCCGGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGTAACTCACCA
CAAGATCCCAAGGAGCGCAGAAAGTTGGCACGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGCGGCTTTTCCCTGCCTCTGTTGAAATGC
CTAACCCCTGAAGAGGGCATAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAG
CTACGCTTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACG
CATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGTCGATCTGAAAGGAGACGTCCTCGCG
CACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVRAAAVEGQGHDGLAAEPLRRSARITAPTLPPAHPRTSRATRGRGGTSNKGARGPAPAPTSENFGALQREMEAMRTQMRS
MEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPKDNESEGYTHQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPAGIIAREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDFGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSIS
TYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEARPPSPSGKKWKDERTDPKSKDKGSF
SSGRPEYRRVENGPSRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKEVRGKAQDQLSREKGRAEAFKDATQARRPTAVINTIFGGPSGGQ
SEHKRKELARATRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEG
CIDLPVTLGQDQTRVTQMVEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPSGVGTVRGEQTASRECYAVALKGSSVCALETLRDGTLEFEADLPRKE
FAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPESSWMDPITDFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLKC
LTPEEGIVEHYEPTTNEDGLLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYILVDLKGDVLA
HPWNAEHLKRYYP