; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030031 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030031
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr8:43952502..43960231
RNA-Seq ExpressionLag0030031
SyntenyLag0030031
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602470.1 hypothetical protein SDJN03_07703, partial [Cucurbita argyrosperma subsp. sororia]1.3e-24980.03Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE+QKM ALKKAYADIILNTVKEAA RVMVS+R+ R LQQDL+S KDE+LR+LLRLKNM+DSKM EAEI SLCQRRKVEELE++LHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ+QLE EKKDKMQPLEG IMNK+T SSQS L+PD S  SSSELQT+SSNLRNTK+EKIAHAI+D VPKS+E S +SQVDIVH+HDSDS+S  VRTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        E R YGKRCTQ IRALERN+LDYSLPLGIDVKD+QVLE KEPLVKK+ KEE  L TR GKTDIGK V GAVLKRSVKLHTLRRTSQFGKCKTGSCRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIISLCRPYLKD  VRSD NEY+ PSLM D+GNV + SCLPEEHK DSY   SGNSNG   +GNMK+ NLNG SPDQPINP DK F LSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          S NLVNDNGKS +DHSK TKHQSKMKKLTCLDPGLTS  SYVDSTSVP SV AS+KVN S  LEN A +KKELMALSVKQESDEIRNLI PS E+NSE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        +NPNTKC +IC VTN SP QVDKK  L CTRQSKRKREAMGISDENISP K NGKR LGE+LKFEP+FER+D+ RESTRESRQLS+VA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

XP_022923780.1 uncharacterized protein LOC111431386 [Cucurbita moschata]4.5e-25080.37Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE+QKM ALKKAYADIILNTVKEAA RVMVS+R+ R LQQDL+S KDE+LR+LLRLKNM+DSKM EAEI SLCQRRKVEELE++LHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ+QLE EKKDKMQPLEG IMNK+T SSQS L+PDSS  SSSELQT+SSNLRNTK+EKIAHAI+D VPKS+E S VSQVDIVH+HDSDS+S  VRTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        E R YGKRCTQ IRALERN+LDYSLPLGIDVKD+QVLE KEPLVKK+ KEE  L TR GKTDIGK V GAVLKRSVKLHTLRRTSQFGKCKTGSCRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIISLCRPYLKD  VRSD NEY+ PSLM D+GNV + SCLPEEHK DSY   SGNSNG   +GNMK+ NLNG SPDQPINP DK F LSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          S NLVNDNGKS +DHSK TKHQSKMKKLTCLDPGLTS  SYVDSTSVP SV AS+KVN S  LEN A +KKELMALSVKQESDEIRNLI PS E NSE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        +NPNTKC +IC VTN SP QVDKK  L CTRQSKRKREAMGISDENISP K NGKR LGE+LKFEP+FER+D+ RESTRESRQLS+VA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

XP_022990043.1 uncharacterized protein LOC111487054 [Cucurbita maxima]4.0e-24679.86Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE+QKM ALKKAYADIILNTVKEAA RV+VSER+ R LQQDL+ VKDE+LR+LLRLKNM+DSKM EAEI SLCQRRKVEELE++LHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ+QLE EKKDKMQPLEG IMNK+T SSQS LEPDSS  SSSELQT+SSNLRNTK+EKIAHAI+D VPKS+E S VSQVDIVH+HDSDS+S  VRTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        E R YGKRCTQ IRALE+N+LDYSLPLGIDVKD+QVLE KEPLVKK+ KEE  L TR GKTDIGK V GAVLKRSVKLHTLRRTSQFGKCKTGSCRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIISLCRPYLKD  VRSD N Y++PSLM D+GNV + SCLPEEHK DSY   SGNSNG   + NMK+GNLNG SPDQPI P DK F LSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          S NLVNDN KS +DHSK TKHQSKMKKLTCLDPGLTS  SYVDSTSVP SV ASVKVN S VLENAA SKK+LMALSVKQESDEIRNLI PS E  SE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        +NPNTKC RIC  TN SP +VDKK  L CTRQ KRKREAMGISDENISP K NGKR LGE+LKFEP+FER+D+ RESTRESRQLS+VA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

XP_023523812.1 uncharacterized protein LOC111787938 [Cucurbita pepo subsp. pepo]4.2e-24880.03Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE+QKM ALKKAYADIILNTVKEAA RV+VSER+ R LQQDL+S KDE+LR+LLRLKNM+DSKM EAEI SLCQRRKVEELE++LHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ+QLE EKKDKMQPLEG IMNK+T SSQS LEPDSS  SSSELQT+SSNLRNTK+EKIAHA++D VPKS+E S VSQVDIVH+HDSDS+S  VRTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        E R YGKRCTQ IRALERN+LDYSLPLGIDVKD+QVLE KEPLVKK+ KEE  L TR GKTDIGK V GAVLKRSVKLHTLRRTSQFGKCKTGSCRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPH+PSCIISLCRPYLKD  VRSD NEY+ PSLM D+GNV + SCLPEEHK DSY   SGNSNG   +GNMK+GNLNG SPDQPI  CDK F LSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          S NLVNDNGKS +DHSK TKH+SKMKKLTCLDPGLTS  SYVDSTSVP SV ASVKVN S VLENAA SKKELMAL+VKQESDEIRNLI PS + NSE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        +NPNTK  +IC  TN SP QVDKK  L CTRQSKRKREAMGISDENISP K NGKR LGE+LKFEP+FER+D+ RESTRESRQLS+VA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

XP_038889750.1 uncharacterized protein LOC120079594 isoform X1 [Benincasa hispida]3.8e-26584.09Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE QKM+ALKKAYADIILNTVKEAAARVMVSERK  CLQQ+L SV DEALRMLLRLKNMIDSKMHEAEI SLCQR++VEELE+QLHEAE VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ QLE EKKDKMQPLEGKI+NK+TFSS+STLEPDSSS SSSELQT+SSNL+NTK+E+IAHA+Y+ +PKS+E S VSQVDIVHSHDSDSTST VR K
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        EHRL+ KRCTQ IR+LERNF DYSLPLGIDVKDSQVLE KE LV++RDKEER L TR GKTDI KNV GAVLKRSVKLHTLRRTS+FGKCKTGSCRL  +
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIISLCRP+LKDG VR DK+EY  PSLM D+GNVTQNS LPEEHKID+YKD SGNSNGG  KGNMK+GNLNG SPDQPINPCDKSFVLSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
        R SINLVNDN KSSEDHS + KHQ KMKKLTCLDPGLTS  SYVDSTSVP SV ASVKVN+SRVLENAA SKK+LMALSVKQESDEI NLIFPSS  NSE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        MNPNTKCE ICEVTN SP QVDKKMFL+CTRQSKRKREA+GISDENISP KSN KRRLGE+L FEPEFERS++IRESTRESR LSQVA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

TrEMBL top hitse value%identityAlignment
A0A0A0KS55 Uncharacterized protein6.2e-23776.99Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        M+DE QKM+ALK AYADIILNTVKEAAARVMVSERKV CLQQDL+SVKDEALRMLLRLKNMIDSKMHEAEI SLCQRRKVEELE++LH+AE+VITDLR+Q
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEA+ QLE EKKDKMQP+EGKIMNK+TFSS+STLEPDSS  SS ELQT+SSNL +TK+E+IA A+Y+ VP+S++    S VDI+HSHDSDSTST +R K
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERALT-RAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        EHRL  KRC Q IR+LERNFLDY LPLGIDVKDSQVLE KEPLVK+R+KEER L+ R GKTDI KN  GAVLKRSVKLH+LR TSQFGKCKTG+CRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERALT-RAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIIS+C+P LKDG VRSDK+EY  P LM  +G VT +S  PEEHKI SYKD S +S     KGNMK GNLNG SPDQPINPCDKSFVLSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          SINLVNDN KS EDH  +TKHQ KMKKLTCLDP L S ESY+DS SV  SV ASVKVN+S+VLENAA SKK+L AL+VKQ+S  IRNLIFPSS+ NSE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        MNP+TKCE+ICEVTN SP QVDKKM L+CTRQ KRKREA+ ISDENISP KSN K  L E+LKFEPEFERS++I ESTRESRQLSQVA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

A0A1S3C921 uncharacterized protein LOC1034980343.0e-23175.59Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        M+DE QKM+ALK AYADIILNTVKEAAARVMVS+RKV CLQQDL SVKDEALRMLLRLKNMIDSKMHEAEI SLCQRRKVEELE++LH+AE+VITDLR+Q
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEA+ QLE EKKDKMQP EGKIMNK+TFSS+STLEPDSS  SS ELQT SSNL +TK+E+IA A+Y+ VP+S++ S  S VDIVHSHDSDSTST +R K
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERALTRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDSQ
        EHRL  KRC Q IR+LERNF DY LPLGIDVKDSQVLE KEPLVK+R+KEER L+                 RSVKLHTLR+TSQFGKCKTG+CRLH SQ
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERALTRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDSQ

Query:  LTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPCR
        LTKPHHPSCIIS+C+P+LKDG VRS+K+EY  P LM  +GNVT NS  PEEHKIDSYKD SG+S  G  KGNMK GNLNG SPDQPINPC KSF  SPC 
Subjt:  LTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPCR

Query:  NSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSEM
         SINLV+DN KS E HS +TKHQ+KMKKLTCLDPGL S  SY+DS SV  SV ASVKVN+S+VLENAA SKK+  AL+VKQES  IRNLIFPSS+ NSEM
Subjt:  NSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSEM

Query:  NPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        NP+TKCE+I EVTN SP QVDKKMFL+C+RQSKRKREA+ ISDENISP KSN KRRL E+LKFEPEFERS++IRESTRESRQLSQVA Q+
Subjt:  NPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

A0A6J1BYJ8 uncharacterized protein LOC1110067581.7e-23174.46Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE QKMVAL+KAYADIILNTVKEAAARVMVSERK    QQ+L SVKDEAL+MLLRLK MIDSKMHE+E+ASLCQRRKVEELE+QLHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEA Y LE EKKDKMQP +GKIMNK+T SS+STLEPD+SS SSSE+QT+SSNLRNTKI++IAHA  D VPKS ++SPVS  DIV+SHDSDSTST+ RTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERALTRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDSQ
        E  L  KRCTQ  R+  RNFLDY LPLGIDV+  QVLE  EP            T  GKTD GKNV GAVLKRSVKLHTLRR SQF KCKT SCRLH SQ
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERALTRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDSQ

Query:  LTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPCR
          KPHHPSC++SLC+PYLKDG VRS KN Y  PSLM D+GNVTQNSCL EE+ +DSYKDV+GNS GG LKGNMK GNLNG SPDQ I PCDKSFVLSPC 
Subjt:  LTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPCR

Query:  NSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSEM
         SINL ND+G+S E HS VTKHQ KMK+LTCLDPGLTS  SYVD TSV TSV AS KVNRSR+ +N+A SK+E+MALSVKQES EIRNLIFPSSE NSEM
Subjt:  NSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSEM

Query:  NPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI----QLRWS
        N NTK E+ICEVTN SP  VD+KM LKCT QSKRKREAM ISDENISP KS GKRRL E+LK EPE +RS++ RES+ E RQLSQVA Q+      RWS
Subjt:  NPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI----QLRWS

A0A6J1ECW0 uncharacterized protein LOC1114313862.2e-25080.37Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE+QKM ALKKAYADIILNTVKEAA RVMVS+R+ R LQQDL+S KDE+LR+LLRLKNM+DSKM EAEI SLCQRRKVEELE++LHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ+QLE EKKDKMQPLEG IMNK+T SSQS L+PDSS  SSSELQT+SSNLRNTK+EKIAHAI+D VPKS+E S VSQVDIVH+HDSDS+S  VRTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        E R YGKRCTQ IRALERN+LDYSLPLGIDVKD+QVLE KEPLVKK+ KEE  L TR GKTDIGK V GAVLKRSVKLHTLRRTSQFGKCKTGSCRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIISLCRPYLKD  VRSD NEY+ PSLM D+GNV + SCLPEEHK DSY   SGNSNG   +GNMK+ NLNG SPDQPINP DK F LSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          S NLVNDNGKS +DHSK TKHQSKMKKLTCLDPGLTS  SYVDSTSVP SV AS+KVN S  LEN A +KKELMALSVKQESDEIRNLI PS E NSE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        +NPNTKC +IC VTN SP QVDKK  L CTRQSKRKREAMGISDENISP K NGKR LGE+LKFEP+FER+D+ RESTRESRQLS+VA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

A0A6J1JS43 uncharacterized protein LOC1114870541.9e-24679.86Show/hide
Query:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ
        MEDE+QKM ALKKAYADIILNTVKEAA RV+VSER+ R LQQDL+ VKDE+LR+LLRLKNM+DSKM EAEI SLCQRRKVEELE++LHEAE+VITDLRIQ
Subjt:  MEDEDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQ

Query:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK
        LKEAQ+QLE EKKDKMQPLEG IMNK+T SSQS LEPDSS  SSSELQT+SSNLRNTK+EKIAHAI+D VPKS+E S VSQVDIVH+HDSDS+S  VRTK
Subjt:  LKEAQYQLEMEKKDKMQPLEGKIMNKMTFSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTK

Query:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS
        E R YGKRCTQ IRALE+N+LDYSLPLGIDVKD+QVLE KEPLVKK+ KEE  L TR GKTDIGK V GAVLKRSVKLHTLRRTSQFGKCKTGSCRLH S
Subjt:  EHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLEVKEPLVKKRDKEERAL-TRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDS

Query:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC
        QLTKPHHPSCIISLCRPYLKD  VRSD N Y++PSLM D+GNV + SCLPEEHK DSY   SGNSNG   + NMK+GNLNG SPDQPI P DK F LSPC
Subjt:  QLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCLPEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPC

Query:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE
          S NLVNDN KS +DHSK TKHQSKMKKLTCLDPGLTS  SYVDSTSVP SV ASVKVN S VLENAA SKK+LMALSVKQESDEIRNLI PS E  SE
Subjt:  RNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKVNRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSE

Query:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI
        +NPNTKC RIC  TN SP +VDKK  L CTRQ KRKREAMGISDENISP K NGKR LGE+LKFEP+FER+D+ RESTRESRQLS+VA Q+
Subjt:  MNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFERSDMIRESTRESRQLSQVACQI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19010.1 unknown protein5.1e-2659.43Show/hide
Query:  EDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKE
        + +K+ ALKKAYA+ ILNT KEAAARVM++ERK R  QQ+L SV+DEALR  LRLK + DSK+ EAE+ SL +++K+EELE+QL EAE+++ +LR +L+E
Subjt:  EDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKE

Query:  AQYQLE
        ++Y LE
Subjt:  AQYQLE

AT1G19010.2 unknown protein1.4e-1555.7Show/hide
Query:  MVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKEAQYQLE
        M++ERK R  QQ+L SV+DEALR  LRLK + DSK+ EAE+ SL +++K+EELE+QL EAE+++ +LR +L+E++Y LE
Subjt:  MVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKEAQYQLE

AT1G74860.1 unknown protein3.1e-2355.66Show/hide
Query:  EDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKE
        + + + ALK+AYAD ILNT KEAAARVMVSE+K R  QQ+L +V++EAL  L+RLK M+DSK+ E E+ SL Q++KVEELE+QL EAE+++ +LR++L+ 
Subjt:  EDQKMVALKKAYADIILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKE

Query:  AQYQLE
           +L+
Subjt:  AQYQLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATCCAACTACATATCCGAATATGGGAGGCTCAACCAAGTATTTGTAAGGGAGGGGTGAGAACGACAACTGGTGTTATTGGTCTTCCTGTAACTTGGTCACCACT
ACCCCATAATCACTTCAAGGTGAATGTGGATATAACAATAACTAATGAAGTTCAAGCTCTTATAGATGGGGTAACACTAGCGTTGGAAGCAGTAATTACACTTGTTTGGG
CGGAGGTAGATTCTAAAATCGTTTGGAATCTTCTTAATGACCAAGACCAACATTTCAATGAGATATCTCCCCTTACTGACTACCTCAAGCAACTATTTCATGCTCATTAT
CACATTAGGGGTTTCCTATTTACCCCTAGAGATGAAAATAAAGTTGCTCACAAGTTAGCTTCTCACACCCGCACAATGAAACACTATGAAGTCTGGCTAGAAGAATGTCC
CCCTTGGACTCACAACATCTTAATTCACGAGATGCCTCAAACCAATGGCACTTTCAAGAGATTCCAATTCAACCGAGAGCTTTTCAGTTCAGTTCTTCAGTTGATCTTCA
GTTCTCCTTCTGCTCCATTTGGTTTCTCTCAGCTCGAAGTTTTTATTTATTCAGCCATGGAGGACGAAGACCAGAAAATGGTGGCGCTCAAGAAGGCTTATGCTGACATA
ATTCTGAACACGGTGAAGGAGGCAGCGGCTCGCGTAATGGTATCCGAGCGCAAAGTTCGTTGTCTTCAGCAGGATCTTACTTCTGTAAAGGACGAGGCGCTTAGAATGCT
CCTTCGCTTGAAGAACATGATCGATTCTAAGATGCATGAAGCAGAAATAGCATCATTATGCCAGAGGAGAAAAGTTGAAGAGCTTGAATCACAGCTTCACGAGGCAGAGG
AAGTGATAACAGATCTTAGAATACAGTTGAAAGAGGCACAATACCAGTTGGAGATGGAGAAGAAAGACAAAATGCAGCCTTTAGAAGGGAAGATTATGAATAAAATGACA
TTCTCTAGTCAGTCTACCCTTGAACCGGATTCTTCCAGTCTTTCCTCATCAGAACTGCAAACAATGTCATCTAATTTGAGGAACACGAAGATAGAAAAGATAGCCCATGC
AATATATGATTGGGTGCCAAAGTCTATGGAACAATCACCAGTTTCTCAAGTGGACATTGTCCATTCTCATGATTCTGACTCTACCTCCACGGACGTGAGAACCAAGGAGC
ACAGATTATATGGGAAGAGATGCACCCAGACAATTCGTGCATTAGAAAGGAATTTCCTGGATTATAGCTTACCTCTAGGTATTGATGTAAAAGATAGCCAAGTCCTTGAG
GTAAAGGAACCTTTAGTAAAAAAGAGGGACAAGGAGGAAAGAGCCCTCACAAGAGCTGGAAAGACAGATATAGGGAAGAATGTACTTGGTGCTGTGTTGAAGAGATCTGT
AAAGCTCCATACATTAAGAAGGACATCTCAATTTGGAAAATGTAAAACAGGCTCTTGCAGGTTGCATGATAGTCAACTCACGAAACCTCATCATCCATCATGCATCATTT
CCCTTTGCAGGCCATATTTGAAAGATGGTGGCGTGAGATCTGATAAAAATGAATATGCTACACCATCTCTGATGACTGATAATGGGAATGTGACACAAAATTCTTGTTTG
CCGGAGGAGCATAAAATAGACAGTTACAAAGATGTAAGTGGCAATTCTAATGGAGGGACACTGAAAGGGAACATGAAAAAAGGGAATTTAAATGGTCCCTCGCCTGATCA
GCCAATCAATCCATGTGATAAGTCTTTTGTCCTGTCTCCATGTAGGAATTCCATTAATCTAGTTAATGATAATGGGAAATCTAGTGAAGATCATTCCAAAGTAACTAAGC
ATCAGAGCAAGATGAAGAAATTAACCTGCTTGGATCCAGGACTCACGTCAATTGAAAGTTATGTTGATTCTACATCAGTGCCTACTAGTGTTATTGCCAGTGTTAAGGTC
AATAGATCCAGAGTCCTCGAGAATGCAGCAACCAGTAAGAAAGAATTGATGGCATTATCAGTGAAACAGGAAAGTGATGAAATAAGGAATTTGATTTTTCCAAGTTCTGA
GTGGAACTCTGAAATGAATCCTAATACAAAATGTGAGCGAATATGTGAAGTTACAAATCGCTCTCCTCGCCAAGTAGACAAGAAAATGTTTCTCAAATGTACTAGACAAA
GCAAACGCAAGAGGGAAGCCATGGGCATTTCTGATGAAAATATTTCTCCATTAAAGAGCAATGGAAAAAGGAGGTTGGGTGAGAGACTAAAATTTGAGCCCGAGTTTGAG
AGGTCTGACATGATCAGAGAATCAACGAGGGAGAGTCGACAGCTGTCCCAAGTTGCTTGTCAGATTCAGTTGAGATGGAGTGGAGATCACCAATATGAACTTCGTGAGAA
TAGTCTAACAATGATGAACGACGGAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCATCCAACTACATATCCGAATATGGGAGGCTCAACCAAGTATTTGTAAGGGAGGGGTGAGAACGACAACTGGTGTTATTGGTCTTCCTGTAACTTGGTCACCACT
ACCCCATAATCACTTCAAGGTGAATGTGGATATAACAATAACTAATGAAGTTCAAGCTCTTATAGATGGGGTAACACTAGCGTTGGAAGCAGTAATTACACTTGTTTGGG
CGGAGGTAGATTCTAAAATCGTTTGGAATCTTCTTAATGACCAAGACCAACATTTCAATGAGATATCTCCCCTTACTGACTACCTCAAGCAACTATTTCATGCTCATTAT
CACATTAGGGGTTTCCTATTTACCCCTAGAGATGAAAATAAAGTTGCTCACAAGTTAGCTTCTCACACCCGCACAATGAAACACTATGAAGTCTGGCTAGAAGAATGTCC
CCCTTGGACTCACAACATCTTAATTCACGAGATGCCTCAAACCAATGGCACTTTCAAGAGATTCCAATTCAACCGAGAGCTTTTCAGTTCAGTTCTTCAGTTGATCTTCA
GTTCTCCTTCTGCTCCATTTGGTTTCTCTCAGCTCGAAGTTTTTATTTATTCAGCCATGGAGGACGAAGACCAGAAAATGGTGGCGCTCAAGAAGGCTTATGCTGACATA
ATTCTGAACACGGTGAAGGAGGCAGCGGCTCGCGTAATGGTATCCGAGCGCAAAGTTCGTTGTCTTCAGCAGGATCTTACTTCTGTAAAGGACGAGGCGCTTAGAATGCT
CCTTCGCTTGAAGAACATGATCGATTCTAAGATGCATGAAGCAGAAATAGCATCATTATGCCAGAGGAGAAAAGTTGAAGAGCTTGAATCACAGCTTCACGAGGCAGAGG
AAGTGATAACAGATCTTAGAATACAGTTGAAAGAGGCACAATACCAGTTGGAGATGGAGAAGAAAGACAAAATGCAGCCTTTAGAAGGGAAGATTATGAATAAAATGACA
TTCTCTAGTCAGTCTACCCTTGAACCGGATTCTTCCAGTCTTTCCTCATCAGAACTGCAAACAATGTCATCTAATTTGAGGAACACGAAGATAGAAAAGATAGCCCATGC
AATATATGATTGGGTGCCAAAGTCTATGGAACAATCACCAGTTTCTCAAGTGGACATTGTCCATTCTCATGATTCTGACTCTACCTCCACGGACGTGAGAACCAAGGAGC
ACAGATTATATGGGAAGAGATGCACCCAGACAATTCGTGCATTAGAAAGGAATTTCCTGGATTATAGCTTACCTCTAGGTATTGATGTAAAAGATAGCCAAGTCCTTGAG
GTAAAGGAACCTTTAGTAAAAAAGAGGGACAAGGAGGAAAGAGCCCTCACAAGAGCTGGAAAGACAGATATAGGGAAGAATGTACTTGGTGCTGTGTTGAAGAGATCTGT
AAAGCTCCATACATTAAGAAGGACATCTCAATTTGGAAAATGTAAAACAGGCTCTTGCAGGTTGCATGATAGTCAACTCACGAAACCTCATCATCCATCATGCATCATTT
CCCTTTGCAGGCCATATTTGAAAGATGGTGGCGTGAGATCTGATAAAAATGAATATGCTACACCATCTCTGATGACTGATAATGGGAATGTGACACAAAATTCTTGTTTG
CCGGAGGAGCATAAAATAGACAGTTACAAAGATGTAAGTGGCAATTCTAATGGAGGGACACTGAAAGGGAACATGAAAAAAGGGAATTTAAATGGTCCCTCGCCTGATCA
GCCAATCAATCCATGTGATAAGTCTTTTGTCCTGTCTCCATGTAGGAATTCCATTAATCTAGTTAATGATAATGGGAAATCTAGTGAAGATCATTCCAAAGTAACTAAGC
ATCAGAGCAAGATGAAGAAATTAACCTGCTTGGATCCAGGACTCACGTCAATTGAAAGTTATGTTGATTCTACATCAGTGCCTACTAGTGTTATTGCCAGTGTTAAGGTC
AATAGATCCAGAGTCCTCGAGAATGCAGCAACCAGTAAGAAAGAATTGATGGCATTATCAGTGAAACAGGAAAGTGATGAAATAAGGAATTTGATTTTTCCAAGTTCTGA
GTGGAACTCTGAAATGAATCCTAATACAAAATGTGAGCGAATATGTGAAGTTACAAATCGCTCTCCTCGCCAAGTAGACAAGAAAATGTTTCTCAAATGTACTAGACAAA
GCAAACGCAAGAGGGAAGCCATGGGCATTTCTGATGAAAATATTTCTCCATTAAAGAGCAATGGAAAAAGGAGGTTGGGTGAGAGACTAAAATTTGAGCCCGAGTTTGAG
AGGTCTGACATGATCAGAGAATCAACGAGGGAGAGTCGACAGCTGTCCCAAGTTGCTTGTCAGATTCAGTTGAGATGGAGTGGAGATCACCAATATGAACTTCGTGAGAA
TAGTCTAACAATGATGAACGACGGAGGTTAA
Protein sequenceShow/hide protein sequence
MSIQLHIRIWEAQPSICKGGVRTTTGVIGLPVTWSPLPHNHFKVNVDITITNEVQALIDGVTLALEAVITLVWAEVDSKIVWNLLNDQDQHFNEISPLTDYLKQLFHAHY
HIRGFLFTPRDENKVAHKLASHTRTMKHYEVWLEECPPWTHNILIHEMPQTNGTFKRFQFNRELFSSVLQLIFSSPSAPFGFSQLEVFIYSAMEDEDQKMVALKKAYADI
ILNTVKEAAARVMVSERKVRCLQQDLTSVKDEALRMLLRLKNMIDSKMHEAEIASLCQRRKVEELESQLHEAEEVITDLRIQLKEAQYQLEMEKKDKMQPLEGKIMNKMT
FSSQSTLEPDSSSLSSSELQTMSSNLRNTKIEKIAHAIYDWVPKSMEQSPVSQVDIVHSHDSDSTSTDVRTKEHRLYGKRCTQTIRALERNFLDYSLPLGIDVKDSQVLE
VKEPLVKKRDKEERALTRAGKTDIGKNVLGAVLKRSVKLHTLRRTSQFGKCKTGSCRLHDSQLTKPHHPSCIISLCRPYLKDGGVRSDKNEYATPSLMTDNGNVTQNSCL
PEEHKIDSYKDVSGNSNGGTLKGNMKKGNLNGPSPDQPINPCDKSFVLSPCRNSINLVNDNGKSSEDHSKVTKHQSKMKKLTCLDPGLTSIESYVDSTSVPTSVIASVKV
NRSRVLENAATSKKELMALSVKQESDEIRNLIFPSSEWNSEMNPNTKCERICEVTNRSPRQVDKKMFLKCTRQSKRKREAMGISDENISPLKSNGKRRLGERLKFEPEFE
RSDMIRESTRESRQLSQVACQIQLRWSGDHQYELRENSLTMMNDGG