; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G000200 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G000200
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCmo_Chr12:125061..127577
RNA-Seq ExpressionCmoCh12G000200
SyntenyCmoCh12G000200
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585174.1 hypothetical protein SDJN03_17907, partial [Cucurbita argyrosperma subsp. sororia]2.3e-25098.8Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MASACFF TPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
        DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRA SFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
Subjt:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTS LDGGQFDISLMVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW

Query:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
        WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGR GGEGFGKASYFR LQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
Subjt:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT

Query:  DWGNYFYYGGPGRNQRCP
        DWGNYFYYGGPGRNQRCP
Subjt:  DWGNYFYYGGPGRNQRCP

XP_022951496.1 uncharacterized protein LOC111454297 isoform X1 [Cucurbita moschata]7.4e-24992.68Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  ---------------------------------DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSS
                                         DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSS
Subjt:  ---------------------------------DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSS

Query:  DGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
        DGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
Subjt:  DGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK

Query:  VAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKL
        VAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKL
Subjt:  VAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKL

Query:  QVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP
        QVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP
Subjt:  QVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP

XP_022951497.1 uncharacterized protein LOC111454297 isoform X2 [Cucurbita moschata]5.9e-254100Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
        DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
Subjt:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW

Query:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
        WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
Subjt:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT

Query:  DWGNYFYYGGPGRNQRCP
        DWGNYFYYGGPGRNQRCP
Subjt:  DWGNYFYYGGPGRNQRCP

XP_023002762.1 uncharacterized protein LOC111496524 [Cucurbita maxima]1.1e-24998.56Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MA ACFFNTPQTIAILVPLLLLLL+FTITPVY SHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
        DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETS DGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
Subjt:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW

Query:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
        WLE GAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFR LQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
Subjt:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT

Query:  DWGNYFYYGGPGRNQRC
        DWGNYFYYGGPGRNQRC
Subjt:  DWGNYFYYGGPGRNQRC

XP_023538052.1 uncharacterized protein LOC111798929 isoform X2 [Cucurbita pepo subsp. pepo]5.7e-24997.37Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MASACFFNTPQTIAILVPLLLLLL+F +TPVYSSHIPATNQTFHPQQE NKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
        DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKV+RWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
Subjt:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNT+EAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW

Query:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
        WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNS PSGFH+STEMGSG FGGEGFGKASYFR LQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
Subjt:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT

Query:  DWGNYFYYGGPGRNQRCP
        DWGNYFYYGGPGR+QRCP
Subjt:  DWGNYFYYGGPGRNQRCP

TrEMBL top hitse value%identityAlignment
A0A1S3BA82 LOW QUALITY PROTEIN: uncharacterized protein LOC1034877141.1e-22187.65Show/hide
Query:  MASA-CFFNTPQTIAILVPLLLLLLSFTITPVYSSH-IPATNQT-FHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQ
        MASA CF N   TI I V  L  LL+ TITPV+S H  P+TNQT FHPQQELNKLKMIRAHLDK+NKPAVHTIQSPDGD+IDCVLSHHQPAFDHP L+ Q
Subjt:  MASA-CFFNTPQTIAILVPLLLLLLSFTITPVYSSH-IPATNQT-FHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQ

Query:  KPLDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVA
         PLDPPERP+GHKPPRTE  SFQLWSM+GE CPEGTVPIRRTTEED+LRATSFQMFG+KVR+WVRRETSSDGHEHAVGYVTG HYFGAKASINVWAPRVA
Subjt:  KPLDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVA

Query:  DQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKH
        DQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK+AIGA ISPTSS +GGQFDISL+VWKDPKH
Subjt:  DQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKH

Query:  GNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAG
        GNWWLEFG+GVLVGYWPSFLFTHLQ HATMVQFGGEVVNSSPSGFHT+TEMGSG F GEGFGKASYFR LQVVDWDNSLVPL+NL+VLADHP CYDIE G
Subjt:  GNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAG

Query:  MNTDWGNYFYYGGPGRNQRCP
        +NT WGNYFYYGGPGRN RCP
Subjt:  MNTDWGNYFYYGGPGRNQRCP

A0A6J1GHR2 uncharacterized protein LOC111454297 isoform X13.6e-24992.68Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  ---------------------------------DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSS
                                         DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSS
Subjt:  ---------------------------------DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSS

Query:  DGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
        DGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
Subjt:  DGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK

Query:  VAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKL
        VAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKL
Subjt:  VAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKL

Query:  QVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP
        QVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP
Subjt:  QVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP

A0A6J1GHT7 uncharacterized protein LOC111454297 isoform X22.8e-254100Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
        DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
Subjt:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW

Query:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
        WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
Subjt:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT

Query:  DWGNYFYYGGPGRNQRCP
        DWGNYFYYGGPGRNQRCP
Subjt:  DWGNYFYYGGPGRNQRCP

A0A6J1HCZ9 uncharacterized protein LOC1114624882.9e-22287.86Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSS--HIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQK
        MAS+  FN  Q I I   LLLLLL+ TITP++SS     ATNQTFHPQQELNKLKMIRA LD +NKPA+HTIQSPDGDLIDCVLSH QPAFDHP LK Q 
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSS--HIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQK

Query:  PLDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVAD
        PLDPPERPEGHKPPRT   SFQLWSMNGE CPEGTVPIRRTTEED+LRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTG+HYFGAKASINVWAPRVAD
Subjt:  PLDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVAD

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK+AIGA ISPTSS DGGQFDISL+VWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHG

Query:  NWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGM
        NWWLEFG+GVLVGYWPSFLFTHLQ HATMVQFGGEVVNSSPSGFHT+TEMGSG F GEGFGKASYFR LQVVDWDNSLVPL+NL+VLADHP CYDI+ G+
Subjt:  NWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGM

Query:  NTDWGNYFYYGGPGRNQRCP
        NT WGNYFYYGGPGRN RCP
Subjt:  NTDWGNYFYYGGPGRNQRCP

A0A6J1KKE5 uncharacterized protein LOC1114965245.6e-25098.56Show/hide
Query:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
        MA ACFFNTPQTIAILVPLLLLLL+FTITPVY SHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL
Subjt:  MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPL

Query:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
        DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETS DGHEHAVGYVTGNHYFGAKASINVWAPRVADQY
Subjt:  DPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNW

Query:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
        WLE GAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFR LQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT
Subjt:  WLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNT

Query:  DWGNYFYYGGPGRNQRC
        DWGNYFYYGGPGRNQRC
Subjt:  DWGNYFYYGGPGRNQRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)2.8e-18573.5Show/hide
Query:  LLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPLDPPERPEGHKPPRTEIGS
        L LLLLS + + V S ++   NQT  P  ELNKLK I  HL K+NKP++ TI SPDGD+IDCVL HHQPAFDHP+L+ QKPLDPPERP GH        S
Subjt:  LLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPLDPPERPEGHKPPRTEIGS

Query:  FQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDL
        FQLW M GETCPEGTVPIRRT EEDILRA S   FG+K+R + RR+TSS+GHEHAVGYV+G  Y+GAKASINVWAP+V +QYEFSLSQ+W+ISGSFG+DL
Subjt:  FQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDL

Query:  NTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLF
        NTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN+++AIGA ISP+SS  GGQFDI+L++WKDPKHGNWWLEFG+G+LVGYWPSFLF
Subjt:  NTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPSFLF

Query:  THLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP
        THL++HA+MVQ+GGE+VNSSP G HTST+MGSG F  EGF K+SYFR +QVVDWDN+LVP  NL VLADHP CYDI+ G N  WG+YFYYGGPG+N +CP
Subjt:  THLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP

AT1G23340.1 Protein of Unknown Function (DUF239)2.0e-17567.06Show/hide
Query:  ASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQT--FHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKP
        +S+C F T           +LLLS  +   Y+S   +T++T    PQ+E+ K+K+IR  L K+NKPA+ TI S DGD IDCV SHHQPAFDHP L+ Q+P
Subjt:  ASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQT--FHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKP

Query:  LDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQ
        +DPPE P G+        +FQLWS+ GE+CPEGT+PIRRTTE+D+LRA S + FGRK+RR VRR++SS+GHEHAVGYV+G+ Y+GAKASINVW PRV  Q
Subjt:  LDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQ

Query:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGN
        YEFSLSQ+W+I+GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN++AIGA ISP SS  GGQFDISL++WKDPKHG+
Subjt:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGN

Query:  WWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMN
        WWL+FG+G LVGYWP  LFTHL++H  MVQFGGE+VN+ P G HTST+MGSG F GEGFGKASYFR LQ+VDWDN+L+P++NL VLADHP CYDI  G+N
Subjt:  WWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMN

Query:  TDWGNYFYYGGPGRNQRCP
          WGN+FYYGGPG+N +CP
Subjt:  TDWGNYFYYGGPGRNQRCP

AT1G23340.2 Protein of Unknown Function (DUF239)2.0e-17567.06Show/hide
Query:  ASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQT--FHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKP
        +S+C F T           +LLLS  +   Y+S   +T++T    PQ+E+ K+K+IR  L K+NKPA+ TI S DGD IDCV SHHQPAFDHP L+ Q+P
Subjt:  ASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQT--FHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKP

Query:  LDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQ
        +DPPE P G+        +FQLWS+ GE+CPEGT+PIRRTTE+D+LRA S + FGRK+RR VRR++SS+GHEHAVGYV+G+ Y+GAKASINVW PRV  Q
Subjt:  LDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQ

Query:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGN
        YEFSLSQ+W+I+GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN++AIGA ISP SS  GGQFDISL++WKDPKHG+
Subjt:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGN

Query:  WWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMN
        WWL+FG+G LVGYWP  LFTHL++H  MVQFGGE+VN+ P G HTST+MGSG F GEGFGKASYFR LQ+VDWDN+L+P++NL VLADHP CYDI  G+N
Subjt:  WWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMN

Query:  TDWGNYFYYGGPGRNQRCP
          WGN+FYYGGPG+N +CP
Subjt:  TDWGNYFYYGGPGRNQRCP

AT1G70550.1 Protein of Unknown Function (DUF239)1.5e-17866.82Show/hide
Query:  SACFFNTPQTIAI------------LVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFD
        S CF ++ Q ++             L+ LL L+ S   +   SS+  A +QT  PQ+EL KL +IR  LDK+NKPAV TIQS DGD IDCV +H QPAFD
Subjt:  SACFFNTPQTIAI------------LVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFD

Query:  HPNLKAQKPLDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASIN
        HP L+ QKPLDPPE P+G+        + QLWS++GE+CPEGT+PIRRTTE+D+LRA+S Q FGRK+RR V+R+++++GHEHAVGYVTG  Y+GAKASIN
Subjt:  HPNLKAQKPLDPPERPEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASIN

Query:  VWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLM
        VW+PRV  QYEFSLSQ+WVI+GSF  DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN ++AIGA ISP SS  GGQFDISL+
Subjt:  VWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLM

Query:  VWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPE
        +WKDPKHG+WWL+FG+G LVGYWP+FLFTHL+QH +MVQFGGE+VN+ P G HT+T+MGSG F GEGFGKASYFR LQ+VDWDN+L+P +NL +LADHP 
Subjt:  VWKDPKHGNWWLEFGAGVLVGYWPSFLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPE

Query:  CYDIEAGMNTDWGNYFYYGGPGRNQRCP
        CYDI  G N  WGNYFYYGGPG+N RCP
Subjt:  CYDIEAGMNTDWGNYFYYGGPGRNQRCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.9e-17869.98Show/hide
Query:  LVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPLDPPERPEGHKPPRTE
        L+ LL L+ S   +   SS+  A +QT  PQ+EL KL +IR  LDK+NKPAV TIQS DGD IDCV +H QPAFDHP L+ QKPLDPPE P+G+      
Subjt:  LVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPLDPPERPEGHKPPRTE

Query:  IGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFG
          + QLWS++GE+CPEGT+PIRRTTE+D+LRA+S Q FGRK+RR V+R+++++GHEHAVGYVTG  Y+GAKASINVW+PRV  QYEFSLSQ+WVI+GSF 
Subjt:  IGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFG

Query:  DDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPS
         DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN ++AIGA ISP SS  GGQFDISL++WKDPKHG+WWL+FG+G LVGYWP+
Subjt:  DDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPS

Query:  FLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQ
        FLFTHL+QH +MVQFGGE+VN+ P G HT+T+MGSG F GEGFGKASYFR LQ+VDWDN+L+P +NL +LADHP CYDI  G N  WGNYFYYGGPG+N 
Subjt:  FLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQ

Query:  RCP
        RCP
Subjt:  RCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCGCCTGCTTCTTCAACACTCCCCAGACCATTGCCATTCTTGTGCCGCTGCTTTTGCTTCTTCTTTCTTTCACAATAACTCCGGTATACTCGTCTCAT
ATTCCTGCCACCAACCAGACTTTCCATCCCCAACAGGAATTGAACAAGTTGAAGATGATAAGAGCTCATCTGGATAAGGTCAACAAGCCTGCTGTCCACACAATT
CAGAGTCCTGACGGGGATCTCATTGATTGTGTTTTATCTCACCACCAACCAGCATTTGACCATCCGAACTTGAAGGCCCAAAAACCCTTGGATCCACCAGAGAGA
CCAGAAGGACATAAGCCGCCTAGGACGGAGATAGGAAGCTTCCAATTATGGAGCATGAATGGGGAAACTTGCCCGGAAGGAACAGTTCCAATAAGAAGAACAACG
GAGGAAGACATACTCAGAGCTACCTCTTTTCAGATGTTTGGAAGAAAAGTAAGAAGATGGGTTAGGAGAGAAACAAGCAGCGACGGGCATGAGCACGCTGTGGGG
TATGTGACCGGCAATCACTACTTCGGTGCAAAGGCAAGCATCAATGTGTGGGCACCTCGAGTCGCCGATCAGTATGAATTCAGCTTATCGCAAATGTGGGTCATC
TCCGGCTCTTTCGGCGACGATCTCAACACCATCGAAGCCGGTTGGCAGGTTAGTCCAGAACTGTACGGAGACAATTACCCAAGATTCTTTACTTACTGGACGTCG
GACGCATACCAAGCAACCGGATGTTACAATCTGCTATGCTCGGGGTTTGTCCAAACAAATAACAAAGTGGCCATTGGGGCCACAATATCTCCGACGTCTTCGTTG
GATGGCGGTCAATTCGACATCAGCTTGATGGTTTGGAAGGATCCGAAGCATGGAAACTGGTGGCTGGAATTCGGAGCGGGAGTACTGGTGGGATACTGGCCGTCC
TTCTTATTCACGCATCTGCAACAGCATGCAACCATGGTGCAGTTCGGGGGAGAGGTGGTGAATTCAAGCCCCTCCGGATTCCACACCAGCACGGAGATGGGAAGT
GGGCGCTTCGGCGGGGAGGGATTCGGAAAAGCTTCCTACTTCCGAAAGCTGCAGGTGGTGGATTGGGACAACAGCTTGGTTCCATTAACAAACCTGATGGTATTG
GCGGATCATCCGGAGTGCTATGATATTGAAGCAGGGATGAACACCGACTGGGGCAACTACTTTTACTACGGTGGACCTGGGCGAAACCAGAGGTGTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ACAACCTCTCCCATATCTTTCACTATGCTTGAAGGATCATCCTTTGGCCTTCTCTCAACAACAATGCTGGGCCCTGCATTTCGTTGGACATGAAACACCCTTCCA
ACACTTAACACTCCGCCCAAGCATGGCCTCCGCCTGCTTCTTCAACACTCCCCAGACCATTGCCATTCTTGTGCCGCTGCTTTTGCTTCTTCTTTCTTTCACAAT
AACTCCGGTATACTCGTCTCATATTCCTGCCACCAACCAGACTTTCCATCCCCAACAGGAATTGAACAAGTTGAAGATGATAAGAGCTCATCTGGATAAGGTCAA
CAAGCCTGCTGTCCACACAATTCAGAGTCCTGACGGGGATCTCATTGATTGTGTTTTATCTCACCACCAACCAGCATTTGACCATCCGAACTTGAAGGCCCAAAA
ACCCTTGGATCCACCAGAGAGACCAGAAGGACATAAGCCGCCTAGGACGGAGATAGGAAGCTTCCAATTATGGAGCATGAATGGGGAAACTTGCCCGGAAGGAAC
AGTTCCAATAAGAAGAACAACGGAGGAAGACATACTCAGAGCTACCTCTTTTCAGATGTTTGGAAGAAAAGTAAGAAGATGGGTTAGGAGAGAAACAAGCAGCGA
CGGGCATGAGCACGCTGTGGGGTATGTGACCGGCAATCACTACTTCGGTGCAAAGGCAAGCATCAATGTGTGGGCACCTCGAGTCGCCGATCAGTATGAATTCAG
CTTATCGCAAATGTGGGTCATCTCCGGCTCTTTCGGCGACGATCTCAACACCATCGAAGCCGGTTGGCAGGTTAGTCCAGAACTGTACGGAGACAATTACCCAAG
ATTCTTTACTTACTGGACGTCGGACGCATACCAAGCAACCGGATGTTACAATCTGCTATGCTCGGGGTTTGTCCAAACAAATAACAAAGTGGCCATTGGGGCCAC
AATATCTCCGACGTCTTCGTTGGATGGCGGTCAATTCGACATCAGCTTGATGGTTTGGAAGGATCCGAAGCATGGAAACTGGTGGCTGGAATTCGGAGCGGGAGT
ACTGGTGGGATACTGGCCGTCCTTCTTATTCACGCATCTGCAACAGCATGCAACCATGGTGCAGTTCGGGGGAGAGGTGGTGAATTCAAGCCCCTCCGGATTCCA
CACCAGCACGGAGATGGGAAGTGGGCGCTTCGGCGGGGAGGGATTCGGAAAAGCTTCCTACTTCCGAAAGCTGCAGGTGGTGGATTGGGACAACAGCTTGGTTCC
ATTAACAAACCTGATGGTATTGGCGGATCATCCGGAGTGCTATGATATTGAAGCAGGGATGAACACCGACTGGGGCAACTACTTTTACTACGGTGGACCTGGGCG
AAACCAGAGGTGTCCTTGACACAATTCTTCTTTTTGTTTTTGTTACATCATAAAGTTGTAAATCTTGATTCACCATTTTTACTATAATATCAATATCAGCAGTTT
AGAC
Protein sequenceShow/hide protein sequence
MASACFFNTPQTIAILVPLLLLLLSFTITPVYSSHIPATNQTFHPQQELNKLKMIRAHLDKVNKPAVHTIQSPDGDLIDCVLSHHQPAFDHPNLKAQKPLDPPER
PEGHKPPRTEIGSFQLWSMNGETCPEGTVPIRRTTEEDILRATSFQMFGRKVRRWVRRETSSDGHEHAVGYVTGNHYFGAKASINVWAPRVADQYEFSLSQMWVI
SGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGATISPTSSLDGGQFDISLMVWKDPKHGNWWLEFGAGVLVGYWPS
FLFTHLQQHATMVQFGGEVVNSSPSGFHTSTEMGSGRFGGEGFGKASYFRKLQVVDWDNSLVPLTNLMVLADHPECYDIEAGMNTDWGNYFYYGGPGRNQRCP