; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G017780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G017780
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCG_Chr08:29857739..29859805
RNA-Seq ExpressionClCG08G017780
SyntenyClCG08G017780
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061068.1 uncharacterized protein E6C27_scaffold501G002060 [Cucumis melo var. makuwa]1.8e-24388.84Show/hide
Query:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ
        P  F   K  P   P     TLGHR+SMASALCFINI+ TILIF+L FSLLASTI+PVHSL +R +TNQTDFHP+QELNKLKMIR HLDKINKPA+HTIQ
Subjt:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ

Query:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE
        SPDGDIIDCVLSH QPAFDHP LQGQIP+DPPERPQGHK PRT T SFQLW+M+GENCPEGTVPIRRTTEEDMLRATSFQMFG+KV +WVRRETSSDGHE
Subjt:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE

Query:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
        HAVGYVTG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACL         PELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
Subjt:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF

Query:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
        VQTNNK+AIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
Subjt:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA

Query:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDI+GGINTVWGNYFYYGGPGRNDRCP
Subjt:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

XP_004142970.2 uncharacterized protein LOC101208399 [Cucumis sativus]1.0e-24188.38Show/hide
Query:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQS
        P  F   K  P   P     TLGHR+SMASALCFINI+ TILIF+L FSLLASTI+PVHSLQ R TNQTDFHP+QELNKLKMIR HLDKINKPA+HTIQS
Subjt:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQS

Query:  PDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEH
        PDGDIIDCVLSH QPAFDHP LQGQ P+DPPERPQGHK PRT T SFQLW+ +GENCPEGTVPIRRTTEED+LRATSFQMFGRKV +WVRRETSSDGHEH
Subjt:  PDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEH

Query:  AVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFV
        AVGYVTG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFV
Subjt:  AVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFV

Query:  QTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKAS
        QTNNK+AIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSG HTTTEMGSGHFAGEGFGKAS
Subjt:  QTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKAS

Query:  YFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        YFRNLQVVDWDNSLVPLSNLVVLADHPNCYDI+GGINTVWGNYFYYGGPGRNDRCP
Subjt:  YFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

XP_008444361.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103487714 [Cucumis melo]2.9e-24188.62Show/hide
Query:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ
        P  F   K  P   P     TLGHR+SMASALCFINI+ TILIF+L FSLLASTI+PVHSL +R +TNQTDFHP+QELNKLKMIR HLDKINKPA+HTIQ
Subjt:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ

Query:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE
        SPDGDIIDCVLSH QPAFDHP LQGQIP+DPPERPQGHK PRT T SFQLW+M+GENCPEGTVPIRRTTEEDMLRATSFQMFG+KV +WVRRETSSDGHE
Subjt:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE

Query:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
        HAVGYVTG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
Subjt:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF

Query:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
        VQTNNK+AIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
Subjt:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA

Query:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDI+GGINTVWGNYFYYGGPGRNDRCP
Subjt:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

XP_022961843.1 uncharacterized protein LOC111462488 [Cucurbita moschata]3.7e-22889.58Show/hide
Query:  MASALCFINISQTILIF-LLPFSLLASTISPVHS--LQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQG
        MAS+ CF NISQ I IF LL   LL STI+P+HS   Q+ ATNQT FHP+QELNKLKMIR  LD INKPALHTIQSPDGD+IDCVLSH QPAFDHP L+G
Subjt:  MASALCFINISQTILIF-LLPFSLLASTISPVHS--LQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQG

Query:  QIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV
        QIP+DPPERP+GHK PRTVT SFQLW+MNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKV RWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV
Subjt:  QIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV

Query:  ADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFD
        ADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK+AIGAAISPTSSF+GGQFD
Subjt:  ADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFD

Query:  ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA
        ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA
Subjt:  ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA

Query:  DHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        DHPNCYDIQGGINTVWGNYFYYGGPGRN RCP
Subjt:  DHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

XP_038885684.1 uncharacterized protein LOC120075988 [Benincasa hispida]1.5e-24292.18Show/hide
Query:  GHRMSMASALCFINISQTILIFLLPFSLL-ASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPN
        GHR+SMASALCFINISQTILIF+L FSLL  STI+PVHS  + ATNQTDFHP++ELNKL MIR HLDKINKPA+HTIQSPDGD+IDCVLSHQQPAFDHP 
Subjt:  GHRMSMASALCFINISQTILIFLLPFSLL-ASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPN

Query:  LQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWA
        LQGQIP+DPPERPQGHK PRTVT SFQLW+MNGENCPEGTVPIRRTTEEDMLRATSFQMFG+KVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWA
Subjt:  LQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWA

Query:  PRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGG
        PRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK+AIGAAISPTSSFEGG
Subjt:  PRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGG

Query:  QFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLV
        QFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHL+DHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLV
Subjt:  QFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLV

Query:  VLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        VLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
Subjt:  VLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

TrEMBL top hitse value%identityAlignment
A0A0A0LKI2 Uncharacterized protein4.9e-24288.38Show/hide
Query:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQS
        P  F   K  P   P     TLGHR+SMASALCFINI+ TILIF+L FSLLASTI+PVHSLQ R TNQTDFHP+QELNKLKMIR HLDKINKPA+HTIQS
Subjt:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQS

Query:  PDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEH
        PDGDIIDCVLSH QPAFDHP LQGQ P+DPPERPQGHK PRT T SFQLW+ +GENCPEGTVPIRRTTEED+LRATSFQMFGRKV +WVRRETSSDGHEH
Subjt:  PDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEH

Query:  AVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFV
        AVGYVTG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFV
Subjt:  AVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFV

Query:  QTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKAS
        QTNNK+AIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSG HTTTEMGSGHFAGEGFGKAS
Subjt:  QTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKAS

Query:  YFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        YFRNLQVVDWDNSLVPLSNLVVLADHPNCYDI+GGINTVWGNYFYYGGPGRNDRCP
Subjt:  YFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

A0A1S3BA82 LOW QUALITY PROTEIN: uncharacterized protein LOC1034877141.4e-24188.62Show/hide
Query:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ
        P  F   K  P   P     TLGHR+SMASALCFINI+ TILIF+L FSLLASTI+PVHSL +R +TNQTDFHP+QELNKLKMIR HLDKINKPA+HTIQ
Subjt:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ

Query:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE
        SPDGDIIDCVLSH QPAFDHP LQGQIP+DPPERPQGHK PRT T SFQLW+M+GENCPEGTVPIRRTTEEDMLRATSFQMFG+KV +WVRRETSSDGHE
Subjt:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE

Query:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
        HAVGYVTG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
Subjt:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF

Query:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
        VQTNNK+AIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
Subjt:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA

Query:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDI+GGINTVWGNYFYYGGPGRNDRCP
Subjt:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

A0A5A7V0I8 Uncharacterized protein8.8e-24488.84Show/hide
Query:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ
        P  F   K  P   P     TLGHR+SMASALCFINI+ TILIF+L FSLLASTI+PVHSL +R +TNQTDFHP+QELNKLKMIR HLDKINKPA+HTIQ
Subjt:  PLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSR-ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQ

Query:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE
        SPDGDIIDCVLSH QPAFDHP LQGQIP+DPPERPQGHK PRT T SFQLW+M+GENCPEGTVPIRRTTEEDMLRATSFQMFG+KV +WVRRETSSDGHE
Subjt:  SPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHE

Query:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
        HAVGYVTG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACL         PELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
Subjt:  HAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF

Query:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
        VQTNNK+AIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA
Subjt:  VQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKA

Query:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDI+GGINTVWGNYFYYGGPGRNDRCP
Subjt:  SYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

A0A6J1HCZ9 uncharacterized protein LOC1114624881.8e-22889.58Show/hide
Query:  MASALCFINISQTILIF-LLPFSLLASTISPVHS--LQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQG
        MAS+ CF NISQ I IF LL   LL STI+P+HS   Q+ ATNQT FHP+QELNKLKMIR  LD INKPALHTIQSPDGD+IDCVLSH QPAFDHP L+G
Subjt:  MASALCFINISQTILIF-LLPFSLLASTISPVHS--LQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQG

Query:  QIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV
        QIP+DPPERP+GHK PRTVT SFQLW+MNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKV RWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV
Subjt:  QIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV

Query:  ADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFD
        ADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK+AIGAAISPTSSF+GGQFD
Subjt:  ADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFD

Query:  ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA
        ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA
Subjt:  ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA

Query:  DHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        DHPNCYDIQGGINTVWGNYFYYGGPGRN RCP
Subjt:  DHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

A0A6J1KBP1 uncharacterized protein LOC1114918487.5e-22789.12Show/hide
Query:  MASALCFINISQTILIF-LLPFSLLASTISPVHSLQSR--ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQG
        MAS+ CF NISQ I IF LL   LL STI+P+HS  +R  ATNQT FHP+QELNKLKMIR  LD INKPALHTIQSPDGD+IDCVLSH QPAFDHP L+G
Subjt:  MASALCFINISQTILIF-LLPFSLLASTISPVHSLQSR--ATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQG

Query:  QIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV
        QIP+DPPERP+GHK PRTVT SFQLW+MNGENCPEGTVPIRRTTEED+LRATSFQMFGRKV RWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV
Subjt:  QIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRV

Query:  ADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFD
        ADQYEFSLSQMWVISGSFGDDLNTIEAGWQ          VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK+AIGAAISPTSSF+GGQFD
Subjt:  ADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFD

Query:  ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA
        ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSG HTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA
Subjt:  ISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLA

Query:  DHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP
        DHPNCYDIQGGINTVWGNYFYYGGPGRN RCP
Subjt:  DHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)1.3e-18372.46Show/hide
Query:  IFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRT
        I  L   LL+S+ S V S      NQT   P  ELNKLK I  HL KINKP++ TI SPDGDIIDCVL H QPAFDHP+L+GQ P+DPPERP+GH +   
Subjt:  IFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRT

Query:  VTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF
           SFQLW M GE CPEGTVPIRRT EED+LRA S   FG+K+ R  RR+TSS+GHEHAVGYV+G+ Y+GAKASINVWAP+V +QYEFSLSQ+W+ISGSF
Subjt:  VTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF

Query:  GDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEF
        G+DLNTIEAGWQ          VSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN+++AIGAAISP+SS++GGQFDI+LL+WKDPKHGNWWLEF
Subjt:  GDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGN
        GSG+LVGYWPSFLFTHL++HA+MVQ+GGE+VNSSP G HT+T+MGSGHFA EGF K+SYFRN+QVVDWDN+LVP  NL VLADHPNCYDIQGG N  WG+
Subjt:  GSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGN

Query:  YFYYGGPGRNDRCP
        YFYYGGPG+N +CP
Subjt:  YFYYGGPGRNDRCP

AT1G23340.1 Protein of Unknown Function (DUF239)5.2e-18070.05Show/hide
Query:  FLLPFSLLASTISPVHSLQSRATNQT-DFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRT
        F+L  SL +S  SP     S +T++T    P++E+ K+K+IR  L KINKPA+ TI S DGD IDCV SH QPAFDHP LQGQ PMDPPE P G+ Q   
Subjt:  FLLPFSLLASTISPVHSLQSRATNQT-DFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRT

Query:  VTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF
           +FQLW++ GE+CPEGT+PIRRTTE+DMLRA S + FGRK+ R VRR++SS+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFSLSQ+W+I+GSF
Subjt:  VTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF

Query:  GDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEF
          DLNTIEAGWQ          +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN++AIGAAISP SS++GGQFDISLL+WKDPKHG+WWL+F
Subjt:  GDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGN
        GSG LVGYWP  LFTHL++H  MVQFGGE+VN+ P G HT+T+MGSGHFAGEGFGKASYFRNLQ+VDWDN+L+P+SNL VLADHPNCYDI+GG+N VWGN
Subjt:  GSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGN

Query:  YFYYGGPGRNDRCP
        +FYYGGPG+N +CP
Subjt:  YFYYGGPGRNDRCP

AT1G23340.2 Protein of Unknown Function (DUF239)5.2e-18070.05Show/hide
Query:  FLLPFSLLASTISPVHSLQSRATNQT-DFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRT
        F+L  SL +S  SP     S +T++T    P++E+ K+K+IR  L KINKPA+ TI S DGD IDCV SH QPAFDHP LQGQ PMDPPE P G+ Q   
Subjt:  FLLPFSLLASTISPVHSLQSRATNQT-DFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRT

Query:  VTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF
           +FQLW++ GE+CPEGT+PIRRTTE+DMLRA S + FGRK+ R VRR++SS+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFSLSQ+W+I+GSF
Subjt:  VTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF

Query:  GDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEF
          DLNTIEAGWQ          +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN++AIGAAISP SS++GGQFDISLL+WKDPKHG+WWL+F
Subjt:  GDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGN
        GSG LVGYWP  LFTHL++H  MVQFGGE+VN+ P G HT+T+MGSGHFAGEGFGKASYFRNLQ+VDWDN+L+P+SNL VLADHPNCYDI+GG+N VWGN
Subjt:  GSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGINTVWGN

Query:  YFYYGGPGRNDRCP
        +FYYGGPG+N +CP
Subjt:  YFYYGGPGRNDRCP

AT1G70550.1 Protein of Unknown Function (DUF239)4.0e-18068.74Show/hide
Query:  SQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGH
        S + L  +L   L++S+ S   S  +         P++EL KL +IR  LDKINKPA+ TIQS DGD IDCV +HQQPAFDHP LQGQ P+DPPE P+G+
Subjt:  SQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGH

Query:  KQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWV
         +      + QLW+++GE+CPEGT+PIRRTTE+DMLRA+S Q FGRK+ R V+R+++++GHEHAVGYVTG  Y+GAKASINVW+PRV  QYEFSLSQ+WV
Subjt:  KQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWV

Query:  ISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGN
        I+GSF  DLNTIEAGWQ          +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN ++AIGAAISP SS++GGQFDISLL+WKDPKHG+
Subjt:  ISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGN

Query:  WWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGIN
        WWL+FGSG LVGYWP+FLFTHL+ H +MVQFGGE+VN+ P G HTTT+MGSGHFAGEGFGKASYFRNLQ+VDWDN+L+P SNL +LADHPNCYDI+GG N
Subjt:  WWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGIN

Query:  TVWGNYFYYGGPGRNDRCP
         VWGNYFYYGGPG+N RCP
Subjt:  TVWGNYFYYGGPGRNDRCP

AT1G70550.2 Protein of Unknown Function (DUF239)4.0e-18068.74Show/hide
Query:  SQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGH
        S + L  +L   L++S+ S   S  +         P++EL KL +IR  LDKINKPA+ TIQS DGD IDCV +HQQPAFDHP LQGQ P+DPPE P+G+
Subjt:  SQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKINKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGH

Query:  KQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWV
         +      + QLW+++GE+CPEGT+PIRRTTE+DMLRA+S Q FGRK+ R V+R+++++GHEHAVGYVTG  Y+GAKASINVW+PRV  QYEFSLSQ+WV
Subjt:  KQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWV

Query:  ISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGN
        I+GSF  DLNTIEAGWQ          +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN ++AIGAAISP SS++GGQFDISLL+WKDPKHG+
Subjt:  ISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGAAISPTSSFEGGQFDISLLVWKDPKHGN

Query:  WWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGIN
        WWL+FGSG LVGYWP+FLFTHL+ H +MVQFGGE+VN+ P G HTTT+MGSGHFAGEGFGKASYFRNLQ+VDWDN+L+P SNL +LADHPNCYDI+GG N
Subjt:  WWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNLVVLADHPNCYDIQGGIN

Query:  TVWGNYFYYGGPGRNDRCP
         VWGNYFYYGGPG+N RCP
Subjt:  TVWGNYFYYGGPGRNDRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCTTTTGGTTTCTCCCAACAACAATGCTCCTAATTTCCTTGGTTTGACATGGTGGCCGCTTCATTTCTATGGAGGAAAAACACTCCCCACAGATCCCCCCATTGT
GATCAATCAAACACTCGGACACCGCATGAGCATGGCCTCTGCTTTATGCTTCATCAACATCTCCCAAACTATTCTCATTTTTCTCCTGCCTTTTTCCCTTCTTGCATCCA
CAATATCACCAGTGCACTCGTTGCAAAGCCGCGCCACAAATCAGACTGACTTCCATCCTGAACAGGAATTGAACAAGTTGAAGATGATAAGGACTCATTTGGACAAGATC
AACAAGCCTGCTCTCCACACAATTCAGAGTCCTGATGGAGACATCATAGATTGTGTTTTATCTCACCAGCAACCAGCATTCGACCATCCAAATTTGCAAGGGCAAATACC
TATGGATCCGCCAGAGAGACCACAAGGACATAAGCAGCCTAGGACGGTGACAGGGAGCTTCCAATTATGGAACATGAACGGAGAAAATTGCCCAGAAGGAACAGTGCCCA
TAAGAAGAACAACCGAGGAAGACATGCTAAGAGCTACCTCTTTTCAGATGTTTGGGAGAAAAGTAAGCAGATGGGTTAGGAGAGAAACGAGCAGCGATGGGCATGAGCAC
GCTGTGGGGTATGTGACCGGCGATCACTACTTTGGGGCAAAAGCAAGCATCAATGTGTGGGCACCTCGAGTCGCCGATCAGTATGAATTCAGCTTATCGCAAATGTGGGT
CATCTCCGGCTCCTTCGGCGACGACCTCAACACCATTGAAGCTGGTTGGCAGGCATGTCTCTCTCTATGTTACAAAATCCTCGTTAGTCCAGAACTGTACGGAGACAATT
ACCCAAGATTCTTTACTTACTGGACCTCGGATGCATACCAAGCAACCGGATGTTACAATTTGCTATGCTCTGGGTTTGTTCAGACAAATAACAAAGTTGCAATTGGGGCA
GCAATTTCTCCAACGTCTTCGTTTGAGGGTGGTCAATTCGACATCAGCTTGTTGGTTTGGAAGGATCCGAAGCATGGAAACTGGTGGCTGGAATTCGGGTCGGGGGTTTT
GGTGGGATATTGGCCGTCGTTCCTGTTCACCCATTTACAAGACCATGCAACAATGGTGCAATTCGGGGGAGAGGTGGTGAATTCAAGTCCGTCGGGATTCCACACCACCA
CAGAGATGGGGAGTGGGCATTTCGCGGGCGAGGGTTTCGGAAAAGCTTCGTATTTTAGGAACCTGCAAGTGGTGGATTGGGATAACAGCTTAGTTCCATTATCAAACCTG
GTGGTTTTGGCGGATCATCCCAACTGCTATGACATTCAAGGTGGAATCAACACCGTTTGGGGAAACTACTTTTACTACGGTGGACCTGGTCGAAACGACAGGTGTCCTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGATCCTTTTGGTTTCTCCCAACAACAATGCTCCTAATTTCCTTGGTTTGACATGGTGGCCGCTTCATTTCTATGGAGGAAAAACACTCCCCACAGATCCCCCCATTGT
GATCAATCAAACACTCGGACACCGCATGAGCATGGCCTCTGCTTTATGCTTCATCAACATCTCCCAAACTATTCTCATTTTTCTCCTGCCTTTTTCCCTTCTTGCATCCA
CAATATCACCAGTGCACTCGTTGCAAAGCCGCGCCACAAATCAGACTGACTTCCATCCTGAACAGGAATTGAACAAGTTGAAGATGATAAGGACTCATTTGGACAAGATC
AACAAGCCTGCTCTCCACACAATTCAGAGTCCTGATGGAGACATCATAGATTGTGTTTTATCTCACCAGCAACCAGCATTCGACCATCCAAATTTGCAAGGGCAAATACC
TATGGATCCGCCAGAGAGACCACAAGGACATAAGCAGCCTAGGACGGTGACAGGGAGCTTCCAATTATGGAACATGAACGGAGAAAATTGCCCAGAAGGAACAGTGCCCA
TAAGAAGAACAACCGAGGAAGACATGCTAAGAGCTACCTCTTTTCAGATGTTTGGGAGAAAAGTAAGCAGATGGGTTAGGAGAGAAACGAGCAGCGATGGGCATGAGCAC
GCTGTGGGGTATGTGACCGGCGATCACTACTTTGGGGCAAAAGCAAGCATCAATGTGTGGGCACCTCGAGTCGCCGATCAGTATGAATTCAGCTTATCGCAAATGTGGGT
CATCTCCGGCTCCTTCGGCGACGACCTCAACACCATTGAAGCTGGTTGGCAGGCATGTCTCTCTCTATGTTACAAAATCCTCGTTAGTCCAGAACTGTACGGAGACAATT
ACCCAAGATTCTTTACTTACTGGACCTCGGATGCATACCAAGCAACCGGATGTTACAATTTGCTATGCTCTGGGTTTGTTCAGACAAATAACAAAGTTGCAATTGGGGCA
GCAATTTCTCCAACGTCTTCGTTTGAGGGTGGTCAATTCGACATCAGCTTGTTGGTTTGGAAGGATCCGAAGCATGGAAACTGGTGGCTGGAATTCGGGTCGGGGGTTTT
GGTGGGATATTGGCCGTCGTTCCTGTTCACCCATTTACAAGACCATGCAACAATGGTGCAATTCGGGGGAGAGGTGGTGAATTCAAGTCCGTCGGGATTCCACACCACCA
CAGAGATGGGGAGTGGGCATTTCGCGGGCGAGGGTTTCGGAAAAGCTTCGTATTTTAGGAACCTGCAAGTGGTGGATTGGGATAACAGCTTAGTTCCATTATCAAACCTG
GTGGTTTTGGCGGATCATCCCAACTGCTATGACATTCAAGGTGGAATCAACACCGTTTGGGGAAACTACTTTTACTACGGTGGACCTGGTCGAAACGACAGGTGTCCTTG
A
Protein sequenceShow/hide protein sequence
MILLVSPNNNAPNFLGLTWWPLHFYGGKTLPTDPPIVINQTLGHRMSMASALCFINISQTILIFLLPFSLLASTISPVHSLQSRATNQTDFHPEQELNKLKMIRTHLDKI
NKPALHTIQSPDGDIIDCVLSHQQPAFDHPNLQGQIPMDPPERPQGHKQPRTVTGSFQLWNMNGENCPEGTVPIRRTTEEDMLRATSFQMFGRKVSRWVRRETSSDGHEH
AVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLSLCYKILVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKVAIGA
AISPTSSFEGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEGFGKASYFRNLQVVDWDNSLVPLSNL
VVLADHPNCYDIQGGINTVWGNYFYYGGPGRNDRCP