CuGenDBv2

Lag0034901 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034901
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF604)
Genome location: chr3:12174310..12177802
RNA-Seq Expression: Lag0034901
Synteny: Lag0034901
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domains: IPR006740 - Protein of unknown function DUF604
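
The genome location above is written as `chromosome:start..end`. A minimal sketch of how such a string could be split into its parts is given below. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:12174310..12177802")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 3493 bp on chr3.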


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAA0051225.1 DUF604 domain-containing protein [Cucumis melo var. makuwa] (e-value: 1.4e-243, % identity: 83.37)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF
        MSSPLP++STIAKF++KLSSI+ G+VCKVLAF GLALFM+Y F+FSPPNYQP DLLT+LKQ +PI   +PS   P  TDPPTN SHI+FSIVGSMNTWK+
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF

Query:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------
        KRYYSESWWRPNVTRGHVFFERPP AEFLPWSDSSAPFRVNEDIT FAV+PRIKWPDQVRIFRTVMESFRE DKD RW+VMTDDDT              
Subjt:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------

Query:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP
                        SNFDFSF MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLFFCL+DLGF+ITHEMGFHQ+DLRGDASG+LSYHPQTP
Subjt:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP

Query:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE
        LLSLHHIDLINPI+PNMDRPAAI HLMKA AVDQSRL+QQTICYHRP NWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFER  APVFMFNTRWGVL+
Subjt:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE

Query:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCA +GNHSAESISKIRVFSSAK+PLEAGG ECCDVRMLD+NVTEVNYRPCYSGEVMA
Subjt:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

KAG6588196.1 hypothetical protein SDJN03_16761, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.3e-238, % identity: 82.72)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK
        MSSP PISSTIA FQ+KLSSITLGNVCKVLA SGLALF LY F+FSPPNYQ  D LT+LKQK+PI  PS  P  PP DPPTNISHIVFSIVGSMNTWK+K
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK

Query:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------
        RYYS+SWWRPNVTRGHVFFERPP  EFLPWS+ SAPFRVNEDI+ FAV+P+I+W DQVRIFRTVMESFRE   +ARWYVMTDDDT               
Subjt:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------

Query:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL
                       SNFDFS+ MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLF CL+DLGFSITHE GFHQ+DLRGDASGFLSYHPQTPL
Subjt:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL

Query:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN
        LSLHHIDLINPIFPNMDRPAAINHLMKA AVDQSRL+QQTICYHRP NW+FSMSWGYSAHIYEAIMARNYLKRPLETFAPFER +APVFMFNTRWGVL+N
Subjt:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN

Query:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        PCEAPHVLYFESIERD EDRIVTTYLRKWARNLPPCAS GNHSAES+SKIRVFSSAKVPLEAGGVECCDVRM+DMNVTEV+YRPCY GEVMA
Subjt:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

XP_008450413.1 PREDICTED: uncharacterized protein LOC103492025 [Cucumis melo] (e-value: 2.0e-242, % identity: 82.96)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF
        MSSPLP++STIAKF++KLSSI+ G+VCKVLAF GLALF++Y F+FSPPNYQP DLLT+LKQ +PI   +PS   P  TDPPTN SHI+FSIVGSMNTWK+
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF

Query:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------
        KRYYSESWWRPNVTRGHVFFERPP AEFLPWSDSSAPFRVNEDIT FAV+PRIKWPDQVRIFRTVMESFRE +KD RW+VMTDDDT              
Subjt:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------

Query:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP
                        SNFDFSF MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLFFCL+DLGF+ITHEMGFHQ+DLRGDASG+LSYHPQTP
Subjt:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP

Query:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE
        LLSLHHIDLINPI+PNMDRPAAI HLMKA AVDQSRL+QQTICYHRP NWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFER  APVFMFNTRWGVL+
Subjt:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE

Query:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCA +GNHSAESISKIRVFSSAK+PLEAGG ECCDVRMLD+NVTEVNYRPCYSGEVMA
Subjt:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

XP_022967315.1 uncharacterized protein LOC111466872 [Cucurbita maxima] (e-value: 3.8e-238, % identity: 82.52)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK
        MSSP PI+STIA FQ+KLSSI+LGNVCKVLA SGLALF LY F+FSPPNYQ  D LT+LKQK+PI  PS  P  PP DPPTNISHIVFSIVGSMNTWK+K
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK

Query:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------
        RYYSESWWRPNVTRGHVFFERPP  EFLPWS+SSAPFRVNEDI+SFAV+P+I+W DQVRIFRTVMESFRE   +ARWYVMTDDDT               
Subjt:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------

Query:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL
                       SNFDFS+ MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLF CL+DLGFSITHE GFHQ+DLRGDASGFLSYHPQTPL
Subjt:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL

Query:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN
        LSLHHIDLINPIFPNMDRPAAINHLMKA AVDQSRL+QQTICYHRP NW+FSMSWGYSAHIYEAIMARNYLKRPLETFAPFER +APVFMFNTRWGVL N
Subjt:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN

Query:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        PCEAPHVLYFESIERD EDRIVTTYLRKWARNLPPCA +GNHSAES+SKIRVFSSAKVPLEA GVECCDVRM+DMNVTEV+YRPCY GEVMA
Subjt:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

XP_038880269.1 uncharacterized protein LOC120071916 [Benincasa hispida] (e-value: 2.0e-242, % identity: 83.1)
Query:  SPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQ--PPPPTDPPTNISHIVFSIVGSMNTWKFKR
        SPLPI+STI KFQ+KLSS+  GNV KVLAFSGL LFMLY FLFSPPNY+P DLLT+LKQK+P+   S    PPPPTD PTN+SHI+FSIVGSMNTWK+KR
Subjt:  SPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQ--PPPPTDPPTNISHIVFSIVGSMNTWKFKR

Query:  YYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT----------------
        +YSESWWRPNVTRGHVFFERPP AEFLPWSDSSAPFRVNEDIT FAV+P+IKWPDQVRIFRTVMESFRE DK+ RW+VMTDDDT                
Subjt:  YYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT----------------

Query:  --------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLL
                      SNFDFSF MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLF CL+DLGFSITHEMGFHQ+DLRGDASG+LSYHPQTPLL
Subjt:  --------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLL

Query:  SLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENP
        SLHHIDLINPI+PNMDRPAAI HLM A AVDQSRL+QQTICYHRPSNWTFSMSWGYSAHIYEAIMAR+YLKRPLETFAPFERARAP+FMFNTRWGVL+NP
Subjt:  SLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENP

Query:  CEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        CEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCAS+GNHSAESISKIRVFSSA +PLEAGG ECCDVR+LDMNVTEVNYRPCYSGEVMA
Subjt:  CEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3BPU4 uncharacterized protein LOC103492025 (e-value: 9.5e-243, % identity: 82.96)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF
        MSSPLP++STIAKF++KLSSI+ G+VCKVLAF GLALF++Y F+FSPPNYQP DLLT+LKQ +PI   +PS   P  TDPPTN SHI+FSIVGSMNTWK+
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF

Query:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------
        KRYYSESWWRPNVTRGHVFFERPP AEFLPWSDSSAPFRVNEDIT FAV+PRIKWPDQVRIFRTVMESFRE +KD RW+VMTDDDT              
Subjt:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------

Query:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP
                        SNFDFSF MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLFFCL+DLGF+ITHEMGFHQ+DLRGDASG+LSYHPQTP
Subjt:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP

Query:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE
        LLSLHHIDLINPI+PNMDRPAAI HLMKA AVDQSRL+QQTICYHRP NWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFER  APVFMFNTRWGVL+
Subjt:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE

Query:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCA +GNHSAESISKIRVFSSAK+PLEAGG ECCDVRMLD+NVTEVNYRPCYSGEVMA
Subjt:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

A0A5A7UC64 DUF604 domain-containing protein (e-value: 6.6e-244, % identity: 83.37)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF
        MSSPLP++STIAKF++KLSSI+ G+VCKVLAF GLALFM+Y F+FSPPNYQP DLLT+LKQ +PI   +PS   P  TDPPTN SHI+FSIVGSMNTWK+
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPI--KTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKF

Query:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------
        KRYYSESWWRPNVTRGHVFFERPP AEFLPWSDSSAPFRVNEDIT FAV+PRIKWPDQVRIFRTVMESFRE DKD RW+VMTDDDT              
Subjt:  KRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------

Query:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP
                        SNFDFSF MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLFFCL+DLGF+ITHEMGFHQ+DLRGDASG+LSYHPQTP
Subjt:  ----------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTP

Query:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE
        LLSLHHIDLINPI+PNMDRPAAI HLMKA AVDQSRL+QQTICYHRP NWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFER  APVFMFNTRWGVL+
Subjt:  LLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLE

Query:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCA +GNHSAESISKIRVFSSAK+PLEAGG ECCDVRMLD+NVTEVNYRPCYSGEVMA
Subjt:  NPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

A0A6J1DK52 uncharacterized protein LOC111021649 (e-value: 4.1e-238, % identity: 81.47)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKR
        MSSP PI+S IAKF+SKL S+ +GNVCKVLAFSGLALFMLY F+FS PNYQP DLLTSLKQKWPIKTP+  PP   DPPTNISHIVFSIVGSMNTWKFKR
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKR

Query:  YYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT----------------
        +YSESWWRPNVTRGHVFFERPP  EFLPWSDSS PFRVNEDIT FAV+P+IKWPDQVRIFR VMESFRE DKD RW+VM DDDT                
Subjt:  YYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT----------------

Query:  --------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLL
                      SNFDFSF MAFGGAGYALSYPLA++VAKRLDGCIERYPYLRVSDQMLF CL+DLGFSITHE GFHQ+DLRGDASG+LSYHPQTPLL
Subjt:  --------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLL

Query:  SLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENP
        SLHHIDLINPIFPNMDRPAAINHLM A AVDQSRL+QQTICYHRP NWTFSMSWGYSAHIYEAIM+RNYLKRPLETFAPFERARAP+FMFNTRWGVL NP
Subjt:  SLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENP

Query:  CEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        CEAPH L+FESIERDGE+R+VTTY+RKWARNLP CA+ GNHSAE ISKIRVFSSA++PLEAGG ECCDV+M+DMNVTEV YRPCY GEVMA
Subjt:  CEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

A0A6J1EZV9 uncharacterized protein LOC111440887 (e-value: 4.6e-237, % identity: 81.91)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK
        MSSP PI+STIA FQ+KLSSITLGNVCKVLA SGLALF LY F+FSPPNYQ  D LT+LKQK+PI   S  P  PP DPPTNISHIVFSIVGSMNTWK+K
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK

Query:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------
        RYYS+SWWRPNVTRGHVFFERPP  EFLPWS+ SAPFRVNEDI+ FAV+P+I+W DQVRIFRTVMESFRE   +ARWYVMTDDDT               
Subjt:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------

Query:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL
                       SNFDFS+ MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLF CL+DLGFSITHE GFHQ+DLRGDASGFLSYHPQTPL
Subjt:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL

Query:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN
        LSLHHIDLINPIFPNMDRPAAINHLMKA AVDQSRL+QQTICYHRP NW+FSMSWGYSAHIYEAIMARNYLKRPLETFAPFER +APVFMFNTRWGVL+N
Subjt:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN

Query:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        PCEAPHVLYFESIERD EDRIVTTY+RKWARNLPPCA +GNHSAES+SKIRVFSSAKVPLEAGGVECCDVRM+DMNVTEV+YRPCY GEVMA
Subjt:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

A0A6J1HU35 uncharacterized protein LOC111466872 (e-value: 1.8e-238, % identity: 82.52)
Query:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK
        MSSP PI+STIA FQ+KLSSI+LGNVCKVLA SGLALF LY F+FSPPNYQ  D LT+LKQK+PI  PS  P  PP DPPTNISHIVFSIVGSMNTWK+K
Subjt:  MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPP-PPTDPPTNISHIVFSIVGSMNTWKFK

Query:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------
        RYYSESWWRPNVTRGHVFFERPP  EFLPWS+SSAPFRVNEDI+SFAV+P+I+W DQVRIFRTVMESFRE   +ARWYVMTDDDT               
Subjt:  RYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT---------------

Query:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL
                       SNFDFS+ MAFGGAGYALSYPLA+LVAKRLDGCIERYPYLRVSDQMLF CL+DLGFSITHE GFHQ+DLRGDASGFLSYHPQTPL
Subjt:  ---------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPL

Query:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN
        LSLHHIDLINPIFPNMDRPAAINHLMKA AVDQSRL+QQTICYHRP NW+FSMSWGYSAHIYEAIMARNYLKRPLETFAPFER +APVFMFNTRWGVL N
Subjt:  LSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLEN

Query:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA
        PCEAPHVLYFESIERD EDRIVTTYLRKWARNLPPCA +GNHSAES+SKIRVFSSAKVPLEA GVECCDVRM+DMNVTEV+YRPCY GEVMA
Subjt:  PCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVRMLDMNVTEVNYRPCYSGEVMA

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT2G37730.1 Protein of unknown function (DUF604) (e-value: 8.1e-69, % identity: 36.93)
Query:  PPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDK
        P  +   T+ISHI F I GS+ TW+ +  YSE WWRPNVTRG ++ +  PP   + W  +S P++V+ D + F+          +R+ R + E+F     
Subjt:  PPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDK

Query:  DARWYVMTDDDT-----------SNFD-------------------FSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSI
        D RW++M DDDT           + +D                    S+ MA+GG G A+SYPLA  + K LDGCI+RY  L  SDQ +  CL+++G  +
Subjt:  DARWYVMTDDDT-----------SNFD-------------------FSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSI

Query:  THEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKR
        T E+GFHQ+D+RG+  G L+ HP  PL++LHH+D ++PIFP   +  A+  L+ A   D SR++Q + C+ +  NW  S+SWGY+  IY  ++    L+ 
Subjt:  THEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKR

Query:  PLETFAPFERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTTY
        P  TF  +  + +  F F+TR  + E+PCE P V + + +   G  + +TTY
Subjt:  PLETFAPFERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTTY

AT4G11350.1 Protein of unknown function (DUF604) (e-value: 5.1e-63, % identity: 33.26)
Query:  SLKQKWPIK---TPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAP-FRVNEDITSFAVFPRIKW
        S+ Q+ P K   T + +  P     T+++H+VF I  S   WK ++ Y + W++P   RG+V+ +     +       S P  R++ D +SF    +   
Subjt:  SLKQKWPIK---TPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAP-FRVNEDITSFAVFPRIKW

Query:  PDQVRIFRTVMESF----RESDKDARWYVMTDDDT------------------------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIE
           +RI R V E+      ES K+ RW+VM DDDT                               N  FS+GMA+GG G+A+SYPLA  ++K  D CI+
Subjt:  PDQVRIFRTVMESF----RESDKDARWYVMTDDDT------------------------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIE

Query:  RYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWT
        RYP L  SD  +  C+A+LG  +T E+GFHQ D+ G+  G L+ HP TP +S+HH+D++ PIFPNM R  AI  L     +D + L+QQ+ICY +  +WT
Subjt:  RYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWT

Query:  FSMSWGYSAHIYEAIMARNYLKRPLETFAP-FERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGE-DRIVTTYLRKWARNLPPCASFGNHSAESIS
         S+SWG++  ++    +   ++ P  TF   ++RA    + FNTR  V  N C+ P V +  S + D + +  V+ Y R   R   P   +   + E I+
Subjt:  FSMSWGYSAHIYEAIMARNYLKRPLETFAP-FERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGE-DRIVTTYLRKWARNLPPCASFGNHSAESIS

Query:  KIRVFSSAKVPL--EAGGVECCDVRMLDMNVT-EVNYRPCYSGEV
         I V+      L   +    CC V     N T  +N   C +GEV
Subjt:  KIRVFSSAKVPL--EAGGVECCDVRMLDMNVT-EVNYRPCYSGEV

AT4G23490.1 Protein of unknown function (DUF604) (e-value: 7.1e-65, % identity: 33.65)
Query:  DPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSS--APFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDA
        D  T+++H+VF I  S   WK ++ Y + W++P   RG+V+ ++          D     P +++    SF    +      +RI R V E+ R   K+ 
Subjt:  DPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSS--APFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDA

Query:  RWYVMTDDDT------------------------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITH
        RW+VM DDDT                               N  FS+GMA+GG G+A+SYPLA  ++K  D CI+RYP L  SD  +  C+A+LG  +T 
Subjt:  RWYVMTDDDT------------------------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITH

Query:  EMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPL
        E+GFHQ D+ G+  G L+ HP TP +S+HH+D++ PIFPNM R  A+  + +   +D + L+QQ+ICY +  +WT S+SWGY+  I+  I +   ++ P 
Subjt:  EMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPL

Query:  ETFAP-FERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPL--EAGGVECCDVRM
         TF   ++RA    + FNTR  V  NPC+ P V Y  S + D +     +       + P C     + AE I+ I V+      L   +    CC V  
Subjt:  ETFAP-FERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPL--EAGGVECCDVRM

Query:  LDMNVT-EVNYRPCYSGEV
           N T  +N   C +GEV
Subjt:  LDMNVT-EVNYRPCYSGEV

AT5G12460.1 Protein of unknown function (DUF604) (e-value: 1.5e-107, % identity: 44.37)
Query:  YQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFP
        + P DL  S        + S       +PPTNISH+ F IVGS  TW+++R Y E WWRPN+T+G+VF ERPP  + LPW   S PF VN++      F 
Subjt:  YQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFP

Query:  RIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT------------------------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIE
          K+  Q+R+F ++ ESF+++ K+ RW+V+ DDDT                              SN  F+F M +GG GYALSYP    +   ++ CI+
Subjt:  RIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT------------------------------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIE

Query:  RYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWT
        RY  +  SD + F CLADLG  +T E G HQ DL GD SG LS HPQ+PL+SLHH D+I+PIFP M+R  ++NHLM+ A  DQSR++QQTICY R  NW+
Subjt:  RYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWT

Query:  FSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTT-YLRKWARNLPPCASFGNHSAESISK
         S+SWGYS HIY++I  R++LKRPLETF P++  R P + FNTR  V  +PCE P   +F+S+  D    +VTT Y  K  R LPPC   GNHS+ +I++
Subjt:  FSMSWGYSAHIYEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTT-YLRKWARNLPPCASFGNHSAESISK

Query:  IRVFSSAKVPLEAGGVECCDVRMLD-MNVTEVNYRPCYSGEVMA
        +RV ++    +   G+ECCDV+ ++   + EV  R C+  E +A
Subjt:  IRVFSSAKVPLEAGGVECCDVRMLD-MNVTEVNYRPCYSGEVMA

AT5G41460.1 Protein of unknown function (DUF604) (e-value: 3.3e-70, % identity: 32.99)
Query:  LPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSE
        L +S+T   +  KL  I+    C+V  FS +   +      S P    +   T++ + +    PSP PPPP  P T   H+VF I  S   WK ++ Y +
Subjt:  LPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSE

Query:  SWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------------
         W++PN  R +V+ E+P   E      S  P +++ D + F    +      +RI R V E+ +   KD RW+VM DDDT                    
Subjt:  SWWRPNVTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDT--------------------

Query:  ----------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHH
                   N  FS+GMA+GG G+A+SYPLA  ++K  D CI+RYP L  SD  +  C+A+LG  +T E+GFHQ D+ G+  G L+ HP  PL++LHH
Subjt:  ----------SNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIERYPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHH

Query:  IDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAP-FERARAPVFMFNTRWGVLENPCEA
        +D++ PIFPNM R  A+ HL   A +D + LMQQ+ICY +   WT S+SWG++  I+  I +   ++ P  TF   + RA    + FNTR  V  +PC+ 
Subjt:  IDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHIYEAIMARNYLKRPLETFAP-FERARAPVFMFNTRWGVLENPCEA

Query:  PHVLYFESIERDGEDRIVTTYLRKWA--RNLPPCASFGNHSAESISKIRVFSSAKVPL--EAGGVECCDVRMLDMNVTEVNYRPCYSGEVM
        P V Y  S       R+    + ++   R   P   +   +   I  + V+      L   +    CC V+    N  E++   C  GEV+
Subjt:  PHVLYFESIERDGEDRIVTTYLRKWA--RNLPPCASFGNHSAESISKIRVFSSAKVPL--EAGGVECCDVRMLDMNVTEVNYRPCYSGEVM
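
In the alignments above, the middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how a percent identity could be derived from such a midline is given below. The function name `midline_identity` is hypothetical, and the example uses only the first 20 columns of the first GenBank alignment, so its result is not expected to match the full-length figure reported for that hit.

```python
def midline_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes the BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# First 20 midline columns of the KAA0051225.1 alignment.
fragment = "MSSPLP++STIAKF++KLSS"
print(midline_identity(fragment))
```

Reported identities are computed over the whole alignment, so every midline segment of a hit would need to be concatenated before calling the function.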


Sequences
CDS sequence
ATGTCTTCTCCACTACCCATCAGCTCCACCATAGCCAAGTTTCAAAGCAAGCTTTCTTCAATAACTCTTGGAAATGTCTGCAAAGTTTTGGCATTTTCAGGGTTGGCCTT
GTTCATGCTCTATTTTTTCTTGTTTTCACCTCCAAACTACCAACCCTATGATCTTCTTACAAGCCTCAAGCAAAAATGGCCCATCAAAACTCCGTCGCCGCAGCCACCGC
CACCCACCGATCCTCCGACAAACATCTCTCACATCGTGTTTAGCATTGTCGGCTCGATGAACACATGGAAGTTCAAGAGATATTATAGCGAATCATGGTGGCGACCCAAT
GTAACCCGTGGCCACGTCTTCTTCGAACGCCCACCTCCCGCCGAGTTCCTACCATGGTCGGACTCGTCTGCTCCATTTCGAGTCAATGAAGATATCACAAGCTTTGCAGT
GTTTCCAAGAATCAAATGGCCGGACCAGGTGAGAATCTTTCGAACTGTGATGGAGTCGTTCAGAGAAAGCGACAAAGATGCAAGGTGGTACGTAATGACGGACGATGATA
CGTCTAACTTTGATTTCTCCTTTGGCATGGCTTTTGGAGGAGCTGGTTATGCTTTGAGTTACCCACTTGCATCATTGGTGGCAAAAAGGTTGGATGGTTGCATTGAGAGA
TATCCTTACTTGAGAGTTAGCGATCAAATGTTGTTCTTTTGTTTGGCTGATTTGGGATTCTCCATTACTCATGAAATGGGGTTTCACCAGATGGATCTACGAGGCGATGC
ATCGGGCTTTCTCTCATACCATCCACAAACCCCTCTCCTCTCCCTCCACCACATAGACCTCATCAATCCCATCTTCCCAAACATGGACCGCCCTGCCGCCATCAACCACT
TGATGAAGGCAGCGGCGGTCGACCAGTCCCGACTAATGCAGCAAACCATCTGCTACCACCGACCGTCGAACTGGACGTTCTCGATGTCATGGGGCTACTCTGCCCACATC
TATGAGGCCATAATGGCTAGAAACTATTTGAAGAGACCCTTGGAAACTTTTGCACCATTTGAACGAGCTAGGGCTCCTGTTTTCATGTTCAACACGAGGTGGGGAGTTCT
TGAGAACCCTTGTGAAGCTCCCCATGTGCTGTATTTTGAGTCCATTGAGAGAGATGGAGAGGATAGGATTGTTACTACTTATTTGAGGAAGTGGGCTCGCAATCTTCCTC
CCTGTGCTTCTTTTGGGAACCATTCTGCTGAATCCATCTCCAAGATTAGGGTTTTTTCCTCGGCTAAAGTCCCTTTGGAGGCAGGAGGAGTAGAGTGTTGTGATGTAAGA
ATGTTGGACATGAATGTTACAGAAGTAAATTACAGGCCTTGTTATAGTGGGGAAGTGATGGCTTAA
mRNA sequence
ATGTCTTCTCCACTACCCATCAGCTCCACCATAGCCAAGTTTCAAAGCAAGCTTTCTTCAATAACTCTTGGAAATGTCTGCAAAGTTTTGGCATTTTCAGGGTTGGCCTT
GTTCATGCTCTATTTTTTCTTGTTTTCACCTCCAAACTACCAACCCTATGATCTTCTTACAAGCCTCAAGCAAAAATGGCCCATCAAAACTCCGTCGCCGCAGCCACCGC
CACCCACCGATCCTCCGACAAACATCTCTCACATCGTGTTTAGCATTGTCGGCTCGATGAACACATGGAAGTTCAAGAGATATTATAGCGAATCATGGTGGCGACCCAAT
GTAACCCGTGGCCACGTCTTCTTCGAACGCCCACCTCCCGCCGAGTTCCTACCATGGTCGGACTCGTCTGCTCCATTTCGAGTCAATGAAGATATCACAAGCTTTGCAGT
GTTTCCAAGAATCAAATGGCCGGACCAGGTGAGAATCTTTCGAACTGTGATGGAGTCGTTCAGAGAAAGCGACAAAGATGCAAGGTGGTACGTAATGACGGACGATGATA
CGTCTAACTTTGATTTCTCCTTTGGCATGGCTTTTGGAGGAGCTGGTTATGCTTTGAGTTACCCACTTGCATCATTGGTGGCAAAAAGGTTGGATGGTTGCATTGAGAGA
TATCCTTACTTGAGAGTTAGCGATCAAATGTTGTTCTTTTGTTTGGCTGATTTGGGATTCTCCATTACTCATGAAATGGGGTTTCACCAGATGGATCTACGAGGCGATGC
ATCGGGCTTTCTCTCATACCATCCACAAACCCCTCTCCTCTCCCTCCACCACATAGACCTCATCAATCCCATCTTCCCAAACATGGACCGCCCTGCCGCCATCAACCACT
TGATGAAGGCAGCGGCGGTCGACCAGTCCCGACTAATGCAGCAAACCATCTGCTACCACCGACCGTCGAACTGGACGTTCTCGATGTCATGGGGCTACTCTGCCCACATC
TATGAGGCCATAATGGCTAGAAACTATTTGAAGAGACCCTTGGAAACTTTTGCACCATTTGAACGAGCTAGGGCTCCTGTTTTCATGTTCAACACGAGGTGGGGAGTTCT
TGAGAACCCTTGTGAAGCTCCCCATGTGCTGTATTTTGAGTCCATTGAGAGAGATGGAGAGGATAGGATTGTTACTACTTATTTGAGGAAGTGGGCTCGCAATCTTCCTC
CCTGTGCTTCTTTTGGGAACCATTCTGCTGAATCCATCTCCAAGATTAGGGTTTTTTCCTCGGCTAAAGTCCCTTTGGAGGCAGGAGGAGTAGAGTGTTGTGATGTAAGA
ATGTTGGACATGAATGTTACAGAAGTAAATTACAGGCCTTGTTATAGTGGGGAAGTGATGGCTTAA
Protein sequence
MSSPLPISSTIAKFQSKLSSITLGNVCKVLAFSGLALFMLYFFLFSPPNYQPYDLLTSLKQKWPIKTPSPQPPPPTDPPTNISHIVFSIVGSMNTWKFKRYYSESWWRPN
VTRGHVFFERPPPAEFLPWSDSSAPFRVNEDITSFAVFPRIKWPDQVRIFRTVMESFRESDKDARWYVMTDDDTSNFDFSFGMAFGGAGYALSYPLASLVAKRLDGCIER
YPYLRVSDQMLFFCLADLGFSITHEMGFHQMDLRGDASGFLSYHPQTPLLSLHHIDLINPIFPNMDRPAAINHLMKAAAVDQSRLMQQTICYHRPSNWTFSMSWGYSAHI
YEAIMARNYLKRPLETFAPFERARAPVFMFNTRWGVLENPCEAPHVLYFESIERDGEDRIVTTYLRKWARNLPPCASFGNHSAESISKIRVFSSAKVPLEAGGVECCDVR
MLDMNVTEVNYRPCYSGEVMA
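
The protein sequence above should correspond to a translation of the CDS sequence. A minimal sketch of that check is given below, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical, and the two inputs are the opening 24 bases and the closing 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCTTCTCCACTACCCATCAGC"))  # start of the CDS
print(translate("GTGATGGCTTAA"))              # end of the CDS, including the stop codon
```

Joining the wrapped CDS lines into one string before calling `translate` would allow the full protein to be compared against the listed sequence.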