CuGenDBv2

Carg16057 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg16057
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Unknown protein
Genome location: Carg_Chr07:2130302..2133694
RNA-Seq Expression: Carg16057
Synteny: Carg16057
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6594816.1 hypothetical protein SDJN03_11369, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.3e-254; % identity: 98.92
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSS QKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNK+GHRNGLDCAT+SNVGQNGSVDGDHGATAVQHGSNHTNNGSTM VSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSS+NDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

KAG7026781.1 hypothetical protein SDJN02_10788 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.7e-257; % identity: 100
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

XP_022962776.1 uncharacterized protein LOC111463164 [Cucurbita moschata]; e value: 6.1e-252; % identity: 98.49
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSSRQKMNDNDIG    NVNVNGSDSNDVKLSEGASVTVDLPIPNK+GHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGST+MVSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSS+NDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

XP_023003998.1 uncharacterized protein LOC111497447 [Cucurbita maxima]; e value: 1.2e-247; % identity: 96.98
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAP+TNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSS QK+NDNDIG    NVNVNGSDSNDVKLSEGASVTVDLPIPNK+G RNG DCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGS+MMVSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTV+TNGNL+YSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVAT ITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

XP_023517074.1 uncharacterized protein LOC111780942 [Cucurbita pepo subsp. pepo]; e value: 2.8e-249; % identity: 97.63
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAP+TNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSS QKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNK+GHRNGLDCAT+SNVGQNGSVDGDHGATAVQHGSNHTNNGSTM VSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTVATNGN     PAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSS+NDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B1E0 uncharacterized protein LOC103485065; e value: 4.8e-194; % identity: 79.96
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGT+KS+DK LPK  PALTFNRAP+T LERRNSAS A+RKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SED V S +KMND D+G    N +V  SD NDVKL+EGASVTV  PIP+K+G RNGLDCA+SSN+G+NG VDGDHGATAVQ  S+H N+ S+++ S+ +A
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        +EKDSLK VV   +S GD EDFFDP DSLSVASNTDGEDNG+ERSAKF TPMGEFYDAWEE+SS+G+P PSIS IE + REMR  LLME+EK+KQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        + L+ QWQRLRE L LVGLTLPSDPTVAT G  L SDPAEELCQQVN+ARFVS SIG+GIARAEVE EMEAQLE KNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRR RW+WGSVATAITLGTAVLAWSYLPSGKD  S N+SKA EHDD TD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

A0A5A7T005 Uncharacterized protein; e value: 4.8e-194; % identity: 79.96
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGT+KS+DK LPK  PALTFNRAP+T LERRNSAS A+RKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SED V S +KMND D+G    N +V  SD NDVKL+EGASVTV  PIP+K+G RNGLDCA+SSN+G+NG VDGDHGATAVQ  S+H N+ S+++ S+ +A
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        +EKDSLK VV   +S GD EDFFDP DSLSVASNTDGEDNG+ERSAKF TPMGEFYDAWEE+SS+G+P PSIS IE + REMR  LLME+EK+KQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        + L+ QWQRLRE L LVGLTLPSDPTVAT G  L SDPAEELCQQVN+ARFVS SIG+GIARAEVE EMEAQLE KNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRR RW+WGSVATAITLGTAVLAWSYLPSGKD  S N+SKA EHDD TD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

A0A5D3CMF0 Uncharacterized protein; e value: 1.6e-194; % identity: 80.17
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGT+KS+DK LPK  PALTFNRAP+T LERRNSAS A+RKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SED V S +KMND D+G    N +V  SD NDVKL+EGASVTV  PIP+K+G RNGLDCA+SSN+G+NG VDGDHGATAVQ  S+H N+ S+++ S+ +A
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        +EKDSLK VV   +S GD EDFFDP DSLSVASNTDGEDNG+ERSAKF TPMGEFYDAWEE+SS+G+P PSIS IE + REMR  LLME+EKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        + L+ QWQRLRE L LVGLTLPSDPTVAT G  L SDPAEELCQQVN+ARFVS SIG+GIARAEVE EMEAQLE KNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRR RW+WGSVATAITLGTAVLAWSYLPSGKD  S N+SKA EHDD TD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

A0A6J1HDH8 uncharacterized protein LOC111463164; e value: 3.0e-252; % identity: 98.49
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSSRQKMNDNDIG    NVNVNGSDSNDVKLSEGASVTVDLPIPNK+GHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGST+MVSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSS+NDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

A0A6J1KTC8 uncharacterized protein LOC111497447; e value: 5.8e-248; % identity: 96.98
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAP+TNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA
        SEDVVSS QK+NDNDIG    NVNVNGSDSNDVKLSEGASVTVDLPIPNK+G RNG DCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGS+MMVSNDVA
Subjt:  SEDVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVA

Query:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
        REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL
Subjt:  REKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL

Query:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
        DNLRGQWQRLREHLSLVGLTLPSDPTV+TNGNL+YSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR
Subjt:  DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQR

Query:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
        NQEAVDLARRERLRRKRRLRWMWGSVAT ITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD
Subjt:  NQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G50910.1 unknown protein; e value: 3.5e-104; % identity: 49.46
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALT---FNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRG-PRL
        MPTF+ IALDR+LEPG S SV+      +P+ T   +++ P + LE+       ER V RP + PALY TP+A PLP+SPSSFPPSPYI+NHK RG PRL
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALT---FNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRG-PRL

Query:  LKSFSED--VVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIP--NKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGST
        LKS SE   V SS QK  + +      +V V+             S +   PI    ++ + NG+   T  N   +G VDG  G  +   G +       
Subjt:  LKSFSED--VVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIP--NKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGST

Query:  MMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGE-DNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELE
           +N + R     + V    D   ++EDF+DP +S S  SNTD E D G E S +  TP+GEFYDAW+E+S+D     S++ IE+EL E+RLSLLME+E
Subjt:  MMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGE-DNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELE

Query:  KRKQAEEALDNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYE
        KRKQ EEAL+ ++  WQRLRE ++ VGL +P DPT +TN   L    +EEL  Q+ IARFVS S+GRG+A+AEVE EME+ LE KNFEI RL DRLHYYE
Subjt:  KRKQAEEALDNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYE

Query:  AVNHEMSQRNQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSIN
        AVN EMSQRNQEA+++ARRER +RK+R RW+WGS+A  ITLG+A LAWSY+P+ K SS ++
Subjt:  AVNHEMSQRNQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSSSIN

AT5G66480.1 unknown protein; e value: 2.8e-77; % identity: 41.9
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF
        MPTF+  AL R L  GTS      L    P+   ++    N E   S    E+   RPQ+ P+LY T +  P P+SPSS+PPSPYI+NHK RGP L    
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSF

Query:  SE------DVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNG-HRNGL------DCATSSNVGQNGSVDGDHGATAVQHGSNHT
        SE       + S  +K++    GNV+V    + S S  +      ++ VD    + NG H  G+      DC+        G+   +     + +G   +
Subjt:  SE------DVVSSRQKMNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNG-HRNGL------DCATSSNVGQNGSVDGDHGATAVQHGSNHT

Query:  NNGSTMMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGED-NGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSL
        NN ++ +       E   +K      D   + E+F++P + +S  SNT+ ED    E S    T +GEFYDA +E+S+D     S + IE+E+REMRL L
Subjt:  NNGSTMMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGED-NGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSL

Query:  LMELEKRKQAEEALDNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDR
        LME+E+R+QAE  L+ ++  W+RLR+ L+ VG+ LP DPT +     L    A+EL  Q+ + RFVS ++G  +A+ EVE EMEA+LEAKNFEI RL DR
Subjt:  LMELEKRKQAEEALDNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDR

Query:  LHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSS
        LHYYE VN EMSQRNQEA+++ARR+  +RKRR RW+WGS+A  ITLG+ VLAWSYLP G  SS
Subjt:  LHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWSYLPSGKDSS


Sequences
CDS sequence
ATGCCAACATTTACTACAATTGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAACATATGCCTGCTTTGACCTTTAACCGTGC
TCCGACCACGAATTTGGAGAGGAGAAACAGCGCATCAGCTGCTGAAAGAAAAGTTCAGCGGCCTCAAATAAAGCCAGCTTTGTATACCACTCCAGAGGCAACTCCTCTCC
CGGATTCACCATCTTCATTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGACTTTTGAAGAGTTTCTCTGAGGATGTGGTCTCTTCTCGTCAAAAG
ATGAATGATAACGATATAGGAAATGTGAATGTGAATGTGAATGTGAATGGTTCAGATAGCAATGATGTAAAATTGAGTGAGGGTGCTTCTGTTACTGTTGACTTGCCTAT
TCCAAACAAAAATGGACATAGAAATGGTCTAGATTGTGCTACTAGTAGTAATGTTGGTCAAAATGGTAGCGTTGATGGTGATCATGGTGCTACTGCTGTTCAACACGGGA
GCAATCACACTAATAACGGAAGTACTATGATGGTGAGCAATGATGTCGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGCCAACTTTGGACAGTCTTGGAGATGCTGAA
GACTTCTTCGACCCACAGGATTCTTTGAGCGTTGCGAGTAACACAGATGGAGAGGACAATGGCTATGAACGTTCAGCCAAGTTCTGTACTCCTATGGGGGAATTTTATGA
TGCTTGGGAAGAGATGTCCTCTGATGGTTTGCCACATCCTTCTATTTCTGTTATTGAAGCTGAATTGCGTGAAATGAGACTATCCCTACTGATGGAACTAGAGAAACGAA
AGCAGGCTGAGGAAGCACTGGACAATTTGCGGGGCCAGTGGCAGAGGCTTAGAGAACATCTATCGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCACAAAT
GGGAATCTCTTATATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTAATATTGCTAGGTTCGTGTCGGGTTCTATTGGAAGGGGTATAGCAAGGGCAGAGGTGGAAAC
TGAGATGGAGGCACAGCTTGAAGCTAAGAATTTTGAGATTGCTCGATTGTTGGACCGGCTCCATTACTACGAGGCAGTGAATCATGAGATGTCCCAGAGGAATCAAGAGG
CTGTAGACTTGGCACGGCGTGAGAGGCTGAGAAGGAAAAGGAGGCTGAGATGGATGTGGGGTTCGGTTGCCACCGCAATCACACTCGGCACTGCAGTCTTAGCTTGGTCG
TACCTCCCATCAGGAAAAGATTCGTCATCCATCAACGATTCGAAGGCCACCGAGCATGATGATGCAACAGATTGA
mRNA sequence
TCTCTTTCTCCAACCCCATTTATTTTTCATCCTCTTTTTATGAGCGACGGGGAAGAGTCCACAGGAATCAAATCGAATCGAATCGAATCGATACTCTAAAATGGGGCTTT
CTGTCTTCTCTCGCAACTTTCCGACTCCTCCCCGATCCTCTGCTTTCCTTCTCACCTCTTGCGCCTAGGGTTTCTTTTTTTCTGCCTCCATCTTCCTTTTTGAGGTGATT
TTGTTGGATGTTCGAATAAAGGAATCATGCCAACATTTACTACAATTGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAACAT
ATGCCTGCTTTGACCTTTAACCGTGCTCCGACCACGAATTTGGAGAGGAGAAACAGCGCATCAGCTGCTGAAAGAAAAGTTCAGCGGCCTCAAATAAAGCCAGCTTTGTA
TACCACTCCAGAGGCAACTCCTCTCCCGGATTCACCATCTTCATTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGACTTTTGAAGAGTTTCTCTG
AGGATGTGGTCTCTTCTCGTCAAAAGATGAATGATAACGATATAGGAAATGTGAATGTGAATGTGAATGTGAATGGTTCAGATAGCAATGATGTAAAATTGAGTGAGGGT
GCTTCTGTTACTGTTGACTTGCCTATTCCAAACAAAAATGGACATAGAAATGGTCTAGATTGTGCTACTAGTAGTAATGTTGGTCAAAATGGTAGCGTTGATGGTGATCA
TGGTGCTACTGCTGTTCAACACGGGAGCAATCACACTAATAACGGAAGTACTATGATGGTGAGCAATGATGTCGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGCCAA
CTTTGGACAGTCTTGGAGATGCTGAAGACTTCTTCGACCCACAGGATTCTTTGAGCGTTGCGAGTAACACAGATGGAGAGGACAATGGCTATGAACGTTCAGCCAAGTTC
TGTACTCCTATGGGGGAATTTTATGATGCTTGGGAAGAGATGTCCTCTGATGGTTTGCCACATCCTTCTATTTCTGTTATTGAAGCTGAATTGCGTGAAATGAGACTATC
CCTACTGATGGAACTAGAGAAACGAAAGCAGGCTGAGGAAGCACTGGACAATTTGCGGGGCCAGTGGCAGAGGCTTAGAGAACATCTATCGCTTGTAGGATTGACCCTTC
CTTCAGATCCCACAGTAGCCACAAATGGGAATCTCTTATATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTAATATTGCTAGGTTCGTGTCGGGTTCTATTGGAAGG
GGTATAGCAAGGGCAGAGGTGGAAACTGAGATGGAGGCACAGCTTGAAGCTAAGAATTTTGAGATTGCTCGATTGTTGGACCGGCTCCATTACTACGAGGCAGTGAATCA
TGAGATGTCCCAGAGGAATCAAGAGGCTGTAGACTTGGCACGGCGTGAGAGGCTGAGAAGGAAAAGGAGGCTGAGATGGATGTGGGGTTCGGTTGCCACCGCAATCACAC
TCGGCACTGCAGTCTTAGCTTGGTCGTACCTCCCATCAGGAAAAGATTCGTCATCCATCAACGATTCGAAGGCCACCGAGCATGATGATGCAACAGATTGATAATTGGTG
ACACAAGGAAGTAGTTTATGTACCGTGTTATAGAAGGAAAAATGAAAGGGGAGGCATACATCAATTACATGTGGTATAATGTTGATATCAAACGTTGCTTTTGTCAGAAG
TATTCACATGCCGAAGGGTATAGTTGTTATAAATCATCGGGCCTTATGATGATGATTAAAGGTGAAAGAAATGGGTTGATTGTTGAGATATATATTTTGTTTTTTCCAGA
TCTAACAGGGTTATATATAACACCTGAAATTTTTGGCAACATATACACTCCGTTTACTGCTAACTGTAATGGAATACTGCGTTGAGCATCAAATAATTATTTGCGGACAA
GAGTTGGACCTCTGTGATGAACTCTTTCTGAATGAATAATGCCTGTTTTCATCCTTCCATTAGAGCTGTTGTGAGCGCTGAGAGCCATAATTTCTTGTCTTGTGAATAAG
CTTTCAACGTTTGCAGTTTGATGCCGTAAACGAGAGTTTTGTTGGGTTCTAATCTGTTAGGTCGACATTGAGCGATGTGT
Protein sequence
MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSRQK
MNDNDIGNVNVNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGSNHTNNGSTMMVSNDVAREKDSLKVVVPTLDSLGDAE
DFFDPQDSLSVASNTDGEDNGYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEALDNLRGQWQRLREHLSLVGLTLPSDPTVATN
GNLLYSDPAEELCQQVNIARFVSGSIGRGIARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWGSVATAITLGTAVLAWS
YLPSGKDSSSINDSKATEHDDATD