; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021736 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021736
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationscaffold1:73727..75405
RNA-Seq ExpressionMS021736
SyntenyMS021736
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037989.1 hypothetical protein SDJN02_01622, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-23590.95Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
        Q KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMF
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE
        SSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEESI+PPL+GEKLKST+ T+QSQRS+K ASDVVEKETC+EVL LCNGC+D+KLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR

Query:  QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        QPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

XP_022133364.1 uncharacterized protein At1g76660 [Momordica charantia]3.8e-26299.78Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
        +RKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE
        SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR

Query:  QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
Subjt:  QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

XP_022940824.1 uncharacterized protein At1g76660 [Cucurbita moschata]1.7e-23391.13Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFAT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASD VEKETC+EVL LCNGC+D+KLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        GN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

XP_022981333.1 uncharacterized protein At1g76660-like [Cucurbita maxima]1.3e-23390.91Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCF SQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFAT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEK KST+ T+ SQRS+K ASD+VEKETC+EVL LCNGC+DNKLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        GN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

XP_023524439.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo]2.3e-23591.34Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFAT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASDVVEKETC+EVL LCNGC+D+KLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        GN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

TrEMBL top hitse value%identityAlignment
A0A5A7VFM0 Uncharacterized protein7.3e-22789.8Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST++AT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGK NYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TT+Q+QRS+K A +VVEKETC EV  LCNG +DNKLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        G++ GSS+S +QVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

A0A5D3D8J8 Uncharacterized protein7.3e-22789.8Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST++AT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGK NYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TT+Q+QRS+K A +VVEKETC EV  LCNG +DNKLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        G++ GSS+S +QVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

A0A6J1BVS7 uncharacterized protein At1g766601.8e-26299.78Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
        +RKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE
        SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQR

Query:  QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
Subjt:  QPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

A0A6J1FRS7 uncharacterized protein At1g766608.1e-23491.13Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFAT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASD VEKETC+EVL LCNGC+D+KLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        GN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

A0A6J1IZ74 uncharacterized protein At1g76660-like6.2e-23490.91Show/hide
Query:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT
        KRWGGCWGALSCF SQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFAT
Subjt:  KRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
        GPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS
Subjt:  GPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSS

Query:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI
        FPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVEEI
Subjt:  FPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEI

Query:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP
        EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEK KST+ T+ SQRS+K ASD+VEKETC+EVL LCNGC+DNKLQRQP
Subjt:  EAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQP

Query:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        GN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  GNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766601.1e-12658.48Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSS
        QRKRWGGC G  SCF SQKGGKRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+LSL+ANSPGGPSS
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSS

Query:  TMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGD
        +M+ATGPYAHE QLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS+L SPISR SGD
Subjt:  TMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGD

Query:  CLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG--GNGYQNRHSKS
         L                 SPQ+GK  RS SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +   GNG QNR ++S
Subjt:  CLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG--GNGYQNRHSKS

Query:  PKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCE
        PKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  + SQ S K  +D+  +    +     N  +
Subjt:  PKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCE

Query:  DNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE
        D+K + +              + E + SR+   K SR Y+  +S SDAEV+YRRGRSLRE
Subjt:  DNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)9.0e-2840Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
        ++++W   W  L CF S +  KRI  +  +PE  ++++  +    +G  +  T +    +APPSSPASF  S  PS  QSP   LS S   P     ++F
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSG
        A GPYAHE QLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP   + G
Subjt:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSG

Query:  DCLSSSFP--ERDFPPQWNPSSSPQ
           +S FP  E    P +  S  P+
Subjt:  DCLSSSFP--ERDFPPQWNPSSSPQ

AT1G76660.1 FUNCTIONS IN: molecular_function unknown7.7e-12858.48Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSS
        QRKRWGGC G  SCF SQKGGKRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+LSL+ANSPGGPSS
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSS

Query:  TMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGD
        +M+ATGPYAHE QLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS+L SPISR SGD
Subjt:  TMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGD

Query:  CLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG--GNGYQNRHSKS
         L                 SPQ+GK  RS SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +   GNG QNR ++S
Subjt:  CLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG--GNGYQNRHSKS

Query:  PKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCE
        PKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  + SQ S K  +D+  +    +     N  +
Subjt:  PKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCE

Query:  DNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE
        D+K + +              + E + SR+   K SR Y+  +S SDAEV+YRRGRSLRE
Subjt:  DNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE

AT4G25620.1 hydroxyproline-rich glycoprotein family protein9.0e-2844.78Show/hide
Query:  RKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPST--VQSPSCFLSLSANSPGGPSSTM
        +K+ G  W    CF S+K  KRI  A  +PE  A +     P     +N  ++  P  +APPSSPASF  S  PS      P    SL+ N P  PS+  
Subjt:  RKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPST--VQSPSCFLSLSANSPGGPSSTM

Query:  FATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRT
        F  GPYAHE Q V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YPGSP  +L+SP S T
Subjt:  FATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRT

Query:  S
        S
Subjt:  S

AT5G52430.1 hydroxyproline-rich glycoprotein family protein2.7e-3246.95Show/hide
Query:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGGPSST
        Q+ RWG CW   SCF +QK  KRI  A  +PE   VT+              TV+ P  +APPSSPASF  S   S   SP   LSL++N  SP  P S 
Subjt:  QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGGPSST

Query:  MFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAY-----SLYPGSP-ASSLVSPI
        +F  GPYA+E Q V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L     T+ +      + Y      + PGSP   +L+SP 
Subjt:  MFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAY-----SLYPGSP-ASSLVSPI

Query:  SRTSGDCLSSSFP
        S  S    SS +P
Subjt:  SRTSGDCLSSSFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAGCGGAAGAGATGGGGTGGATGTTGGGGTGCATTATCCTGCTTTCACTCTCAGAAAGGTGGAAAGCGCATTGTACCTGCATCTCGTTTACCTGAGGGCAATGCTGTGAC
AACCCAGCCAAATGGACCTCAAGCAGCGGGAATGACCAACCAGGCTACAGTTATAGCTCCATCCCTTCTTGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCAC
TCCCTTCAACCGTTCAGTCACCTAGCTGTTTCTTGTCCTTATCTGCCAACTCACCTGGAGGTCCTTCATCGACAATGTTTGCTACAGGGCCGTATGCCCATGAACCACAA
CTGGTCTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCATCAACCGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACTACGCCTTCTTCCCCTGATGTGCC
TTTTGCTCAGTTCCTCTCCTCATCAGTGGATCTCAAAGGAACTGGAAAGACAAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTATCCTGGAAGTCCAG
CCAGTAGCCTCGTATCACCAATTTCAAGGACATCCGGCGACTGCCTATCATCTTCATTTCCTGAGAGGGACTTCCCCCCACAGTGGAATCCTTCATCTTCTCCCCAAGAT
GGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACACGAGAAAACAGGAACATCTTTGGCATCTCAAGATTCCAATTTCTTCTGCCCTGCTACATTTGCACAATT
CTATCTGGACAATCCACCATTCCCTCATACCGGTGGGAGGTTGAGTGTATCAAAGGATTCAGATGTTTACTCTTCTGGAGGGAACGGTTATCAAAACCGGCACAGCAAGT
CTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCATTTGGTTTCAGTGCAGATGAAATTATAACTACTACACAATATGTGGAGATATCTGATGTAATGGAG
GATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCGGCAGAAGAAAGTATTGAACCTCCATTGCTAGGTGAAAAACTAAAATCCACAAAGACAACTATACAGAG
TCAGAGAAGTATGAAGCCAGCATCTGATGTTGTCGAAAAGGAAACCTGCGCTGAAGTGCTGCCATTATGCAATGGCTGTGAAGACAATAAATTGCAAAGACAACCTGGTA
ACATGTCAGGATCAAGTTCCTCTTTCAACCAAGTTGAAACAGAAGATGTATTCTCAAGGATAGTGCCACCCAAAAATAGTCGCAAGTATAATCTTGGATTATCCTGCTCT
GATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAGGTCAAGGGAGATTTTTCATGGCACGAC
mRNA sequenceShow/hide mRNA sequence
CAGCGGAAGAGATGGGGTGGATGTTGGGGTGCATTATCCTGCTTTCACTCTCAGAAAGGTGGAAAGCGCATTGTACCTGCATCTCGTTTACCTGAGGGCAATGCTGTGAC
AACCCAGCCAAATGGACCTCAAGCAGCGGGAATGACCAACCAGGCTACAGTTATAGCTCCATCCCTTCTTGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCAC
TCCCTTCAACCGTTCAGTCACCTAGCTGTTTCTTGTCCTTATCTGCCAACTCACCTGGAGGTCCTTCATCGACAATGTTTGCTACAGGGCCGTATGCCCATGAACCACAA
CTGGTCTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCATCAACCGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACTACGCCTTCTTCCCCTGATGTGCC
TTTTGCTCAGTTCCTCTCCTCATCAGTGGATCTCAAAGGAACTGGAAAGACAAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTATCCTGGAAGTCCAG
CCAGTAGCCTCGTATCACCAATTTCAAGGACATCCGGCGACTGCCTATCATCTTCATTTCCTGAGAGGGACTTCCCCCCACAGTGGAATCCTTCATCTTCTCCCCAAGAT
GGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACACGAGAAAACAGGAACATCTTTGGCATCTCAAGATTCCAATTTCTTCTGCCCTGCTACATTTGCACAATT
CTATCTGGACAATCCACCATTCCCTCATACCGGTGGGAGGTTGAGTGTATCAAAGGATTCAGATGTTTACTCTTCTGGAGGGAACGGTTATCAAAACCGGCACAGCAAGT
CTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCATTTGGTTTCAGTGCAGATGAAATTATAACTACTACACAATATGTGGAGATATCTGATGTAATGGAG
GATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCGGCAGAAGAAAGTATTGAACCTCCATTGCTAGGTGAAAAACTAAAATCCACAAAGACAACTATACAGAG
TCAGAGAAGTATGAAGCCAGCATCTGATGTTGTCGAAAAGGAAACCTGCGCTGAAGTGCTGCCATTATGCAATGGCTGTGAAGACAATAAATTGCAAAGACAACCTGGTA
ACATGTCAGGATCAAGTTCCTCTTTCAACCAAGTTGAAACAGAAGATGTATTCTCAAGGATAGTGCCACCCAAAAATAGTCGCAAGTATAATCTTGGATTATCCTGCTCT
GATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAGGTCAAGGGAGATTTTTCATGGCACGAC
Protein sequenceShow/hide protein sequence
QRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFATGPYAHEPQ
LVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSSSPQD
GKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVME
DSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCS
DAEVDYRRGRSLREVKGDFSWHD