CuGenDBv2

MC11g0004 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0004
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: MC11:54769..67595
RNA-Seq Expression: MC11g0004
Synteny: MC11g0004
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
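The genome location field can be split into its sequence name and coordinates to estimate the size of the gene region. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1

seqid, start, end = parse_location("MC11:54769..67595")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the region covers 12,827 bp, far longer than the mRNA listed under Sequences, which would be consistent with the gene containing introns.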


Homology
GenBank top hits (e value, % identity, alignment)
XP_022133364.1 uncharacterized protein At1g76660 [Momordica charantia] | e value: 0.0 | % identity: 100
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

XP_022940824.1 uncharacterized protein At1g76660 [Cucurbita moschata] | e value: 2.26e-306 | % identity: 90.97
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNR PQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASDV EKETC+EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        LCNGC+D+KLQRQPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

XP_022981333.1 uncharacterized protein At1g76660-like [Cucurbita maxima] | e value: 2.34e-306 | % identity: 90.76
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNR PQQER KRWGGCWGALSCF SQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEK KST+ T+ SQRS+K ASD+VEKETC+EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        LCNGC+DNKLQRQPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

XP_023524439.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo] | e value: 1.22e-308 | % identity: 91.18
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNR PQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASDVVEKETC+EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        LCNGC+D+KLQRQPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

XP_038899313.1 uncharacterized protein At1g76660 [Benincasa hispida] | e value: 5.34e-307 | % identity: 92.18
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCF+SLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFG+EK  TSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD YSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRH+KSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEESIEPPLLGEKLKST TT+QSQRS+K A +VVEKETC EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSW
        LCNG +DNKLQRQPGNMSGSS+S NQVE +D+FSRI   KNSRKYNLGLS SDAEVDYRRGRSLRE KGD SW
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1G3 Uncharacterized protein | e value: 5.04e-295 | % identity: 89.26
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+ QLVSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TT+QSQRS+K A +    ETC E+  
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        LCNG +DNKLQRQPG++SGSS+S NQVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

A0A1S3BV86 uncharacterized protein At1g76660 | e value: 2.61e-298 | % identity: 89.89
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TT+Q+QRS+K A +VVEKETC EV  
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        LCNG +DNKLQRQPG++ GSS+S +QVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

A0A6J1BVS7 uncharacterized protein At1g76660 | e value: 0.0 | % identity: 100
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
        LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD

A0A6J1FRS7 uncharacterized protein At1g76660 | e value: 1.09e-306 | % identity: 90.97
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNR PQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASDV EKETC+EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        LCNGC+D+KLQRQPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

A0A6J1IZ74 uncharacterized protein At1g76660-like | e value: 1.13e-306 | % identity: 90.76
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNR PQQER KRWGGCWGALSCF SQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEK KST+ T+ SQRS+K ASD+VEKETC+EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLP

Query:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD
        LCNGC+DNKLQRQPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  LCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 | e value: 2.9e-127 | % identity: 57.84
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSE      Q++RKRWGGC G  SCF SQKGGKRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHE QLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-
        +L SPISR SGD L                 SPQ+GK  RS SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  + SQ S K  +D+  +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKET

Query:  CAEVLPLCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE
          +     N  +D+K + +              + E + SR+   K SR Y+  +S SDAEV+YRRGRSLRE
Subjt:  CAEVLPLCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | e value: 9.3e-28 | % identity: 40
Query:  RRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF
        ++++W   W  L CF S +  KRI  +  +PE  ++++  +    +G  +  T +    +APPSSPASF  S  PS  QSP   LS S   P     ++F
Subjt:  RRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSG
        A GPYAHE QLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP   + G
Subjt:  ATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSG

Query:  DCLSSSFP--ERDFPPQWNPSSSPQ
           +S FP  E    P +  S  P+
Subjt:  DCLSSSFP--ERDFPPQWNPSSSPQ

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | e value: 2.1e-128 | % identity: 57.84
Query:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSE      Q++RKRWGGC G  SCF SQKGGKRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHE QLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-
        +L SPISR SGD L                 SPQ+GK  RS SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  + SQ S K  +D+  +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKET

Query:  CAEVLPLCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE
          +     N  +D+K + +              + E + SR+   K SR Y+  +S SDAEV+YRRGRSLRE
Subjt:  CAEVLPLCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLRE

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | e value: 4.2e-28 | % identity: 43.4
Query:  SEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS
        S ++R      +K+ G  W    CF S+K  KRI  A  +PE  A +     P     +N  ++  P  +APPSSPASF  S  PS      P    SL+
Subjt:  SEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKTNYIASNDLQAAYSLYPGSP
         N P  PS+  F  GPYAHE Q V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YPGSP
Subjt:  ANSPGGPSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKTNYIASNDLQAAYSLYPGSP

Query:  ASSLVSPISRTS
          +L+SP S TS
Subjt:  ASSLVSPISRTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | e value: 1.6e-32 | % identity: 46.08
Query:  PQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG
        P   ++ RWG CW   SCF +QK  KRI  A  +PE   VT+              TV+ P  +APPSSPASF  S   S   SP   LSL++N  SP  
Subjt:  PQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG

Query:  PSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAY-----SLYPGSP-ASSL
        P S +F  GPYA+E Q V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L     T+ +      + Y      + PGSP   +L
Subjt:  PSSTMFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAY-----SLYPGSP-ASSL

Query:  VSPISRTSGDCLSSSFP
        +SP S  S    SS +P
Subjt:  VSPISRTSGDCLSSSFP
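In the alignments above, the unlabelled line between each Query and Subjt line marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough illustration of how a percent-identity figure relates to that line, the sketch below counts identities in one block. The function name `block_identity` is hypothetical, and it assumes the caller has already removed the 8-character `Query:  ` prefix and the matching leading spaces of the match line.

```python
def block_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, columns, percent identity) for one alignment block."""
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Trailing spaces of the match line are often trimmed in copied text, so pad it back.
    midline = midline.ljust(columns)[:columns]
    # Letters on the match line mark identical residues; '+' and spaces do not count.
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, columns, round(100.0 * identities / columns, 2)

# First ten columns of the Cucurbita moschata alignment shown above.
print(block_identity("MGSEQNRFPQ", "MGSEQNR PQ"))
```

The % identity reported for each hit covers the whole alignment, so a single block will generally give a different value; summing identities and columns over all blocks of a hit should approach the reported figure.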


Sequences
CDS sequence
ATGGGGTCCGAGCAGAACAGATTCCCTCAGCAGGAACGGCGGAAGAGATGGGGTGGATGTTGGGGTGCATTATCCTGCTTTCACTCTCAGAAAGGTGGAAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGCAATGCTGTGACAACCCAGCCAAATGGACCTCAAGCAGCTGGAATGACCAACCAGGCTACAGTTATAGCTCCATCCCTTCTTGCCC
CACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACCGTTCAGTCACCTAGCTGTTTCTTGTCCTTATCTGCCAACTCACCTGGAGGTCCTTCATCGACA
ATGTTTGCTACAGGGCCGTATGCCCATGAACCACAACTGGTCTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCATCAACCGCTCCACTCACTCCCCCACCCGAACT
AGCTCACCTAACTACGCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTCTCCTCATCAGTGGATCTCAAAGGAACTGGAAAGACAAATTACATTGCTTCAAATGATC
TTCAAGCAGCATATTCTCTCTATCCTGGAAGTCCAGCCAGTAGCCTCGTATCACCAATTTCAAGGACATCCGGCGACTGCCTATCATCTTCATTTCCTGAGAGGGACTTC
CCCCCACAGTGGAATCCTTCATCTTCTCCCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACACGAGAAAACAGGAACATCTTTGGCATCTCAAGA
TTCCAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACCGGTGGGAGGTTGAGTGTATCAAAGGATTCAGATGTTTACTCTT
CTGGAGGGAACGGTTATCAAAACCGGCACAGCAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCATTTGGTTTCAGTGCAGATGAAATTATAACT
ACTACACAATATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCGGCAGAAGAAAGTATTGAACCTCCATTGCTAGG
TGAAAAACTAAAATCCACAAAGACAACTATACAGAGTCAGAGAAGTATGAAGCCAGCATCTGATGTTGTCGAAAAGGAAACCTGCGCTGAAGTGCTGCCATTATGCAATG
GCTGTGAAGACAATAAATTGCAAAGACAACCTGGTAACATGTCAGGATCAAGTTCCTCTTTCAACCAAGTTGAAACAGAAGATGTATTCTCAAGGATAGTGCCACCCAAA
AATAGTCGCAAGTATAATCTTGGATTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAGGTCAAGGGAGATTTTTCATGGCACGACTAA
mRNA sequence
GAGAGATGAAAAACGGAACTAAATTAAAAATTGAGAAAAATGGCGCAAAAGAAAGGTAAAATTGAAATAGAGGGAAAATAAAATGAAAAAGCAAAAGGGCATCTTTTCTC
AGTCGCTCTTTTCCGCGTTGCGAGAGCAAATCTAAATGCTGTCAGATTGTGCACTGTGAATTGAGAAATATCAGGGTTGATATGGGTCTGACTCCCATTAGTCGTTAAAT
TGACGAAGTTGACGGAGGAGGAGGAGCAGCAGCAGCAGCAGGAAGAGGAGGAGGAGGAGGAGGGCCAAGAGTCTTGTAGGAAGGTGCAGCTCAGGACACTAAAATATAAA
TTAACTTAGTCGAGCTACTAGTTACGGATGGGGTCCGAGCAGAACAGATTCCCTCAGCAGGAACGGCGGAAGAGATGGGGTGGATGTTGGGGTGCATTATCCTGCTTTCA
CTCTCAGAAAGGTGGAAAGCGCATTGTACCTGCATCTCGTTTACCTGAGGGCAATGCTGTGACAACCCAGCCAAATGGACCTCAAGCAGCTGGAATGACCAACCAGGCTA
CAGTTATAGCTCCATCCCTTCTTGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACCGTTCAGTCACCTAGCTGTTTCTTGTCCTTATCTGCC
AACTCACCTGGAGGTCCTTCATCGACAATGTTTGCTACAGGGCCGTATGCCCATGAACCACAACTGGTCTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCATCAAC
CGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACTACGCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTCTCCTCATCAGTGGATCTCAAAGGAACTGGAA
AGACAAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTATCCTGGAAGTCCAGCCAGTAGCCTCGTATCACCAATTTCAAGGACATCCGGCGACTGCCTA
TCATCTTCATTTCCTGAGAGGGACTTCCCCCCACAGTGGAATCCTTCATCTTCTCCCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACACGAGAA
AACAGGAACATCTTTGGCATCTCAAGATTCCAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACCGGTGGGAGGTTGAGTG
TATCAAAGGATTCAGATGTTTACTCTTCTGGAGGGAACGGTTATCAAAACCGGCACAGCAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCATTT
GGTTTCAGTGCAGATGAAATTATAACTACTACACAATATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCGGCAGA
AGAAAGTATTGAACCTCCATTGCTAGGTGAAAAACTAAAATCCACAAAGACAACTATACAGAGTCAGAGAAGTATGAAGCCAGCATCTGATGTTGTCGAAAAGGAAACCT
GCGCTGAAGTGCTGCCATTATGCAATGGCTGTGAAGACAATAAATTGCAAAGACAACCTGGTAACATGTCAGGATCAAGTTCCTCTTTCAACCAAGTTGAAACAGAAGAT
GTATTCTCAAGGATAGTGCCACCCAAAAATAGTCGCAAGTATAATCTTGGATTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAGGTCAA
GGGAGATTTTTCATGGCACGACTAAGAGAGCCATCTCAAAAATAGTTTTCAGTTTGTTTGTGTATCTGTTTTCTGCTTTGCAGGTTTCCATGGAATGTCTAACCTATGAT
CTAACCTGTTGTCTCTCTCAGTTATGACTTGGTCACTGGACGGATGAACAATTATAATTTGTGTTCCTAATCGTGCAAATTTAGATATGGAATGGGTCGAATTTCTGTAC
TTGATAGATGAAGTTGACTTTTTTCAGTGAATTGTATACTAGCACCGTACAATCTGCTTGCCCATTGTGTACACCCATCGTCACTATGCTTTATTCCATCTAAACTGCTG
TTCGTCTGACCTTTTTCTTCGAGGGAGATAAACAAGTGATTT
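The mRNA sequence above contains the CDS plus flanking untranslated regions. A minimal sketch of locating the CDS inside the mRNA and measuring the 5' and 3' flanks follows; the function name `utr_lengths` is hypothetical, and the toy sequences in the example are made up for illustration rather than taken from this gene.

```python
def utr_lengths(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) given an mRNA and the CDS it contains."""
    # Remove line breaks and normalize case, since the page wraps sequences across lines.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Toy example: 2 bases upstream and 2 bases downstream of the CDS.
print(utr_lengths("GGATGAAATAACC", "ATGAAATAA"))
```

Applied to the full sequences on this page, the first value would give the length of the 5' UTR and the second the length of the 3' UTR.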
Protein sequence
MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST
MFATGPYAHEPQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF
PPQWNPSSSPQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQPGNMSGSSSSFNQVETEDVFSRIVPPK
NSRKYNLGLSCSDAEVDYRRGRSLREVKGDFSWHD
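The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below builds the codon table from the usual compact TCAG ordering; the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the CDS starts in frame at its first base, as the leading ATG suggests.

```python
BASES = "TCAG"
# Standard genetic code, listed with the first, second, and third base each cycling through TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 33 bases of the CDS above.
print(translate("ATGGGGTCCGAGCAGAACAGATTCCCTCAGCAG"))  # MGSEQNRFPQQ
```

The translated prefix matches the start of the protein sequence; running `translate` on the full CDS and comparing the result with the protein sequence would confirm the two records agree.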