; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036644 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036644
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationchr2:103553..116193
RNA-Seq ExpressionLag0036644
SyntenyLag0036644
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608673.1 hypothetical protein SDJN03_02015, partial [Cucurbita argyrosperma subsp. sororia]6.5e-23892.26Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVV TQ NGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN SASLQDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL A+ES++PPL+GEKLKSTQ TLQSQRSIKSAS+VVEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR
        LCNG KE+KLQRQPGN  G STS  Q ETED+FS++G S+NSRKYNH LSCSDAEVDYRRGRSLR
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR

XP_022133364.1 uncharacterized protein At1g76660 [Momordica charantia]3.7e-24192.93Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRFPQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN V TQ NGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN S+S QDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVYSSGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES+EPPLLGEKLKST+TT+QSQRS+K ASDVVEKETC EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLREV
        LCNG ++NKLQRQPGN SG S+SFNQVETEDVFS++ P +NSRKYN GLSCSDAEVDYRRGRSLREV
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLREV

XP_022940824.1 uncharacterized protein At1g76660 [Cucurbita moschata]7.1e-23790.97Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVV TQ NGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN SASLQDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES++PPL+GEKLKSTQ TLQSQRSIKSASD VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD
        LCNG K++KLQRQPGN  G STS  Q ETED+FS++G S+NSRKYNH LSCSDAEVDYRRGRSLR EV  + L +D
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD

XP_022981333.1 uncharacterized protein At1g76660-like [Cucurbita maxima]7.1e-23790.76Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVV TQ NGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN SASLQDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES++PPL+GEK KSTQ TL SQRSIKSASD+VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD
        LCNG K+NKLQRQPGN  G STS  Q ETED+FS++G S+NSRKYNH LSCSDAEVDYRRGRSLR EV  + L +D
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD

XP_023524439.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo]1.3e-23891.18Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVV TQ NGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN SASLQDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTSTSLSA+ES++PPL+GEKLKSTQ TLQSQRSIKSASDVVEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD
        LCNG K++KLQRQPGN  G STS  Q ETED+FS++G S+NSRKYNH LSCSDAEVDYRRGRSLR EV  + L +D
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD

TrEMBL top hitse value%identityAlignment
A0A0A0L1G3 Uncharacterized protein4.5e-22990.34Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVV TQ NGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+TQLVSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN+SASLQDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPH GGRLSVSKD DVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES EPPLLGEKLKS+ TTLQSQRSIKSA +    ETCTE+ A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE
        LCNG+K+NKLQRQPG+ SG STS NQVE +DVFS++G S+NSRKY+ GLSCSDAEVDYRRGRSLRE
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE

A0A1S3BV86 uncharacterized protein At1g766605.7e-23291.2Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVV TQ NGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN+SASLQDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPH GGRLSVSKD DVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES EPPLLGEKLKS+ TTLQ+QRSIKSA +VVEKETCTEV A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE
        LCNG+K+NKLQRQPG+  G STS +QVE +DVFS++G S+NSRKY+ GLSCSDAEVDYRRGRSLRE
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE

A0A6J1BVS7 uncharacterized protein At1g766601.8e-24192.93Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRFPQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN V TQ NGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN S+S QDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVYSSGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES+EPPLLGEKLKST+TT+QSQRS+K ASDVVEKETC EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLREV
        LCNG ++NKLQRQPGN SG S+SFNQVETEDVFS++ P +NSRKYN GLSCSDAEVDYRRGRSLREV
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLREV

A0A6J1FRS7 uncharacterized protein At1g766603.5e-23790.97Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVV TQ NGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN SASLQDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES++PPL+GEKLKSTQ TLQSQRSIKSASD VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD
        LCNG K++KLQRQPGN  G STS  Q ETED+FS++G S+NSRKYNH LSCSDAEVDYRRGRSLR EV  + L +D
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD

A0A6J1IZ74 uncharacterized protein At1g76660-like3.5e-23790.76Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVV TQ NGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWN SASLQDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKD DVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKD-DVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA+ES++PPL+GEK KSTQ TL SQRSIKSASD+VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD
        LCNG K+NKLQRQPGN  G STS  Q ETED+FS++G S+NSRKYNH LSCSDAEVDYRRGRSLR EV  + L +D
Subjt:  LCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLR-EVFMECLTYD

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766601.9e-12357.42Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVITQSNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +Q NG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVITQSNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHAGGRLSVSKD-DVYSSG-
        +L SPISR SGD L S                 Q+GK  RS SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKD DVY +  
Subjt:  SLVSPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHAGGRLSVSKD-DVYSSG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  L SQ S KS +D+  +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET

Query:  CTEVLALCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE
          +     N +K++K + +              + E + S++G  + SR Y+  +S SDAEV+YRRGRSLRE
Subjt:  CTEVLALCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)5.2e-2842.03Show/hide
Query:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFAT
        ++W   W  L CF S +  KRI  +  +PE   + + ++    +G  +  T +    +APPSSPASF  S  PS  QSP   LS S   P     ++FA 
Subjt:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFAT

Query:  GPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC
        GPYAHETQLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP   + G  
Subjt:  GPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC

Query:  LSSSFPE
         +S FP+
Subjt:  LSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.3e-12457.42Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVITQSNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +Q NG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVITQSNGPQAAGMTNQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHAGGRLSVSKD-DVYSSG-
        +L SPISR SGD L S                 Q+GK  RS SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKD DVY +  
Subjt:  SLVSPISRTSGDCLSSSFPERDFPPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPHAGGRLSVSKD-DVYSSG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  L SQ S KS +D+  +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET

Query:  CTEVLALCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE
          +     N +K++K + +              + E + S++G  + SR Y+  +S SDAEV+YRRGRSLRE
Subjt:  CTEVLALCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRNSRKYNHGLSCSDAEVDYRRGRSLRE

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.3e-0727.96Show/hide
Query:  LVSDEQSTFVPGRLISDNVVIRFECLHAIASKTRGKTDNVVIRFECLHAIASKTRGKTSNVAMKLDMSKAYDRVEWSFLRQMMLQMGFQTIWV
        L+   Q++F+PGR+ +DN+V   E +H++  K                      +G    + +KLD+ KAYDR+ W +L   ++  GF  +W+
Subjt:  LVSDEQSTFVPGRLISDNVVIRFECLHAIASKTRGKTDNVVIRFECLHAIASKTRGKTSNVAMKLDMSKAYDRVEWSFLRQMMLQMGFQTIWV

AT4G25620.1 hydroxyproline-rich glycoprotein family protein6.8e-2843.72Show/hide
Query:  SEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTN---QATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFL
        S ++R       K+ G  W    CF S+K  KRI  A  +PE       ++G   A + N    +T I    +APPSSPASF  S  PS + +  P    
Subjt:  SEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTN---QATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQAAYSLYP
        SL+ N P  PS+  F  GPYAHETQ V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YP
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQAAYSLYP

Query:  GSPASSLVSPISRTS
        GSP  +L+SP S TS
Subjt:  GSPASSLVSPISRTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein9.2e-3346.61Show/hide
Query:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE----GNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--
        P   +  RWG CW   SCF +QK  KRI  A  +PE    G  V+T  N           TV+ P  +APPSSPASF  S   S + SP   LSL++N  
Subjt:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE----GNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--

Query:  SPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-AAYSLYPGSP-
        SP  P S +F  GPYA+ETQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      +G     +S+  +  +  + PGSP 
Subjt:  SPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-AAYSLYPGSP-

Query:  ASSLVSPISRTSGDCLSSSFP
          +L+SP S  S    SS +P
Subjt:  ASSLVSPISRTSGDCLSSSFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCCGAGCAGAACAGGTTCCCTCAGCAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGCAATGTCGTGATAACCCAGTCAAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAGCTCCGTCCCTTCTAGCCC
CACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCTCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTTGCCACAGGGCCATATGCCCACGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCTGAACT
AGCTCACCTAACCACACCTTCTTCCCCCGATGTGCCTTTTGCTCAGTTCTTATCCTCTTCAGTGGATCTCAAAGGAACTGGAAAGGCAAATTACATTGCTTCAAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCGAGTAGCCTCGTCTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCATTTCCTGAGAGGGACTTC
CCACCACAGTGGAATACTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACATCTTTGGCATCTCAGGA
TTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTAGACAATCCTCCATTCCCTCATGCTGGTGGAAGGTTAAGTGTATCAAAGGATGATGTTTACTCCTCTG
GTGGGAATGGATACCAAAATCGGCATAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCGTCATTTGGTTTCAGTGCAGATGAAATTATAACTACT
ACACAATATGTGGAGATATCTGATGTGATGGAGGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCAGCAGATGAAAGTATGGAGCCTCCATTGTTGGGTGA
AAAACTGAAATCCACACAGACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCATCTGATGTTGTCGAAAAGGAAACCTGCACCGAAGTGCTGGCATTATGCAATGGCC
ATAAAGAAAATAAATTGCAAAGACAACCTGGTAACACGTCAGGACCAAGTACTTCTTTCAACCAAGTTGAAACAGAAGATGTATTCTCAAAGATGGGGCCATCCAGAAAC
AGTCGCAAGTATAATCATGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGAGAGGTTTTCATGGAATGTCTAACCTATGATCTAACCTG
TTATTCGGAATTGGTTTCGCAACATGACAGCATCCGTGGAATGATCTGGGAGGTAACTCCGGCAGCGTTGTTTGTTCGGTCAACCCGACTGATTGGGAAGTGGGAACTGA
AATCAGAGGTGGTCGTTGATACGGTGCTGCGTTGTGTCACTCCTACAGTTAGTGATGTACAGAATCAGGCTTTGCTCAGGGAGTTCACTCGGGCTGAGGTGGAGGCAGCC
TTGAAAAGCATAGGCCCCACGAAGGCTCCAGGCTCCGATGGGGTTAATGCTCTATTTTACCAACGACACTGGGATTTGATCGGTGACGAGACTTCCGCGCTTTGCCTTGT
CTCTGATGAGCAATCGACATTTGTGCCTGGGCGATTGATCTCAGATAACGTGGTCATCAGGTTTGAGTGCTTGCACGCGATTGCTAGTAAAACGAGAGGGAAAACTGATA
ACGTGGTCATCAGGTTTGAGTGCTTGCACGCGATTGCTAGTAAAACGAGAGGGAAAACTAGTAATGTGGCGATGAAGTTGGATATGAGCAAGGCCTATGATCGGGTGGAA
TGGAGTTTTCTAAGGCAAATGATGCTTCAAATGGGGTTTCAGACAATATGGGTCGAGCTAGTTATGAGTTGTGTTGAGTCAGTTTGGTTTTTCAGTTTTGCTAAATGGAA
GGGTCTTTGTGCAATGCTTCATTATGAGATGTCTTGTCGAAATTATATAGGCCTGCAAATTAATAAATATTGTTCGGTCATTTCTCCTCTTTTTTATTCTAATGACAATC
TTATCTTCTTTCGTGCTGTTAAGCGTGACTGTCTGACCATTAAATCTGTTTTACATACTTATGAGTTGGCTTTTGGACAGGTTATTAATTATGATAAATCAATGTTCATG
GTCAGCAGGAACACTAGCCCTGACATGCAACAATTTATTAGGAGCACATTGGGAGTGGTTCAAACGAGCCCACTTGGTCGATATTTGGGGCTCCCTTCGCAGAATGCTCG
GGCAAAGTGTGTGATCTTTCGATCAGTTAGGGATCGGGTTTGGAAGGCTATTCTGGCTAAGCAGTGTTGGCGGTTGTCCCGAGATCAGTCTAGCCTTTGTATAGAGTTTT
GCGGGGTAGGTATTTTCGTACGGGTTCCTTCCTTCGGACAACATTGGGGTCGAACCCGTCATATGCATGGTGGAGCCTATTGTGGGGGCGAGAACTATTTAGATGAGGAG
GAGCTTCGTTCAAAGACGATGTCCTCGCTGATTATGGAGGATGGGTCCTGGGATGTGGAGAAGGTGAGGAGGGAATTTCTGCGGGACGATGCTGAACATATTTTGGCTAT
ACCATTAAGTGGTCAGAGAGAGGACAATGAGATATATTGGGCGTCGGATGGTAACTTTGCTTCTGTGGGTTACTCTTATAAGTACCCTAGATTTTATCCCCTGGGACAAT
CATCTGTTAATGAAACAACAATTCTCTTCCATTTGGTTTACACCAAATTTGGTATCAGAGCAACCAAGAAATTAGGTCTGATGACTGGAAAAGGCGCTGGCCAATCCAGT
AAAGGGCGTGAGGTGGACCCTTCAAACCTACCTGAAATTTTTTCTCCAAGACTACCACTGAACGCTTGCTGTCGGTTGAAAGTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTCCGAGCAGAACAGGTTCCCTCAGCAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGCAATGTCGTGATAACCCAGTCAAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAGCTCCGTCCCTTCTAGCCC
CACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCTCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTTGCCACAGGGCCATATGCCCACGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCTGAACT
AGCTCACCTAACCACACCTTCTTCCCCCGATGTGCCTTTTGCTCAGTTCTTATCCTCTTCAGTGGATCTCAAAGGAACTGGAAAGGCAAATTACATTGCTTCAAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCGAGTAGCCTCGTCTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCATTTCCTGAGAGGGACTTC
CCACCACAGTGGAATACTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACATCTTTGGCATCTCAGGA
TTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTAGACAATCCTCCATTCCCTCATGCTGGTGGAAGGTTAAGTGTATCAAAGGATGATGTTTACTCCTCTG
GTGGGAATGGATACCAAAATCGGCATAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCGTCATTTGGTTTCAGTGCAGATGAAATTATAACTACT
ACACAATATGTGGAGATATCTGATGTGATGGAGGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCAGCAGATGAAAGTATGGAGCCTCCATTGTTGGGTGA
AAAACTGAAATCCACACAGACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCATCTGATGTTGTCGAAAAGGAAACCTGCACCGAAGTGCTGGCATTATGCAATGGCC
ATAAAGAAAATAAATTGCAAAGACAACCTGGTAACACGTCAGGACCAAGTACTTCTTTCAACCAAGTTGAAACAGAAGATGTATTCTCAAAGATGGGGCCATCCAGAAAC
AGTCGCAAGTATAATCATGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGAGAGGTTTTCATGGAATGTCTAACCTATGATCTAACCTG
TTATTCGGAATTGGTTTCGCAACATGACAGCATCCGTGGAATGATCTGGGAGGTAACTCCGGCAGCGTTGTTTGTTCGGTCAACCCGACTGATTGGGAAGTGGGAACTGA
AATCAGAGGTGGTCGTTGATACGGTGCTGCGTTGTGTCACTCCTACAGTTAGTGATGTACAGAATCAGGCTTTGCTCAGGGAGTTCACTCGGGCTGAGGTGGAGGCAGCC
TTGAAAAGCATAGGCCCCACGAAGGCTCCAGGCTCCGATGGGGTTAATGCTCTATTTTACCAACGACACTGGGATTTGATCGGTGACGAGACTTCCGCGCTTTGCCTTGT
CTCTGATGAGCAATCGACATTTGTGCCTGGGCGATTGATCTCAGATAACGTGGTCATCAGGTTTGAGTGCTTGCACGCGATTGCTAGTAAAACGAGAGGGAAAACTGATA
ACGTGGTCATCAGGTTTGAGTGCTTGCACGCGATTGCTAGTAAAACGAGAGGGAAAACTAGTAATGTGGCGATGAAGTTGGATATGAGCAAGGCCTATGATCGGGTGGAA
TGGAGTTTTCTAAGGCAAATGATGCTTCAAATGGGGTTTCAGACAATATGGGTCGAGCTAGTTATGAGTTGTGTTGAGTCAGTTTGGTTTTTCAGTTTTGCTAAATGGAA
GGGTCTTTGTGCAATGCTTCATTATGAGATGTCTTGTCGAAATTATATAGGCCTGCAAATTAATAAATATTGTTCGGTCATTTCTCCTCTTTTTTATTCTAATGACAATC
TTATCTTCTTTCGTGCTGTTAAGCGTGACTGTCTGACCATTAAATCTGTTTTACATACTTATGAGTTGGCTTTTGGACAGGTTATTAATTATGATAAATCAATGTTCATG
GTCAGCAGGAACACTAGCCCTGACATGCAACAATTTATTAGGAGCACATTGGGAGTGGTTCAAACGAGCCCACTTGGTCGATATTTGGGGCTCCCTTCGCAGAATGCTCG
GGCAAAGTGTGTGATCTTTCGATCAGTTAGGGATCGGGTTTGGAAGGCTATTCTGGCTAAGCAGTGTTGGCGGTTGTCCCGAGATCAGTCTAGCCTTTGTATAGAGTTTT
GCGGGGTAGGTATTTTCGTACGGGTTCCTTCCTTCGGACAACATTGGGGTCGAACCCGTCATATGCATGGTGGAGCCTATTGTGGGGGCGAGAACTATTTAGATGAGGAG
GAGCTTCGTTCAAAGACGATGTCCTCGCTGATTATGGAGGATGGGTCCTGGGATGTGGAGAAGGTGAGGAGGGAATTTCTGCGGGACGATGCTGAACATATTTTGGCTAT
ACCATTAAGTGGTCAGAGAGAGGACAATGAGATATATTGGGCGTCGGATGGTAACTTTGCTTCTGTGGGTTACTCTTATAAGTACCCTAGATTTTATCCCCTGGGACAAT
CATCTGTTAATGAAACAACAATTCTCTTCCATTTGGTTTACACCAAATTTGGTATCAGAGCAACCAAGAAATTAGGTCTGATGACTGGAAAAGGCGCTGGCCAATCCAGT
AAAGGGCGTGAGGTGGACCCTTCAAACCTACCTGAAATTTTTTCTCCAAGACTACCACTGAACGCTTGCTGTCGGTTGAAAGTTCTTTAA
Protein sequenceShow/hide protein sequence
MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVITQSNGPQAAGMTNQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSST
MFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF
PPQWNTSASLQDGKYPRSGSGRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKDDVYSSGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITT
TQYVEISDVMEDSFTMRPFTSTSLSADESMEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLALCNGHKENKLQRQPGNTSGPSTSFNQVETEDVFSKMGPSRN
SRKYNHGLSCSDAEVDYRRGRSLREVFMECLTYDLTCYSELVSQHDSIRGMIWEVTPAALFVRSTRLIGKWELKSEVVVDTVLRCVTPTVSDVQNQALLREFTRAEVEAA
LKSIGPTKAPGSDGVNALFYQRHWDLIGDETSALCLVSDEQSTFVPGRLISDNVVIRFECLHAIASKTRGKTDNVVIRFECLHAIASKTRGKTSNVAMKLDMSKAYDRVE
WSFLRQMMLQMGFQTIWVELVMSCVESVWFFSFAKWKGLCAMLHYEMSCRNYIGLQINKYCSVISPLFYSNDNLIFFRAVKRDCLTIKSVLHTYELAFGQVINYDKSMFM
VSRNTSPDMQQFIRSTLGVVQTSPLGRYLGLPSQNARAKCVIFRSVRDRVWKAILAKQCWRLSRDQSSLCIEFCGVGIFVRVPSFGQHWGRTRHMHGGAYCGGENYLDEE
ELRSKTMSSLIMEDGSWDVEKVRREFLRDDAEHILAIPLSGQREDNEIYWASDGNFASVGYSYKYPRFYPLGQSSVNETTILFHLVYTKFGIRATKKLGLMTGKGAGQSS
KGREVDPSNLPEIFSPRLPLNACCRLKVL