CuGenDBv2

Sed0005403 (gene) of Chayote v1 genome

Gene ID: Sed0005403
Organism: Sechium edule (Chayote v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: LG13:17760935..17763471
RNA-Seq Expression: Sed0005403
Synteny: Sed0005403
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
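
The genome location field packs a linkage group and a coordinate range into one string. A minimal sketch for splitting it, assuming the coordinates are 1-based and inclusive (the page does not state this) and using a hypothetical helper name `parse_location`:

```python
import re

def parse_location(loc):
    """Split 'LG13:17760935..17763471' into (sequence, start, end, span).

    Assumes 1-based, inclusive coordinates; this is an assumption,
    not something the record itself states.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seq_id, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return seq_id, start, end, end - start + 1

print(parse_location("LG13:17760935..17763471"))
# -> ('LG13', 17760935, 17763471, 2537)
```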


Homology
GenBank top hits | e value | % identity | Alignment
KAG6608673.1 hypothetical protein SDJN03_02015, partial [Cucurbita argyrosperma subsp. sororia] | 2.5e-229 | 89.1
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR P+ ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ANQAT IAPSLLAPPSSPASF+NSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPSAS QDGKYPRSGSGRLFG+EKTGT LASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSL AEESI+PPL+GEKLKSTQ TL S+RSIKSAS+V EK+TC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKG
         CNG K++KLQR          SQ ETED+FSR+G SK+SRKYNH LSCSDAEVDYRRGRSLRGE  G
Subjt:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKG

XP_022133364.1 uncharacterized protein At1g76660 [Momordica charantia] | 5.4e-232 | 88.42
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR+P+ ER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGM NQAT IAPSLLAPPSSPASF+NSALPST QSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPS+S QDGKYPRSGSGRLFG+EKTGTSLASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKST+TT+ S+RS+K ASDV EK+TC EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR----------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG +DNKLQR           F+QVETEDVFSR+ P K+SRKYN GLSCSDAEVDYRRGRSLR EVKGDFSWH
Subjt:  FCNGFKDNKLQR----------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

XP_022940824.1 uncharacterized protein At1g76660 [Cucurbita moschata] | 6.8e-235 | 90.06
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR P+ ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ANQAT IAPSLLAPPSSPASF+NSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPSAS QDGKYPRSGSGRLFG+EKTGT LASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKSTQ TL S+RSIKSASDV EK+TC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG KD+KLQR          SQ ETED+FSR+G SK+SRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WH
Subjt:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

XP_022981333.1 uncharacterized protein At1g76660-like [Cucurbita maxima] | 1.3e-233 | 89.43
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR P+ ERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ANQAT IAPSLLAPPSSPASF+NSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPSAS QDGKYPRSGSGRLFG+EKTGT LASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEK KSTQ TL S+RSIKSASD+ EK+TC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG KDNKLQR          SQ ETED+FSR+G SK+SRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WH
Subjt:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

XP_023524439.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo] | 6.8e-235 | 89.64
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR P+ ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ANQAT IAPSLLAPPSSPASF+NSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPSAS QDGKYPRSGSGRLFG+EKTGT LASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEII+T+QYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKSTQ TL S+RSIKSASDV EK+TC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG KD+KLQR          SQ ETED+FSR+G SK+SRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WH
Subjt:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L1G3 Uncharacterized protein | 1.7e-223 | 86.5
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR+P+HERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGM NQAT I PSLLAPPSSPASF+NSALPST QSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+TQLVSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+ASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWN SAS QDGKYPRSGSGRLFGNEK GTSLASQDSNFFCPATFAQFYLDN  FP+TGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TTL S+RSIKSA +    +TCTE+ A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR---------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG+KDNKLQR           +QVE +DVFSR+G SK+SRKY+ GLSCSDAEVDYRRGRSLR E KG+ SWH
Subjt:  FCNGFKDNKLQR---------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

A0A1S3BV86 uncharacterized protein At1g76660 | 2.4e-225 | 87.13
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR+P+ ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGM NQAT I PSLLAPPSSPASF+NSALPST QSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+ASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWN SAS QDGKYPRSGSGRLFGNEK GTSLASQDSNFFCPATFAQFYLDN  FP+TGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TTL ++RSIKSA +V EK+TCTEV A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR---------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG+KDNKLQR            QVE +DVFSR+G SK+SRKY+ GLSCSDAEVDYRRGRSLR E KG+ SWH
Subjt:  FCNGFKDNKLQR---------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

A0A6J1BVS7 uncharacterized protein At1g76660 | 2.6e-232 | 88.42
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR+P+ ER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGM NQAT IAPSLLAPPSSPASF+NSALPST QSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPS+S QDGKYPRSGSGRLFG+EKTGTSLASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKST+TT+ S+RS+K ASDV EK+TC EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR----------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG +DNKLQR           F+QVETEDVFSR+ P K+SRKYN GLSCSDAEVDYRRGRSLR EVKGDFSWH
Subjt:  FCNGFKDNKLQR----------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

A0A6J1FRS7 uncharacterized protein At1g76660 | 3.3e-235 | 90.06
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR P+ ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ANQAT IAPSLLAPPSSPASF+NSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPSAS QDGKYPRSGSGRLFG+EKTGT LASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKSTQ TL S+RSIKSASDV EK+TC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG KD+KLQR          SQ ETED+FSR+G SK+SRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WH
Subjt:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

A0A6J1IZ74 uncharacterized protein At1g76660-like | 6.2e-234 | 89.43
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS
        MGSEQNR P+ ERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ANQAT IAPSLLAPPSSPASF+NSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDL AAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLV

Query:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ
        SPIS TSGDCLSSSFPER+F PQWNPSAS QDGKYPRSGSGRLFG+EKTGT LASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEK KSTQ TL S+RSIKSASD+ EK+TC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLA

Query:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH
         CNG KDNKLQR          SQ ETED+FSR+G SK+SRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WH
Subjt:  FCNGFKDNKLQR--------PFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWH

SwissProt top hits | e value | % identity | Alignment
Q9SRE5 Uncharacterized protein At1g76660 | 5.2e-129 | 59.87
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMANQATA--IAPSLLAPPSSPASFSNSALPSTAQSPSCFL
        MGSEQ      ++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASF+NSALPST QSP+C+L
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMANQATA--IAPSLLAPPSSPASFSNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SSMDLK +GK +Y   NDL A YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPAS

Query:  SLVSPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDVYSST-
        +L SPIS  SGD L                 S Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDVYSST-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKT
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITTSQYVEI+DVM+ SF    ++            P  G+KL   +  L S+ S KS +D+D +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKT

Query:  CTEVLAFCNGFKDNKLQRPFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLR
          +     N +KD+K QR     + E + SR+G  K SR Y+  +S SDAEV+YRRGRSLR
Subjt:  CTEVLAFCNGFKDNKLQRPFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLR

Arabidopsis top hits | e value | % identity | Alignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | 8.2e-29 | 41.86
Query:  PRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLSANSPGGPS
        P H++ ++W   W  L CF S +  KRI  +  +PE   +++  +    +G  +  T +    +APPSSPASF  S  PS  QSP   LS S   P    
Subjt:  PRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLSANSPGGPS

Query:  STMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLVSPIS
         ++FA GPYAHETQLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP  
Subjt:  STMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLVSPIS

Query:  MTSGDCLSSSFPERE
         + G   +S FP+ E
Subjt:  MTSGDCLSSSFPERE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | 3.7e-130 | 59.87
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMANQATA--IAPSLLAPPSSPASFSNSALPSTAQSPSCFL
        MGSEQ      ++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASF+NSALPST QSP+C+L
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMANQATA--IAPSLLAPPSSPASFSNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SSMDLK +GK +Y   NDL A YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPAS

Query:  SLVSPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDVYSST-
        +L SPIS  SGD L                 S Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDVYSST-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKT
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITTSQYVEI+DVM+ SF    ++            P  G+KL   +  L S+ S KS +D+D +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKT

Query:  CTEVLAFCNGFKDNKLQRPFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLR
          +     N +KD+K QR     + E + SR+G  K SR Y+  +S SDAEV+YRRGRSLR
Subjt:  CTEVLAFCNGFKDNKLQRPFSQVETEDVFSRMGPSKSSRKYNHGLSCSDAEVDYRRGRSLR

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | 5.3e-28 | 43.24
Query:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMA--------NQATAIAPSLLAPPSSPASFSNSALPSTAQS
        + +E    P   + KR G  W    CF S+K  KRI  A  +PE          P A+G A        + +T+I    +APPSSPASF  S  PS + +
Subjt:  MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMA--------NQATAIAPSLLAPPSSPASFSNSALPSTAQS

Query:  --PSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK-----GTGKANYVASNDLH
          P    SL+ N P  PS+  F  GPYAHETQ V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++   
Subjt:  --PSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK-----GTGKANYVASNDLH

Query:  AAYSLYPGSPASSLVSPISMTS
         +  +YPGSP  +L+SP S TS
Subjt:  AAYSLYPGSPASSLVSPISMTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | 2.7e-32 | 41.98
Query:  SEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLSAN
        +E    P   +  RWG CW   SCF +QK  KRI  A  +PE   VT+          A   T + P  +APPSSPASF  S   S + SP   LSL++N
Subjt:  SEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLSAN

Query:  --SPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSMDL---KGTGKAN--YVASNDLHAAYSLYPGS
          SP  P S +F  GPYA+ETQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L     T   N  + +S+    +  + PGS
Subjt:  --SPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSMDL---KGTGKAN--YVASNDLHAAYSLYPGS

Query:  P-ASSLVSPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLAS
        P   +L+SP S+ S    SS +P +      +P    + G+ P+      F   K G+   S
Subjt:  P-ASSLVSPISMTSGDCLSSSFPEREFAPQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLAS
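
The middle line of each alignment row appears to follow the usual BLAST pairwise convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. Under that assumption, the sketch below counts identities and positives for a single row. The helper name `midline_stats` is hypothetical, and the input is the match line with the 8-character label padding already removed. The example uses the last row of the AT1G63720.1 alignment (query `MTSGDCLSSSFPERE`).

```python
def midline_stats(midline):
    """Return (identities, positives, length) for one BLAST-style match line.

    Assumes letters mark identical residues and '+' marks conservative
    substitutions; anything else counts as a mismatch or gap.
    """
    length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length

# Match line for the query segment MTSGDCLSSSFPERE (15 columns).
mid = " + G   +S FP+ E"
identities, positives, length = midline_stats(mid)
print(f"{identities}/{length} identical, {positives}/{length} positive")
# -> 5/15 identical, 8/15 positive
```

The % identity values listed for each hit cover the whole alignment, so a single row will generally not reproduce them. Summing identities and lengths over all rows of a hit would be the comparable figure.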


Sequences
CDS sequence
ATGGGGTCTGAGCAAAACAGATACCCTCGCCACGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGGGAAAAACGCATTGT
GCCTGCATCTCGTTTACCAGAGGGCAATGTCGTGACAACTCAGCCAAATGGACCTCAAGCAGCAGGGATGGCGAACCAGGCTACAGCGATTGCTCCATCTCTTCTAGCCC
CACCTTCTTCACCAGCATCCTTTTCAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTTGCTACTGGGCCATATGCCCACGAAACACAACTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACAGAACCATCAACTGCTCCACTCACTCCCCCACCCGAGCT
AGCTCACCTAACCACACCTTCTTCCCCTGATGTGCCTTTTGCCCAGTTCCTATCCTCATCAATGGATCTTAAAGGTACTGGAAAGGCAAATTACGTTGCTTCAAATGATC
TTCATGCAGCTTATTCTCTTTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCGCCAATTTCAATGACCTCCGGCGATTGCTTATCATCTTCATTTCCTGAGAGGGAATTC
GCACCTCAGTGGAATCCTTCAGCTTCTCACCAGGATGGAAAATATCCAAGAAGTGGCTCTGGTCGGTTATTTGGAAATGAGAAAACTGGCACATCTTTGGCATCTCAGGA
TTCTAATTTCTTCTGCCCTGCCACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTAATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACTCGT
CTACTGGGAATGGATACCAAAACCGGCATAGTAAGTCACCAAAACAAGATGTGGAGGAAATAGAAGCCTACCGAGCATCTTTCGGTTTCAGTGCAGATGAAATTATAACT
ACTTCACAATACGTGGAGATATCTGATGTAATGGAGGATTCCTTTACCATGAGACCTTTTACTTCAACTAGTCTGTCAGCAGAAGAAAGTATTGAACCTCCATTATTGGG
TGAAAAACTAAAATCCACTCAGACAACTTTACCCAGTGAGAGAAGTATTAAATCAGCATCTGATGTTGACGAAAAAAAGACCTGCACGGAAGTGCTGGCATTTTGCAATG
GCTTCAAAGACAATAAGTTGCAAAGACCTTTCAGCCAAGTTGAAACAGAAGATGTATTCTCAAGAATGGGGCCGTCCAAAAGTAGTCGCAAGTATAACCATGGTTTATCA
TGCTCCGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGGGGAGGTCAAGGGAGATTTCTCATGGCACTGA
mRNA sequence
ATGGGGTCTGAGCAAAACAGATACCCTCGCCACGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGGGAAAAACGCATTGT
GCCTGCATCTCGTTTACCAGAGGGCAATGTCGTGACAACTCAGCCAAATGGACCTCAAGCAGCAGGGATGGCGAACCAGGCTACAGCGATTGCTCCATCTCTTCTAGCCC
CACCTTCTTCACCAGCATCCTTTTCAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTTGCTACTGGGCCATATGCCCACGAAACACAACTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACAGAACCATCAACTGCTCCACTCACTCCCCCACCCGAGCT
AGCTCACCTAACCACACCTTCTTCCCCTGATGTGCCTTTTGCCCAGTTCCTATCCTCATCAATGGATCTTAAAGGTACTGGAAAGGCAAATTACGTTGCTTCAAATGATC
TTCATGCAGCTTATTCTCTTTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCGCCAATTTCAATGACCTCCGGCGATTGCTTATCATCTTCATTTCCTGAGAGGGAATTC
GCACCTCAGTGGAATCCTTCAGCTTCTCACCAGGATGGAAAATATCCAAGAAGTGGCTCTGGTCGGTTATTTGGAAATGAGAAAACTGGCACATCTTTGGCATCTCAGGA
TTCTAATTTCTTCTGCCCTGCCACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTAATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACTCGT
CTACTGGGAATGGATACCAAAACCGGCATAGTAAGTCACCAAAACAAGATGTGGAGGAAATAGAAGCCTACCGAGCATCTTTCGGTTTCAGTGCAGATGAAATTATAACT
ACTTCACAATACGTGGAGATATCTGATGTAATGGAGGATTCCTTTACCATGAGACCTTTTACTTCAACTAGTCTGTCAGCAGAAGAAAGTATTGAACCTCCATTATTGGG
TGAAAAACTAAAATCCACTCAGACAACTTTACCCAGTGAGAGAAGTATTAAATCAGCATCTGATGTTGACGAAAAAAAGACCTGCACGGAAGTGCTGGCATTTTGCAATG
GCTTCAAAGACAATAAGTTGCAAAGACCTTTCAGCCAAGTTGAAACAGAAGATGTATTCTCAAGAATGGGGCCGTCCAAAAGTAGTCGCAAGTATAACCATGGTTTATCA
TGCTCCGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGGGGAGGTCAAGGGAGATTTCTCATGGCACTGA
Protein sequence
MGSEQNRYPRHERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMANQATAIAPSLLAPPSSPASFSNSALPSTAQSPSCFLSLSANSPGGPSST
MFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLHAAYSLYPGSPASSLVSPISMTSGDCLSSSFPEREF
APQWNPSASHQDGKYPRSGSGRLFGNEKTGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDVYSSTGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TSQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTQTTLPSERSIKSASDVDEKKTCTEVLAFCNGFKDNKLQRPFSQVETEDVFSRMGPSKSSRKYNHGLS
CSDAEVDYRRGRSLRGEVKGDFSWH
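
The protein sequence should be the translation of the CDS listed above. The following is a minimal sketch of that check, assuming the standard genetic code applies and using a hypothetical helper `translate`. It is run here on the first 30 nucleotides of the CDS only; the full sequences can be passed in the same way once their line breaks are removed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS sequence on this page.
print(translate("ATGGGGTCTGAGCAAAACAGATACCCTCGC"))
# -> MGSEQNRYPR
```

The printed peptide matches the first ten residues of the protein sequence.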