; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004656 (gene) of Snake gourd v1 genome

Gene IDTan0004656
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationLG01:116962844..116976231
RNA-Seq ExpressionTan0004656
SyntenyTan0004656
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608673.1 hypothetical protein SDJN03_02015, partial [Cucurbita argyrosperma subsp. sororia]9.3e-24894.46Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQ +RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEE+I+PPL+GEKLKSTQ TLQSQRSIKSAS+VVEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKG
        LCNG K++KLQRQPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGE  G
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKG

KAG7037989.1 hypothetical protein SDJN02_01622, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-24795.25Show/hide
Query:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMF
        +GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMF
Subjt:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGNCLS
        ATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSG+CLS
Subjt:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGNCLS

Query:  SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE
        SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLALCNGNKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEE+I+PPL+GEKLKSTQ TLQSQRSIKSASDVVEKETC+EVLALCNG KD+KLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLALCNGNKDNKLQR

Query:  QPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        QPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WHD
Subjt:  QPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

XP_022940824.1 uncharacterized protein At1g76660 [Cucurbita moschata]4.3e-25395.16Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQ +RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+I+PPL+GEKLKSTQ TLQSQRSIKSASD VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KD+KLQRQPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

XP_022981333.1 uncharacterized protein At1g76660-like [Cucurbita maxima]4.3e-25394.95Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQ +RGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+I+PPL+GEK KSTQ TL SQRSIKSASD+VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KDNKLQRQPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

XP_023524439.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo]7.9e-25595.37Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQ +RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTSTSLSAEE+I+PPL+GEKLKSTQ TLQSQRSIKSASDVVEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KD+KLQRQPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

TrEMBL top hitse value%identityAlignment
A0A0A0L1G3 Uncharacterized protein1.2e-23590.11Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQH+RGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQP GPQAAG+ NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+TQLVSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+ EPPLLGEKLKS+ TTLQSQRSIKSA +    ETCTE+ A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KDNKLQRQPG++ GSST NQVE +DVFSRIGSSKNSRKY+ GLSCSDAEVDYRRGRSLR E KG+ SWHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

A0A1S3BV86 uncharacterized protein At1g766601.6e-23790.74Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQ +RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQP GPQAAG+ NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+ EPPLLGEKLKS+ TTLQ+QRSIKSA +VVEKETCTEV A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KDNKLQRQPG++ GSST +QVE +DVFSRIGSSKNSRKY+ GLSCSDAEVDYRRGRSLR E KG+ SWHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

A0A6J1BVS7 uncharacterized protein At1g766601.9e-24692.86Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQ +R KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQP GPQAAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+IEPPLLGEKLKST+TT+QSQRS+K ASDVVEKETC EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPG-SSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG +DNKLQRQPGNM G SS+FNQVETEDVFSRI   KNSRKYN GLSCSDAEVDYRRGRSLR EVKGDFSWHD
Subjt:  LCNGNKDNKLQRQPGNMPG-SSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

A0A6J1FRS7 uncharacterized protein At1g766602.1e-25395.16Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQ +RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+I+PPL+GEKLKSTQ TLQSQRSIKSASD VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KD+KLQRQPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

A0A6J1IZ74 uncharacterized protein At1g76660-like2.1e-25394.95Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQ +RGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQP GP AAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ
        SPISRTSG+CLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE+I+PPL+GEK KSTQ TL SQRSIKSASD+VEKETC+EVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLA

Query:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD
        LCNG KDNKLQRQPGN+PGSST +Q ETED+FSRIGSSKNSRKYNH LSCSDAEVDYRRGRSLRGEVKGDF WHD
Subjt:  LCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.5e-12658.3Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPIGPQAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSEQ      D+ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QP G   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPIGPQAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-
        +L SPISR SG+ L S                 Q+GK  RS SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  L SQ S KS +D+  +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET

Query:  CTEVLALCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLR
          +     N  KD+K + +             + E + SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Subjt:  CTEVLALCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLR

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)3.2e-2840.18Show/hide
Query:  MGSEQNRIPQHD---RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        + S  +R+ Q     + ++W   W  L CF S +  KRI  +  +PE   +++       +G  +  T +    +APPSSPASF  S  PS  QSP   L
Subjt:  MGSEQNRIPQHD---RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPG
        S S   P     ++FA GPYAHETQLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PG
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPG

Query:  SPASSLVSPISRTSGNCLSSSFPE
        SP   L+SP   + G+  +S FP+
Subjt:  SPASSLVSPISRTSGNCLSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.8e-12758.3Show/hide
Query:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPIGPQAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSEQ      D+ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QP G   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPIGPQAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-
        +L SPISR SG+ L S                 Q+GK  RS SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +  L SQ S KS +D+  +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKET

Query:  CTEVLALCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLR
          +     N  KD+K + +             + E + SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Subjt:  CTEVLALCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKNSRKYNHGLSCSDAEVDYRRGRSLR

AT4G25620.1 hydroxyproline-rich glycoprotein family protein3.8e-2943.4Show/hide
Query:  SEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFLSLS
        S ++R       K+ G  W    CF S+K  KRI  A  +PE    +   + P     +N  ++  P  +APPSSPASF  S  PS + +  P    SL+
Subjt:  SEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYVASNDLQAAYSLYPGSP
         N P  PS+  F  GPYAHETQ V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YPGSP
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYVASNDLQAAYSLYPGSP

Query:  ASSLVSPISRTS
          +L+SP S TS
Subjt:  ASSLVSPISRTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein7.3e-3342.19Show/hide
Query:  PQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--SPGG
        P   +  RWG CW   SCF +QK  KRI  A  +PE   VT+          A   TV+ P  +APPSSPASF  S   S + SP   LSL++N  SP  
Subjt:  PQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--SPGG

Query:  PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAY-----SLYPGSP-ASSL
        P S +F  GPYA+ETQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      + +      + Y      + PGSP   +L
Subjt:  PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAY-----SLYPGSP-ASSL

Query:  VSPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLAS
        +SP S  S +  SS +P +      +P    + G+ P+      F   K G+   S
Subjt:  VSPISRTSGNCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCCGAGCAAAACAGAATCCCTCAGCATGATCGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGT
GCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAATTGGACCTCAAGCAGCAGGAATAGCCAACCAGGCTACAGTGATAGCTCCATCCCTTCTGGCCC
CACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCTACAGCCCAATCACCTAGCTGTTTCTTGTCATTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTCGCTACAGGGCCATATGCGCATGAAACACAACTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCTGAACT
AGCTCACCTAACCACACCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCGTCAGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCCTCAAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTAGTGTCACCAATTTCAAGGACCTCCGGCAATTGCTTATCATCTTCGTTTCCTGAGAGGGACTTC
CCACCACAGTGGAATCCGTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTTTGGCATCTCAGGA
TTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCGCATACTGGTGGGAGGTTAAGTGTATCAAAAGATTCAGATGTTTACTCGT
CTGGTGGGAATGGATACCAAAACCGCCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTATAGAGCATCTTTTGGTTTCAGTGCAGATGAAATTATAACT
ACTACACAATATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCAGCAGAAGAAAATATTGAACCTCCATTATTGGG
TGAAAAACTAAAATCCACACAGACAACTTTGCAGAGTCAGAGAAGTATTAAATCAGCATCTGATGTTGTCGAAAAAGAAACTTGCACTGAAGTCCTGGCATTATGCAATG
GCAATAAAGACAATAAATTGCAAAGACAACCTGGTAACATGCCAGGATCAAGTACTTTCAACCAAGTTGAAACGGAAGATGTATTCTCAAGGATAGGGTCATCCAAAAAT
AGTCGCAAGTATAATCATGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGGGGAGGTCAAGGGAGATTTTTCATGGCATGACTAA
mRNA sequenceShow/hide mRNA sequence
GAACTTACTTCGTCTGTCAGCCAGTAGCCACCATCGTTCCCTTTCTCACTCTGTCTCTCGCATTTGCGTCTTACTCACACAGCCACCGTTGCCGTCGACTACTCAGCCAC
CGTCGCCGTCGCTCTCTATTGGAAATTTTTCTGGTCGGGAGAAGGACGTGATTAACAAAGAACCAAACATCCATCACAACAACAACAAAGAACCAACCGTCTGTTCTGGG
GTTTGTTTGCTAATAGTGCTGTTGCTCTGTTGTTCAGCATTGGAGAGCAAATCTGTATGCTATTAGATTTTGCTTTGTGAATCGAGAAATGCCTGGGTTGGTATGGGTCT
AACTCACTTAGTCCTTAAATGGACGAAGTTGACGGAGGAGGAGGAGGAGGAGGAGGGGGAGGAATAAGAGGAGGAGGAGCAAGAGGAGGGCCAAGAGTCTAGTAGGAAGG
GCCAGCTTGGGACATTAAAGATAAATAGACTAAGTCGAGCTACTGGGTACAAATGGGGTCCGAGCAAAACAGAATCCCTCAGCATGATCGGGGAAAGAGATGGGGTGGAT
GTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAATTGGACCTCAA
GCAGCAGGAATAGCCAACCAGGCTACAGTGATAGCTCCATCCCTTCTGGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCTACAGCCCAATCACC
TAGCTGTTTCTTGTCATTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACTGGTTTCTCCTCCTGTTTTCT
CAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCTGAACTAGCTCACCTAACCACACCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCG
TCAGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCCTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTAGTGTCACCAAT
TTCAAGGACCTCCGGCAATTGCTTATCATCTTCGTTTCCTGAGAGGGACTTCCCACCACAGTGGAATCCGTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTT
CTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTC
CCGCATACTGGTGGGAGGTTAAGTGTATCAAAAGATTCAGATGTTTACTCGTCTGGTGGGAATGGATACCAAAACCGCCACAGTAAGTCTCCAAAACAAGATGTGGAGGA
AATAGAAGCTTATAGAGCATCTTTTGGTTTCAGTGCAGATGAAATTATAACTACTACACAATATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTT
TTACTTCAACTAGTCTGTCAGCAGAAGAAAATATTGAACCTCCATTATTGGGTGAAAAACTAAAATCCACACAGACAACTTTGCAGAGTCAGAGAAGTATTAAATCAGCA
TCTGATGTTGTCGAAAAAGAAACTTGCACTGAAGTCCTGGCATTATGCAATGGCAATAAAGACAATAAATTGCAAAGACAACCTGGTAACATGCCAGGATCAAGTACTTT
CAACCAAGTTGAAACGGAAGATGTATTCTCAAGGATAGGGTCATCCAAAAATAGTCGCAAGTATAATCATGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAG
GAAGGAGCCTAAGGGGGGAGGTCAAGGGAGATTTTTCATGGCATGACTAAGAGAGCCATCTCTGAAAATAGTTTACAGTGCTTATCTGTTCTTTGCTTTGCGGGTTTCCA
TGGAATGTCTAACCTATGATCTAACCTTTTTTTCTCTGAGTTATGACGTCAGTGGATGGATGAATATAATTTGTGTTCTTTATCATGCCAATTTAGGTATGGAATCGGTC
GAATTGCTGTATTTGGTAGACGAAGTTGACGTTTTTTCAATGAATTATGTACTGGCATCGTACACTCTCGCCCGTTCTGTGCATCAGTCGTCCTATACATGTCCAGACTC
CTGTTCATAGAGGTCAGATCGGGATTTCTTTAGAACCTTAGATTTGCTAAAATGTCATGTTGAAAGTTAGTGTTACATCAGAAATAGTCGGTTGTCGTTGTAAATAATTG
TTGGAGATTTCTTGTCGTTACAAGATCCAGTGGTTTGGCAACATGGCAAAATCCGTCGAATGAACCTAGGAGGTAACTCTGGCGATGCTATGTTGTTCGTTCTGTGAACC
CGACTTATTATAAAGTGGGAATAGAAATCAGCGA
Protein sequenceShow/hide protein sequence
MGSEQNRIPQHDRGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPIGPQAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSST
MFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGNCLSSSFPERDF
PPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TTQYVEISDVMEDSFTMRPFTSTSLSAEENIEPPLLGEKLKSTQTTLQSQRSIKSASDVVEKETCTEVLALCNGNKDNKLQRQPGNMPGSSTFNQVETEDVFSRIGSSKN
SRKYNHGLSCSDAEVDYRRGRSLRGEVKGDFSWHD