CuGenDBv2

Tan0004623 (gene) of Snake gourd v1 genome

Gene ID: Tan0004623
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Mucin-2
Genome location: LG01:11550102..11552756
RNA-Seq Expression: Tan0004623
Synteny: Tan0004623
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
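The genome location above uses a `seqid:start..end` notation. A minimal sketch of how such a string could be split into its parts follows; the names `GenomeLocation` and `parse_location` are hypothetical, and the span length assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# seqid is everything before the colon; start and end are separated by '..'
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'LG01:11550102..11552756'."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("LG01:11550102..11552756")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2655 bases on LG01.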


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus] (e value: 3.1e-211; % identity: 83.9)
Query:  MRRRTDADADAADLRPV-NNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAA
        MRRRTD D    D RPV NNTFQTITAAADAIATVDHRFPRAT VQKRRWGSC SIYWCFGS+KQRKRIGHAVLVPEPSPS E HENTLQSPDIVLPFAA
Subjt:  MRRRTDADADAADLRPV-NNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAA

Query:  PPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPT
        PPSSPVS LQSEPPSA QSPT  A++ FTSLTANMYSPDGPSSIFAIGPFA+E QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+QPT
Subjt:  PPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPT

Query:  LQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS
        L K ESDNQY+FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DFA  GSQF NF LEVPP LLNLDKHSIH WRQRQS+DSCTQ+S+ FKSS
Subjt:  LQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS

Query:  DDFDLNPQTSESMSDHHATNESQNIQILI-DGSQK-EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEE
        +DF LNPQTSESMSDHHATNESQNIQILI DGS+K EEP A NHRFSFELSD D LL+SV SKPLESNELAV SSPIHEPFET KE SP  DH SN  EE
Subjt:  DDFDLNPQTSESMSDHHATNESQNIQILI-DGSQK-EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEE

Query:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
        K K +G+EA+Q QE HHS+TLGSVKEFNFDNG+GSDT  PNINS+WW NAK    E TATG WSFFPM QQR
Subjt:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

XP_022136623.1 uncharacterized protein At1g76660-like [Momordica charantia] (e value: 1.9e-216; % identity: 84.04)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRR DADAD ADL PVNNTFQTITAAADAIATVDHRFPRAT VQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS E  ENTLQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVSFLQSEPPSATQSPT  AIL FTSLTANMYSPDGPSSIFA+GPFA+ETQLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ+LQP+ 
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQY-SFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS
        QK ESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DF PSGS FSNF +EVPP LLNLD+HSI  WR +QSSDSCTQNS+G+KSS
Subjt:  QKAESDNQY-SFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS

Query:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA
        +DF LNPQTSES+SD+HA+NE  NIQIL DGSQ++E AAANHRFSFELSDEDALL+SVE+KPLESNELAVASSPIHEP ETAKETS V  H SN TEE+ 
Subjt:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA

Query:  KENGEEANQHQE-HHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQ
        K +GEE + HQE  HHS+TLG+VKEFNFDNG+G DTLKPNINS WWAN K  E EGT TGAWSFFP+ QQ
Subjt:  KENGEEANQHQE-HHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQ

XP_023522163.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo] (e value: 4.8e-212; % identity: 85.32)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRR  ADADAADLRP+NNTFQTITAAADAIATVDHRFPRAT VQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS EAH+N+LQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVSFLQSEPPSATQS  PS IL FTSLTANMYSPDGPSSIFAIGPFA+ETQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS-
        QKAESD+QYS PNDDFQSYQFYPGSPVS+LISPRS IS SGASSPLPDLDFA S SQFSNF+L+VPPALLNLD       RQ QSSDSCTQNS+GFKS+ 
Subjt:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS-

Query:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA
        DDFDLNP+TS+SM      NESQNIQILIDGSQ EEP   NHRFSFELSDED+LLR+VESKPLESN +AVASSP+HE FETAKETS    H SNG EEKA
Subjt:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA

Query:  KENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
         + GEEANQHQEHHHS TLGSV EFNFDNG+GS+ LKPNINSDWWANAK VE +GT TGAWSFFPMAQQR
Subjt:  KENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

XP_023529207.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 3.9e-214; % identity: 85.53)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRR DADADAADLRP+NNTFQTITAAADAIATVDHRFPRAT VQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS EAH+N+LQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVSFLQSEPPSATQS  PS IL FTSLTANMYSPDGPSSIFAIGPFA+ETQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS-
        QKAESD+QYS PNDDFQSYQFYPGSPVS+LISPRS IS SGASSPLPDLDFA S SQFSNF+L+VPPALLNLD       RQ QSSDSCTQNS+GFKS+ 
Subjt:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS-

Query:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA
        DDFDLNP+TS+SM      NESQNIQILIDGSQ EEP   NHRFSFELSDED+LLR+VESKPLESN +AVASSP+HE FETAKETS    H SNG EEKA
Subjt:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA

Query:  KENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
         + GEEANQHQEHHHS TLGSV EFNFDNG+GS+ LKPNINSDWWANAK VE +GT TGAWSFFPMAQQR
Subjt:  KENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

XP_038884079.1 uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida] (e value: 4.5e-218; % identity: 85.38)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRRTD D    D RPVNNTFQTITAAADAIATVDHRFPRAT VQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE SPS E+HEN+LQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVSFLQSEPPSATQSPT  A++ FTSLTANMYSPDGPSSIFAIGPFA+ETQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD
        QK+ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DFA  GSQF NF LEVPP LLNLDK SIH WRQRQS+DSCTQ+S+  KSS+
Subjt:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD

Query:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQKEE--PAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPV-DDHISNGTEE
        DF LNPQTSESMSDHHATNESQNIQILIDG+QKEE  P A NHRFSFELSD DALL+SV SKPL+SNE+AVASSPIHEPFETAKE SPV DDH SN TE 
Subjt:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQKEE--PAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPV-DDHISNGTEE

Query:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
        K K   EEA+QHQE HHSITLGSVKEFNFDNG+GSDT K N+NS+WW NAK V+ EGT  GAWSFFPM QQR
Subjt:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X1 (e value: 2.4e-209; % identity: 82.42)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAA
        MRRRTD D    D RPVNNTFQTITAAADAIATVDHRFPRAT V QKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS E HENTLQSPDIVLPFAA
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAA

Query:  PPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPT
        PPSSPVS LQSEPPSA QSPT  A++ FTSLTANMYSPDGPSSIFAIGPFA+E QLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQF+ P+
Subjt:  PPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPT

Query:  LQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS
        LQK ESDNQY+FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DFA  GSQF NF LEVPP L NLDKHSIH WRQRQS+DSCTQ+S+ FKSS
Subjt:  LQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS

Query:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEE
        +DF LNP TSESM DHHATNESQNIQILID   K  EEP A NHRFSFELSD D L +SV SKPLESNEL V SSPIHEPFET KE SP  DH SN  EE
Subjt:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEE

Query:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
        K K +G+EA+QHQE HHS+ LGSVKEFNFDN +GSDT  P INSDWW NAK    EGT TGAWSFFP  QQR
Subjt:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X2 (e value: 9.7e-211; % identity: 82.59)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRRTD D    D RPVNNTFQTITAAADAIATVDHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS E HENTLQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVS LQSEPPSA QSPT  A++ FTSLTANMYSPDGPSSIFAIGPFA+E QLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQF+ P+L
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD
        QK ESDNQY+FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DFA  GSQF NF LEVPP L NLDKHSIH WRQRQS+DSCTQ+S+ FKSS+
Subjt:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD

Query:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEK
        DF LNP TSESM DHHATNESQNIQILID   K  EEP A NHRFSFELSD D L +SV SKPLESNEL V SSPIHEPFET KE SP  DH SN  EEK
Subjt:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEK

Query:  AKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
         K +G+EA+QHQE HHS+ LGSVKEFNFDN +GSDT  P INSDWW NAK    EGT TGAWSFFP  QQR
Subjt:  AKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

A0A5A7TUB1 Mucin-2 (e value: 3.1e-209; % identity: 81.95)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRRTD D    D RPVNNTFQTITAAADAIATVDHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS E HENTLQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVS LQSEPPSA QSPT  A++ FTSLTANMYSPDGPSSIFAIGPFA+E QLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQF+ P+ 
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD
        QK ESDNQY+FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DFA  GSQF NF L+VPP L N+DKHSIH WRQRQS+DSCTQ+S+ FKSS+
Subjt:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD

Query:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEK
        DF LNP TSESM DHHATNESQNIQILID   K  EEP A NHRFSFELSD D L +SV SKPLESNEL V SSPIHEPFET KE SP  DH SN  EEK
Subjt:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEK

Query:  AKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
         K +G+EA+QHQE HHS+ LGSVKEFNFDN +GSDT  P INSDWW NAK    EGT TGAWSFFP  QQR
Subjt:  AKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

A0A5D3CYQ2 Mucin-2 (e value: 9.7e-211; % identity: 82.59)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRRTD D    D RPVNNTFQTITAAADAIATVDHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS E HENTLQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVS LQSEPPSA QSPT  A++ FTSLTANMYSPDGPSSIFAIGPFA+E QLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQF+ P+L
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD
        QK ESDNQY+FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DFA  GSQF NF LEVPP L NLDKHSIH WRQRQS+DSCTQ+S+ FKSS+
Subjt:  QKAESDNQYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSD

Query:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEK
        DF LNP TSESM DHHATNESQNIQILID   K  EEP A NHRFSFELSD D L +SV SKPLESNEL V SSPIHEPFET KE SP  DH SN  EEK
Subjt:  DFDLNPQTSESMSDHHATNESQNIQILIDGSQK--EEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEK

Query:  AKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR
         K +G+EA+QHQE HHS+ LGSVKEFNFDN +GSDT  P INSDWW NAK    EGT TGAWSFFP  QQR
Subjt:  AKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQR

A0A6J1C828 uncharacterized protein At1g76660-like (e value: 9.1e-217; % identity: 84.04)
Query:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP
        MRRR DADAD ADL PVNNTFQTITAAADAIATVDHRFPRAT VQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS E  ENTLQSPDIVLPFAAP
Subjt:  MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL
        PSSPVSFLQSEPPSATQSPT  AIL FTSLTANMYSPDGPSSIFA+GPFA+ETQLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ+LQP+ 
Subjt:  PSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  QKAESDNQY-SFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS
        QK ESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DF PSGS FSNF +EVPP LLNLD+HSI  WR +QSSDSCTQNS+G+KSS
Subjt:  QKAESDNQY-SFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSS

Query:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA
        +DF LNPQTSES+SD+HA+NE  NIQIL DGSQ++E AAANHRFSFELSDEDALL+SVE+KPLESNELAVASSPIHEP ETAKETS V  H SN TEE+ 
Subjt:  DDFDLNPQTSESMSDHHATNESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKA

Query:  KENGEEANQHQE-HHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQ
        K +GEE + HQE  HHS+TLG+VKEFNFDNG+G DTLKPNINS WWAN K  E EGT TGAWSFFP+ QQ
Subjt:  KENGEEANQHQE-HHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFPMAQQ

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 (e value: 7.8e-32; % identity: 48.11)
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSVEAHE----NTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMY
        Q++RWG C  ++ CF S K  KRI  A  +PE      S    AH+    N   +  I L   APPSSP SF  S  PS TQSP       + SL AN  
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSVEAHE----NTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMY

Query:  SPDGP-SSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFLQPTLQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISP
        SP GP SS++A GP+A+ETQLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP
Subjt:  SPDGP-SSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFLQPTLQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISP

Query:  RSVISRSGASSP
         S  S  G  SP
Subjt:  RSVISRSGASSP

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) (e value: 1.0e-50; % identity: 50.94)
Query:  NNTFQTITAAADAIATVDHRFPRATPV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENT----LQSPDIVLPFAAPPSSPVSFLQSEP
        NN F TI AAA AIA+ D R  +++P+ +KR+W + WS+  CFGS +QRKRIG++VLVPEP     ++  T     +S    LPF APPSSP SF QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATPV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENT----LQSPDIVLPFAAPPSSPVSFLQSEP

Query:  PSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPP---ESIHL--TTPSSPEVPFAQFLQPTLQKAESDN
        PSATQSP    IL F+ L  N        SIFAIGP+A+ETQLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPEVPFAQ      Q      
Subjt:  PSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPP---ESIHL--TTPSSPEVPFAQFLQPTLQKAESDN

Query:  QYSFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLN
         Y FP   + +FQ YQ  PGSP+  LISP      SG +SP PD       S F +F +  PP LL+
Subjt:  QYSFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLN

AT1G76660.1 FUNCTIONS IN: molecular_function unknown (e value: 5.5e-33; % identity: 48.11)
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSVEAHE----NTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMY
        Q++RWG C  ++ CF S K  KRI  A  +PE      S    AH+    N   +  I L   APPSSP SF  S  PS TQSP       + SL AN  
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSVEAHE----NTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTPSAILPFTSLTANMY

Query:  SPDGP-SSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFLQPTLQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISP
        SP GP SS++A GP+A+ETQLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP
Subjt:  SPDGP-SSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFLQPTLQKAESDNQYSFPNDDFQSYQFYPGSPVSHLISP

Query:  RSVISRSGASSP
         S  S  G  SP
Subjt:  RSVISRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein (e value: 1.6e-56; % identity: 37.2)
Query:  LRPVNN-TFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEA----HENTLQSPDIVLPFAAPPSSPVSFL
        +R VNN +  T+ AAA AI + + R  + + VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP+ S  A      ++  S  I +PF APPSSP SFL
Subjt:  LRPVNN-TFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEA----HENTLQSPDIVLPFAAPPSSPVSFL

Query:  QSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTLQKAESDN-
         S PPSA+ +P P  +    SLT N      P S F IGP+A+ETQ V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ L  +L++A  ++ 
Subjt:  QSEPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTLQKAESDN-

Query:  -----QYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSDDF
             ++S  + +F+S Q YPGSP  +LISP      SG SSP       P       F +  PP  L  +  +  KW  R  S S T    G +     
Subjt:  -----QYSFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSDDF

Query:  DLNPQTSESMSDHHATNESQNI------------QILIDGSQKEEPAAAN----------------HRFSFELSDEDALLRSVESKPLESNELAVASSPI
         L P  S+  S     N ++ +              L+D    E  + AN                HR SFEL+ ED + R + SK   S     AS   
Subjt:  DLNPQTSESMSDHHATNESQNI------------QILIDGSQKEEPAAAN----------------HRFSFELSDEDALLRSVESKPLESNELAVASSPI

Query:  HEPFETAKETSPVDDHISNGTEEKAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEK-EGTATGAWSFFPM
          P                      K +GE  ++  +   S + GS KEF FD  S ++ +   I S+WWAN KV  K + +   +W+FFP+
Subjt:  HEPFETAKETSPVDDHISNGTEEKAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEK-EGTATGAWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein (e value: 9.7e-62; % identity: 39.61)
Query:  VNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---VEAHENTLQSPDIVLPFAAPPSSPVSFLQSEPP
        VNN+ +T+ AAA AI T + R  + +  QK RWG CWS+Y CFG+ K  KRIG+AVLVPEP  S   V   +N+  S  +VLPF APPSSP SFLQS+P 
Subjt:  VNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---VEAHENTLQSPDIVLPFAAPPSSPVSFLQSEPP

Query:  SATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPP--ESIHLTTPSSPEVPFAQFLQPTLQKAESDN----
        S + SP         SLT+N +SP  P S+F +GP+A ETQ V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEVPFAQ L  +L+    D+    
Subjt:  SATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPP--ESIHLTTPSSPEVPFAQFLQPTLQKAESDN----

Query:  --QYSFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQ--NSMGFKSSDDF
          ++S  + +F+S Q  PGSP   +LISP SVIS SG SSP       P  S    F +  PP  L  +  +  KW  R  S S T   +  G  S    
Subjt:  --QYSFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQ--NSMGFKSSDDF

Query:  DLNPQ-TSESMSDHHAT----NESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEE
           P+  S +++ ++ T    N+   +  L +     E   A+HR SFEL+ ED + R + SK   S++    +  I       +E+S  D  I    E+
Subjt:  DLNPQ-TSESMSDHHAT----NESQNIQILIDGSQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEE

Query:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFP
        ++ +   E ++ Q+   S ++GS KEF FDN     T   NI             E  A  +WSFFP
Subjt:  KAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNINSDWWANAKVVEKEGTATGAWSFFP


Sequences
CDS sequence
ATGAGACGACGTACGGATGCTGATGCCGATGCTGCTGATCTGAGGCCTGTAAATAACACGTTTCAAACCATTACTGCGGCCGCCGATGCGATCGCCACCGTCGATCATCG
TTTTCCTCGGGCTACTCCCGTCCAGAAAAGAAGATGGGGCAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTCCTGG
TCCCAGAACCAAGTCCTTCAGTTGAGGCTCATGAAAATACATTGCAATCACCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTTTCCTTCCTTCAATCA
GAGCCACCTTCTGCTACTCAATCACCTACACCTTCAGCTATACTCCCTTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGG
CCCATTTGCTTATGAAACACAACTTGTGTCTCCCCCTCTGAATTTCTCCACTCTAACCACTGAACCATCAACTCCTCCCTTCACTCCTCCTGAGTCTATCCACTTGACTA
CACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTCCTTCAACCTACCCTTCAGAAAGCCGAGTCTGATAACCAATATTCATTTCCTAATGATGACTTTCAATCTTATCAA
TTCTATCCTGGCAGCCCAGTCAGTCACCTCATTTCACCGCGCTCAGTCATTTCTCGTTCTGGGGCGTCGTCGCCTTTGCCAGACTTGGATTTTGCTCCCTCTGGTTCTCA
ATTTTCTAATTTCACATTGGAAGTTCCACCTGCGCTGTTGAACCTTGACAAACATTCCATTCATAAATGGCGACAAAGGCAAAGTTCTGATTCTTGCACTCAGAATTCTA
TGGGATTCAAATCAAGTGATGATTTTGATTTGAATCCTCAAACTTCGGAATCTATGTCGGATCACCACGCAACAAATGAATCCCAAAATATTCAAATTCTCATTGATGGA
AGCCAAAAGGAGGAGCCTGCTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGCTTTATTGAGAAGCGTAGAAAGTAAGCCACTGGAATCAAATGAACT
TGCAGTTGCATCATCTCCAATACATGAACCATTTGAAACGGCTAAAGAAACTTCTCCTGTCGATGATCATATTTCAAATGGTACAGAAGAAAAGGCAAAAGAAAACGGTG
AAGAAGCAAATCAGCATCAAGAACATCATCATTCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGACAATGGCAGTGGAAGTGATACACTTAAGCCTAATATCAAC
TCAGACTGGTGGGCCAATGCGAAAGTTGTAGAGAAAGAAGGTACAGCCACCGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
mRNA sequence
ATCACATATAATTTAATTTCCGTCTTCTTTCTTCTCTCTTTAACGAACTTTCTCTTCACTAACTGCAAATTCTCTTATTTTGTTCTGTGTTTTTTCCCCTAAGAATTTCA
TCATCTCTCGTGTATGAAGAACGACAATTTTTTCTATGAAAACGATCAACAATTCTCTGGTTTCGATCAACGATAGCGATGAGACGACGTACGGATGCTGATGCCGATGC
TGCTGATCTGAGGCCTGTAAATAACACGTTTCAAACCATTACTGCGGCCGCCGATGCGATCGCCACCGTCGATCATCGTTTTCCTCGGGCTACTCCCGTCCAGAAAAGAA
GATGGGGCAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTCCTGGTCCCAGAACCAAGTCCTTCAGTTGAGGCTCAT
GAAAATACATTGCAATCACCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTTTCCTTCCTTCAATCAGAGCCACCTTCTGCTACTCAATCACCTACACC
TTCAGCTATACTCCCTTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGGCCCATTTGCTTATGAAACACAACTTGTGTCTC
CCCCTCTGAATTTCTCCACTCTAACCACTGAACCATCAACTCCTCCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAG
TTCCTTCAACCTACCCTTCAGAAAGCCGAGTCTGATAACCAATATTCATTTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGCAGCCCAGTCAGTCACCTCAT
TTCACCGCGCTCAGTCATTTCTCGTTCTGGGGCGTCGTCGCCTTTGCCAGACTTGGATTTTGCTCCCTCTGGTTCTCAATTTTCTAATTTCACATTGGAAGTTCCACCTG
CGCTGTTGAACCTTGACAAACATTCCATTCATAAATGGCGACAAAGGCAAAGTTCTGATTCTTGCACTCAGAATTCTATGGGATTCAAATCAAGTGATGATTTTGATTTG
AATCCTCAAACTTCGGAATCTATGTCGGATCACCACGCAACAAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAAAGGAGGAGCCTGCTGCTGCTAATCA
TAGATTCTCATTTGAGTTATCTGATGAAGATGCTTTATTGAGAAGCGTAGAAAGTAAGCCACTGGAATCAAATGAACTTGCAGTTGCATCATCTCCAATACATGAACCAT
TTGAAACGGCTAAAGAAACTTCTCCTGTCGATGATCATATTTCAAATGGTACAGAAGAAAAGGCAAAAGAAAACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCAT
TCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGACAATGGCAGTGGAAGTGATACACTTAAGCCTAATATCAACTCAGACTGGTGGGCCAATGCGAAAGTTGTAGA
GAAAGAAGGTACAGCCACCGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCAAACCGGTGCTTATCCTCTGGAATCTCCTCATTTCCATCATGTTTTGCAG
TTGCAAATTGGTAGGTATTAGGTAAGACAAACGGCTAGAGAAATGGTGGGTTTTAAAGGTAAAAAAAGAGGTCAAATCATGAAAGATTCAAACCAGAAGCCATTTTCTTT
TCAACAATCTGACCTAAACAAAGGCAGGTGTTATTAGAATGAAAAATAGAAATATGTACATTGACAATGGGGCCTTATTAACAAACAGTTGTGGCTCCTCACTTGAATTG
TAACAGGTATTAGTGTTCTAGTAGAAATTGGAAGTGTGTAAATATGGTAATAAAAATTGTTTTTATCTTTTCA
Protein sequence
MRRRTDADADAADLRPVNNTFQTITAAADAIATVDHRFPRATPVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSVEAHENTLQSPDIVLPFAAPPSSPVSFLQS
EPPSATQSPTPSAILPFTSLTANMYSPDGPSSIFAIGPFAYETQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFLQPTLQKAESDNQYSFPNDDFQSYQ
FYPGSPVSHLISPRSVISRSGASSPLPDLDFAPSGSQFSNFTLEVPPALLNLDKHSIHKWRQRQSSDSCTQNSMGFKSSDDFDLNPQTSESMSDHHATNESQNIQILIDG
SQKEEPAAANHRFSFELSDEDALLRSVESKPLESNELAVASSPIHEPFETAKETSPVDDHISNGTEEKAKENGEEANQHQEHHHSITLGSVKEFNFDNGSGSDTLKPNIN
SDWWANAKVVEKEGTATGAWSFFPMAQQR
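
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code; the `translate` helper is hypothetical and not part of the database, and only the first 27 and last 15 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the CDS sequence of Tan0004623 listed above.
cds_start = "ATGAGACGACGTACGGATGCTGATGCC"  # first 27 bases
cds_end = "GCGCAGCAAAGATGA"                # last 15 bases, including the stop codon

print(translate(cds_start))  # MRRRTDADA
print(translate(cds_end))    # AQQR
```

Both excerpts agree with the listed protein, which begins with MRRRTDADA and ends with AQQR. Passing the full CDS to `translate` would allow the same comparison over the whole sequence.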