CuGenDBv2

Tan0000235 (gene) of Snake gourd v1 genome

Gene ID: Tan0000235
Organism: Trichosanthes anguina (Snake gourd v1)
Description: proline-rich protein 3-like
Genome location: LG06:6569865..6575338
RNA-Seq Expression: Tan0000235
Synteny: Tan0000235
Gene Ontology terms: GO:0071944 - cell periphery (cellular component)
InterPro domains: NA
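
The genome location above has the form `seqid:start..end`. As a minimal sketch, the string can be split into its parts and the locus span computed. The helper name `parse_location` is hypothetical, and the span assumes 1-based inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG06:6569865..6575338")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)  # LG06 6569865 6575338 5474
```

Under that assumption the locus spans 5,474 bp on LG06.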


Homology
GenBank top hits | e value | % identity | Alignment
KAG6575813.1 Proline-rich protein 3, partial [Cucurbita argyrosperma subsp. sororia] | 4.3e-71 | 73.82
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG
        MAL R+LI+ATPV+LLWLLAAA V+SAA YY   DD   G     G+PIYEKPL  Y REV+   ++AVEGVVSC+N + YHPLK VVARITCM LNEKG
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG

Query:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN
        NEMAPFSFS+FP+D+HGYFLATLS  KLKGKAKVT+CKAFLPP SPCE CKYLT+VNNGV GALFRSFRIL+H+ MKLYSVGPF YSS+ N
Subjt:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN

XP_022152066.1 proline-rich protein 1 [Momordica charantia] | 4.1e-69 | 67.61
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYYDDLPNG-----------------------GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLK
        MAL RQ  SATPV+LL LLA   V+SAA  Y  +P+G                        +PIYEKP   Y  E      +AVEGVVSCK GS Y PLK
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYYDDLPNG-----------------------GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLK

Query:  DVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFY
         VVARITC+ALNEKGNE+APFSFS+FPTD+HGYFLATLSP KLKGKAKVTQCK FLPPSPCEDCKY TD+NNGVTGALF SFRILT+KKMKLYSVGPFFY
Subjt:  DVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFY

Query:  SSEPNAFPLPDGY
        +SEPNA P+PDGY
Subjt:  SSEPNAFPLPDGY

XP_022953718.1 proline-rich protein 3-like [Cucurbita moschata] | 9.7e-71 | 75
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG
        MAL R+LI+ATPV+LLWLLAAA V+SAA YY   DD   G     G+PIYEKPL  Y REV+   ++AVEGVVSC+N + YHPLK VVARITCM LNEKG
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG

Query:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSS
        NEMAPFSFS+FP+D+HGYFLATLS  KLKGKAKVT+CKAFLPP SPCE CKYLT+VNNGV GALFRSFRILTH+ MKLYSVGPF YSS
Subjt:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSS

XP_023549401.1 proline-rich protein 3-like [Cucurbita pepo subsp. pepo] | 6.7e-72 | 74.87
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG
        MAL R+LI+ATPV+LLWLLAAA V+SAA YY   DD   G     G+PIYEKPL  Y REV+   ++AVEGVVSCKN + YHPLK VVARITCM LNEKG
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG

Query:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN
        NEMAPFSFS+FP+D+HGYFLATLS  KLKGKAKVT+CKAFLPP SPCE CKYLT+VNNGV GALFRSFRILTH+ MKLYSVGPF YSS+ N
Subjt:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN

XP_038896333.1 proline-rich protein 3-like [Benincasa hispida] | 5.1e-72 | 74.35
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYYD----DLPN-GGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEM
        MAL RQLISA PV+LLWLLAAA   S+   YD    D+ +  G+PIYEK L  Y  EV+ S ++AVEGVVSCKNG+ YHPLK +VARITCMALNE G EM
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYYD----DLPN-GGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEM

Query:  APFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFP
        APFSFS+ P+DDHGYFLATLSP KLKGKAKVTQCKAFLPP SPCEDCKYLT+VNNGV GALFRSFRILTHKKMKLYS+G FFYSS+P   P
Subjt:  APFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFP
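
The alignment blocks above carry a match line between the Query and Subjt rows. As a rough sketch, assuming the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap; the page itself does not document this), identity can be counted from the match line. The function name `alignment_stats` is hypothetical, and the example uses only the final block of the XP_022152066.1 alignment, so its percentage is not the whole-alignment figure listed for that hit.

```python
def alignment_stats(query, midline):
    """Return (identities, positives, length) for one alignment segment."""
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    # Letters on the match line are taken to mark identical residues (assumed convention).
    identities = sum(ch.isalpha() for ch in midline)
    # '+' is taken to mark a conservative substitution (assumed convention).
    positives = identities + midline.count("+")
    # Every column of the segment counts toward the length, gaps included.
    return identities, positives, len(query)


query = "SSEPNAFPLPDGY"
midline = "+SEPNA P+PDGY"
identities, positives, length = alignment_stats(query, midline)
print(f"{identities}/{length} identical ({100 * identities / length:.2f}%)")
# 10/13 identical (76.92%)
```

To compare against the % identity listed for a hit, the segments of all blocks of that hit would need to be concatenated first, with match lines padded to the same width as the query rows.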

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BRP9 proline-rich protein 3-like | 2.2e-68 | 73.12
Query:  MALPRQLI-SATPVMLLWLLAAAPVTSAAGYYD--DLPNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAP
        MAL RQLI SA PV+LLWLLA+A V+SAA YYD  D+ + G+PIY+K L  Y  +      +AVEGVVSCKNG+ YHPLK +VAR TCMALNEKG EMAP
Subjt:  MALPRQLI-SATPVMLLWLLAAAPVTSAAGYYD--DLPNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAP

Query:  FSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN
        FSFS+FP+D +GYFLATLS  +LKGKAKVTQCKAFLPP SPCE CKYLT+VN+GV GALFRSFRILTHKKMKLYS+G FFYSS+PN
Subjt:  FSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN

A0A5A7VJJ1 Proline-rich protein 3-like | 2.2e-68 | 73.12
Query:  MALPRQLI-SATPVMLLWLLAAAPVTSAAGYYD--DLPNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAP
        MAL RQLI SA PV+LLWLLA+A V+SAA YYD  D+ + G+PIY+K L  Y  +      +AVEGVVSCKNG+ YHPLK +VAR TCMALNEKG EMAP
Subjt:  MALPRQLI-SATPVMLLWLLAAAPVTSAAGYYD--DLPNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAP

Query:  FSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN
        FSFS+FP+D +GYFLATLS  +LKGKAKVTQCKAFLPP SPCE CKYLT+VN+GV GALFRSFRILTHKKMKLYS+G FFYSS+PN
Subjt:  FSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN

A0A6J1DGI4 proline-rich protein 1 | 2.0e-69 | 67.61
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYYDDLPNG-----------------------GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLK
        MAL RQ  SATPV+LL LLA   V+SAA  Y  +P+G                        +PIYEKP   Y  E      +AVEGVVSCK GS Y PLK
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYYDDLPNG-----------------------GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLK

Query:  DVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFY
         VVARITC+ALNEKGNE+APFSFS+FPTD+HGYFLATLSP KLKGKAKVTQCK FLPPSPCEDCKY TD+NNGVTGALF SFRILT+KKMKLYSVGPFFY
Subjt:  DVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFY

Query:  SSEPNAFPLPDGY
        +SEPNA P+PDGY
Subjt:  SSEPNAFPLPDGY

A0A6J1GP42 proline-rich protein 3-like | 4.7e-71 | 75
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG
        MAL R+LI+ATPV+LLWLLAAA V+SAA YY   DD   G     G+PIYEKPL  Y REV+   ++AVEGVVSC+N + YHPLK VVARITCM LNEKG
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG

Query:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSS
        NEMAPFSFS+FP+D+HGYFLATLS  KLKGKAKVT+CKAFLPP SPCE CKYLT+VNNGV GALFRSFRILTH+ MKLYSVGPF YSS
Subjt:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSS

A0A6J1JUA4 proline-rich protein 3-like | 2.6e-69 | 73.3
Query:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG
        MA  R+LI+ATPV+LL LLAAA V+SAA YY   DD   G     G+PIYEKPL  Y REV+   I+AVEGVVSCKN + YHPLK VVARI C+ LNEKG
Subjt:  MALPRQLISATPVMLLWLLAAAPVTSAAGYY---DDLPNG-----GMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKG

Query:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN
        NEMAPFSFS+FP+D+HGYFLATLS  KLKGKAKVT+CKAFLPP SPCE CKYLT+VNNGV GALFRSFRILTH+ MKLYSVGPF YSS+ N
Subjt:  NEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPP-SPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPN

SwissProt top hits | e value | % identity | Alignment
O81417 Protein SEED AND ROOT HAIR PROTECTIVE PROTEIN | 5.6e-29 | 45.24
Query:  VAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRS
        +AVEG++ CK+G   +P++   ARI C+ ++  GNE+ P S  +  TD  GYF+AT+ P +L+    VT+CK +L  SP  DC + TDVN GV G    +
Subjt:  VAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRS

Query:  FRILTHKKMKLYSVGPFFYSSEPNAF
        +RIL  K  KLY  GPFFY+SEP  +
Subjt:  FRILTHKKMKLYSVGPFFYSSEPNAF

P93013 Non-classical arabinogalactan protein 30 | 3.2e-08 | 30.57
Query:  PIYEKPLSPYNREVSGSFIVAVEGVVSCK--------NGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFL----ATLSPFKLKGKAKVT
        PI    L P         +VAV GVV CK        N     P+KD V R+ C   N+K       S S   TD +GYF+     T++ + +KG     
Subjt:  PIYEKPLSPYNREVSGSFIVAVEGVVSCK--------NGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFL----ATLSPFKLKGKAKVT

Query:  QCKAFLPPSPCEDCKYLTDVNNGVTGALFR-------SFRILTHKKMKLYSVGPFFY
         C+AFL  SP   C  ++ +++G  G++ +       S  I+   K  +Y+VGPF +
Subjt:  QCKAFLPPSPCEDCKYLTDVNNGVTGALFR-------SFRILTHKKMKLYSVGPFFY

Q9FZ35 Proline-rich protein 1 | 1.1e-16 | 33.96
Query:  PNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCM---ALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKA
        P    P Y  P  PY  E+    I AV G++ CKNG   +P++   A+I C    +  +  NE+  +S    PTD  GYF   L+  K      ++ C+ 
Subjt:  PNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCM---ALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKA

Query:  FLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY
         L  SP E CK  T+VN G+TG     F + + K +KL++VGPF++++   A P    Y
Subjt:  FLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY

Q9LZJ7 Proline-rich protein 3 | 6.0e-15 | 33.77
Query:  PIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCE
        P Y     PY  E+    + AV+G++ CKNG   +P+     +I C      G         + PTD  GYF  +L+  K      +  C+  L  SP E
Subjt:  PIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCE

Query:  DCKYLTDVNNGVTG---ALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY
         CK  T+VN G+TG   AL+  +R    K ++L+SVGPF+Y+  P A P    Y
Subjt:  DCKYLTDVNNGVTG---ALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY

Arabidopsis top hits | e value | % identity | Alignment
AT1G54970.1 proline-rich protein 1 | 7.8e-18 | 33.96
Query:  PNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCM---ALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKA
        P    P Y  P  PY  E+    I AV G++ CKNG   +P++   A+I C    +  +  NE+  +S    PTD  GYF   L+  K      ++ C+ 
Subjt:  PNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCM---ALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKA

Query:  FLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY
         L  SP E CK  T+VN G+TG     F + + K +KL++VGPF++++   A P    Y
Subjt:  FLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY

AT2G47530.1 Pollen Ole e 1 allergen and extensin family protein | 3.5e-18 | 31.79
Query:  LWLLAAAPVTSAAGYYDDLPNGGMPIYEKPLSPYNREVSGSFI------VAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHG
        L LLA   V + A YY        P   KP + Y   V   ++      +A+EG + CK+G   +P++    ++ C  ++  G  +A  + S++PTD  G
Subjt:  LWLLAAAPVTSAAGYYDDLPNGGMPIYEKPLSPYNREVSGSFI------VAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHG

Query:  YFLATLSPFKLKGKA-KVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFR--SFRILTHKKMKLYSVGPFFYSS
        YF      + L  K   ++ CK  L  SP   CK  T+VN GVTGA     + + L+H  + LY++ PF++SS
Subjt:  YFLATLSPFKLKGKA-KVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFR--SFRILTHKKMKLYSVGPFFYSS

AT2G47540.1 Pollen Ole e 1 allergen and extensin family protein | 7.7e-26 | 41.43
Query:  SFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKV---TQCKAFLPPSPCEDCKYLTDVNNGVT
        S ++ V+G++ CK GS   P++  VAR+TC   +E G E    +  +  TD  GYFLATLS  ++K   KV    +C+AFL  SP + C + T++N G++
Subjt:  SFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKV---TQCKAFLPPSPCEDCKYLTDVNNGVT

Query:  GALFRSFRILTHK-KMKLYSVGPFFYSSEPNA-FPLPDGY
        GA+ +++R+L +K KMKL++VGPF +SSE      +P+GY
Subjt:  GALFRSFRILTHK-KMKLYSVGPFFYSSEPNA-FPLPDGY

AT3G62680.1 proline-rich protein 3 | 4.3e-16 | 33.77
Query:  PIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCE
        P Y     PY  E+    + AV+G++ CKNG   +P+     +I C      G         + PTD  GYF  +L+  K      +  C+  L  SP E
Subjt:  PIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCE

Query:  DCKYLTDVNNGVTG---ALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY
         CK  T+VN G+TG   AL+  +R    K ++L+SVGPF+Y+  P A P    Y
Subjt:  DCKYLTDVNNGVTG---ALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY

AT4G02270.1 root hair specific 13 | 4.0e-30 | 45.24
Query:  VAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRS
        +AVEG++ CK+G   +P++   ARI C+ ++  GNE+ P S  +  TD  GYF+AT+ P +L+    VT+CK +L  SP  DC + TDVN GV G    +
Subjt:  VAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGYFLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRS

Query:  FRILTHKKMKLYSVGPFFYSSEPNAF
        +RIL  K  KLY  GPFFY+SEP  +
Subjt:  FRILTHKKMKLYSVGPFFYSSEPNAF


Sequences
CDS sequence
ATGGCTCTCCCACGGCAGCTGATCTCTGCAACTCCGGTGATGCTACTGTGGCTGCTGGCCGCTGCCCCCGTCACTTCGGCGGCGGGTTATTATGACGACCTTCCCAACGG
TGGAATGCCGATCTATGAGAAACCGCTCTCACCGTACAACCGTGAAGTGTCGGGATCATTCATCGTCGCCGTTGAAGGAGTTGTTTCTTGTAAAAATGGTTCTACATATC
ACCCTCTTAAAGATGTTGTAGCAAGAATCACATGCATGGCTTTGAATGAAAAAGGCAACGAAATGGCTCCTTTTTCATTCTCCACTTTTCCAACCGACGACCATGGCTAT
TTCTTAGCTACATTGTCACCTTTCAAGCTCAAGGGCAAGGCCAAAGTCACACAATGCAAAGCCTTCCTTCCACCGTCGCCATGCGAGGACTGCAAATACCTTACCGACGT
TAACAATGGCGTTACGGGTGCTTTGTTTCGTTCTTTTCGAATCCTCACACACAAGAAGATGAAGTTGTACTCTGTTGGACCTTTCTTTTACTCCTCAGAACCAAATGCTT
TCCCTCTCCCTGATGGTTATTGA
mRNA sequence
ATGGCTCTCCCACGGCAGCTGATCTCTGCAACTCCGGTGATGCTACTGTGGCTGCTGGCCGCTGCCCCCGTCACTTCGGCGGCGGGTTATTATGACGACCTTCCCAACGG
TGGAATGCCGATCTATGAGAAACCGCTCTCACCGTACAACCGTGAAGTGTCGGGATCATTCATCGTCGCCGTTGAAGGAGTTGTTTCTTGTAAAAATGGTTCTACATATC
ACCCTCTTAAAGATGTTGTAGCAAGAATCACATGCATGGCTTTGAATGAAAAAGGCAACGAAATGGCTCCTTTTTCATTCTCCACTTTTCCAACCGACGACCATGGCTAT
TTCTTAGCTACATTGTCACCTTTCAAGCTCAAGGGCAAGGCCAAAGTCACACAATGCAAAGCCTTCCTTCCACCGTCGCCATGCGAGGACTGCAAATACCTTACCGACGT
TAACAATGGCGTTACGGGTGCTTTGTTTCGTTCTTTTCGAATCCTCACACACAAGAAGATGAAGTTGTACTCTGTTGGACCTTTCTTTTACTCCTCAGAACCAAATGCTT
TCCCTCTCCCTGATGGTTATTGA
Protein sequence
MALPRQLISATPVMLLWLLAAAPVTSAAGYYDDLPNGGMPIYEKPLSPYNREVSGSFIVAVEGVVSCKNGSTYHPLKDVVARITCMALNEKGNEMAPFSFSTFPTDDHGY
FLATLSPFKLKGKAKVTQCKAFLPPSPCEDCKYLTDVNNGVTGALFRSFRILTHKKMKLYSVGPFFYSSEPNAFPLPDGY
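
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (the page does not say which table was used); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are shown.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTCTCCCACGGCAG"))  # first six codons of the CDS: MALPRQ
print(translate("CCTGATGGTTATTGA"))     # last five codons, ending in TGA: PDGY
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the listed protein sequence would perform the complete check.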