; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003588 (gene) of Chayote v1 genome

Gene IDSed0003588
OrganismSechium edule (Chayote v1)
DescriptionU-box domain-containing protein
Genome locationLG10:2748429..2750811
RNA-Seq ExpressionSed0003588
SyntenySed0003588
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132478.1 uncharacterized protein LOC111005326 isoform X1 [Momordica charantia]5.5e-9880.88Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT
        MF AL+LPPPC AAKHN     PS VKLCRL Y+LR PNRRL L S+R+QS SSSP SDPS+SS YTET+G+SSPA LQ SQW LTQRHILVLNVVACAT
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT
        AISA WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRV EERLR L+NM PTASVQEMT T
Subjt:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT

Query:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
        + K+ AAEP LA+RARGIKEGI+K RS F+LFL+LTRFS +ALNYL  RGK
Subjt:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

XP_022132479.1 uncharacterized protein LOC111005326 isoform X2 [Momordica charantia]1.2e-9580.08Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT
        MF AL+LPPPC AAKHN     PS VKLCRL Y+LR PNRRL L S+R+QS SSSP SDPS+SS YTET+G+SSPA LQ SQW LTQRHILVLNVVACAT
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT
        AISA WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRV EERLR L+NM PT  VQEMT T
Subjt:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT

Query:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
        + K+ AAEP LA+RARGIKEGI+K RS F+LFL+LTRFS +ALNYL  RGK
Subjt:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

XP_022949654.1 uncharacterized protein LOC111452981 isoform X1 [Cucurbita moschata]1.3e-9479.84Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA
        MFRALEL PPC AAKH+  H  PS VKL R   Y+LRLPNRRL LLS+RAQS SSSP SDPS+S RYTET+G+SSPA++QFSQ TLTQRH+LVLNVVACA
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA

Query:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA
        TAI+A WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLR L+NM PTA VQEMT 
Subjt:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA

Query:  TNL-KMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
         NL  +EAAEP LA+RAR IK GI+K RS FQLFLSLTRFS +ALN+  KRGK
Subjt:  TNL-KMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

XP_022977945.1 uncharacterized protein LOC111478086 isoform X1 [Cucurbita maxima]7.2e-9881.35Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA
        MFRALEL PPC AAKH   H  PS VKLCR   ++LRLPNRRL LLS+RAQS SSSP SDPS+S RYTET+G+SSPAF+QFSQ TLTQRHILVLNVVACA
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA

Query:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA
        TAI+A WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQ ITQGVRSSTRAVRVAEERLR L+NM PTA VQEMT 
Subjt:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA

Query:  TNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
         NL +EAAEP LA+RAR IKEGI+K RS FQLFLSLTRFS +ALN+  KRGK
Subjt:  TNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

XP_038880851.1 uncharacterized protein LOC120072535 [Benincasa hispida]3.2e-9881.27Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT
        MFRALELPPPC AAK N  H  PS VK CRL YSL LPNRRL LL +RAQS      SDPS+SSRYTET+G+SSPAFLQFSQ TLTQ HI VLNVVACAT
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT
        AISA WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMEL+DLGQ+ITQGVRSSTRAVRVAE+RLRRL+NM PTASVQEMT T
Subjt:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT

Query:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
        NL +E AEP LA+RAR IKEGI+K RS FQLFLSLTRFS +ALNY  KRGK
Subjt:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

TrEMBL top hitse value%identityAlignment
A0A1S3B164 uncharacterized protein LOC103484701 isoform X11.5e-9378.09Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT
        MF ALE+ PPC AAK N     PS  K  RL Y+L LPNRRL LLS+RAQS      SDPS+SSRYT+T+G SSPAFLQF + TLTQRHILVLNVVACAT
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT
        AISA WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLRRL+NM+PTASVQEMT T
Subjt:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT

Query:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
        NL ++ A+P LA+RAR IKEGI+K RS FQLFLS+TRFS +ALNY  KRGK
Subjt:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

A0A6J1BT62 uncharacterized protein LOC111005326 isoform X25.6e-9680.08Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT
        MF AL+LPPPC AAKHN     PS VKLCRL Y+LR PNRRL L S+R+QS SSSP SDPS+SS YTET+G+SSPA LQ SQW LTQRHILVLNVVACAT
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT
        AISA WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRV EERLR L+NM PT  VQEMT T
Subjt:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT

Query:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
        + K+ AAEP LA+RARGIKEGI+K RS F+LFL+LTRFS +ALNYL  RGK
Subjt:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

A0A6J1BWC9 uncharacterized protein LOC111005326 isoform X12.7e-9880.88Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT
        MF AL+LPPPC AAKHN     PS VKLCRL Y+LR PNRRL L S+R+QS SSSP SDPS+SS YTET+G+SSPA LQ SQW LTQRHILVLNVVACAT
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT
        AISA WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRV EERLR L+NM PTASVQEMT T
Subjt:  AISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTAT

Query:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
        + K+ AAEP LA+RARGIKEGI+K RS F+LFL+LTRFS +ALNYL  RGK
Subjt:  NLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

A0A6J1GDG4 uncharacterized protein LOC111452981 isoform X16.2e-9579.84Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA
        MFRALEL PPC AAKH+  H  PS VKL R   Y+LRLPNRRL LLS+RAQS SSSP SDPS+S RYTET+G+SSPA++QFSQ TLTQRH+LVLNVVACA
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA

Query:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA
        TAI+A WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLR L+NM PTA VQEMT 
Subjt:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA

Query:  TNL-KMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
         NL  +EAAEP LA+RAR IK GI+K RS FQLFLSLTRFS +ALN+  KRGK
Subjt:  TNL-KMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

A0A6J1INQ2 uncharacterized protein LOC111478086 isoform X13.5e-9881.35Show/hide
Query:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA
        MFRALEL PPC AAKH   H  PS VKLCR   ++LRLPNRRL LLS+RAQS SSSP SDPS+S RYTET+G+SSPAF+QFSQ TLTQRHILVLNVVACA
Subjt:  MFRALELPPPCAAAKHNPFHEPPSGVKLCR-LLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACA

Query:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA
        TAI+A WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAA+RLSGMEISDLTMELSDLGQ ITQGVRSSTRAVRVAEERLR L+NM PTA VQEMT 
Subjt:  TAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTA

Query:  TNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK
         NL +EAAEP LA+RAR IKEGI+K RS FQLFLSLTRFS +ALN+  KRGK
Subjt:  TNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMALNYLIKRGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08530.1 unknown protein2.2e-2043.26Show/hide
Query:  TLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLR
        +L+ +  L+L  + C T+++   L  +AIPTL+A  RAA S  KL D  R+E+P T+AAVRLSGMEISDLT+ELSDL Q+IT G+  S +AV+ AE  ++
Subjt:  TLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLR

Query:  RLSNM--NPTASVQEMTATNLKMEAAEPALARRARGIKEGI
        ++  +    T S+ E  A NL   + +P +A  A      I
Subjt:  RLSNM--NPTASVQEMTATNLKMEAAEPALARRARGIKEGI

AT5G09995.1 unknown protein1.1e-4670.75Show/hide
Query:  SLRAQSPSS--SPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAV
        S+ + +PS+  S   +PS SS+ T ++G      LQ SQWT TQ+H ++LNVVAC TAISA WLF +AIPTLLAFK+AAESLEKL+DVTREE+P TMAAV
Subjt:  SLRAQSPSS--SPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAV

Query:  RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNP
        RLSGMEISDLTMELSDLGQ ITQGV+SSTRA+RVAE+RLRRL+NMNP
Subjt:  RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNP

AT5G09995.2 unknown protein5.8e-6163.46Show/hide
Query:  SLRAQSPSS--SPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAV
        S+ + +PS+  S   +PS SS+ T ++G      LQ SQWT TQ+H ++LNVVAC TAISA WLF +AIPTLLAFK+AAESLEKL+DVTREE+P TMAAV
Subjt:  SLRAQSPSS--SPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAV

Query:  RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTATNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMAL
        RLSGMEISDLTMELSDLGQ ITQGV+SSTRA+RVAE+RLRRL+NMNP AS+QE+     K +  EP LA++AR  +EG++K RS +QLF ++TRFS +  
Subjt:  RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTATNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMAL

Query:  NYLIKRGK
        +YL KR K
Subjt:  NYLIKRGK

AT5G09995.3 unknown protein1.1e-5963.46Show/hide
Query:  SLRAQSPSS--SPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAV
        S+ + +PS+  S   +PS SS+ T ++G      LQ SQWT TQ+H ++LNVVAC TAISA WLF +AIPTLLAFK+AAESLEKL+DVTREE+P TMAAV
Subjt:  SLRAQSPSS--SPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAV

Query:  RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTATNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMAL
        RLSGMEISDLTMELSDLGQ ITQGV+SSTRA+RVAE+RLRRL+NMNP AS+QE+     K +  EP LA++AR  +EG++K RS +QLF ++TRFS +  
Subjt:  RLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTATNLKMEAAEPALARRARGIKEGILKSRSTFQLFLSLTRFSWMAL

Query:  NYLIKRGK
        +YL KR K
Subjt:  NYLIKRGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAGAGCTTTGGAACTACCGCCGCCGTGTGCGGCAGCGAAGCATAATCCCTTTCATGAACCACCGAGCGGCGTGAAACTCTGCCGACTATTATACAGTCTCAGGCT
GCCGAATCGTCGACTTCCTTTGCTTTCGCTTCGAGCTCAATCGCCATCGTCATCGCCGTCATCAGATCCGTCGTCTTCATCGCGTTATACGGAAACTCTTGGATATTCTT
CACCTGCATTTCTTCAATTCTCTCAGTGGACGCTAACTCAACGCCACATCCTTGTTCTTAATGTCGTCGCCTGTGCGACGGCTATTTCTGCAATCTGGCTCTTTTGTTCT
GCAATCCCCACTCTTCTGGCATTCAAGAGAGCAGCTGAATCATTAGAGAAACTCATGGATGTCACAAGGGAGGAAATTCCTGGCACTATGGCAGCCGTACGGTTATCTGG
AATGGAAATTAGTGATCTGACCATGGAGCTTAGTGATCTTGGCCAGGAAATCACACAAGGTGTGAGAAGTTCAACTAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTC
GCTTGTCAAACATGAATCCAACAGCCTCAGTGCAGGAAATGACAGCAACCAATCTGAAAATGGAGGCAGCTGAGCCAGCTTTGGCTAGAAGGGCAAGGGGCATTAAGGAA
GGGATATTGAAAAGCCGTTCCACTTTCCAATTATTTCTCTCCCTTACACGGTTCTCTTGGATGGCATTGAATTATCTTATCAAACGAGGTAAGAACTAG
mRNA sequenceShow/hide mRNA sequence
GAATATTTCACATCCTCTTCGAAATCGCCGAGCAGATCAACTTCAGCGGAAGAAATGTTCAGAGCTTTGGAACTACCGCCGCCGTGTGCGGCAGCGAAGCATAATCCCTT
TCATGAACCACCGAGCGGCGTGAAACTCTGCCGACTATTATACAGTCTCAGGCTGCCGAATCGTCGACTTCCTTTGCTTTCGCTTCGAGCTCAATCGCCATCGTCATCGC
CGTCATCAGATCCGTCGTCTTCATCGCGTTATACGGAAACTCTTGGATATTCTTCACCTGCATTTCTTCAATTCTCTCAGTGGACGCTAACTCAACGCCACATCCTTGTT
CTTAATGTCGTCGCCTGTGCGACGGCTATTTCTGCAATCTGGCTCTTTTGTTCTGCAATCCCCACTCTTCTGGCATTCAAGAGAGCAGCTGAATCATTAGAGAAACTCAT
GGATGTCACAAGGGAGGAAATTCCTGGCACTATGGCAGCCGTACGGTTATCTGGAATGGAAATTAGTGATCTGACCATGGAGCTTAGTGATCTTGGCCAGGAAATCACAC
AAGGTGTGAGAAGTTCAACTAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTCGCTTGTCAAACATGAATCCAACAGCCTCAGTGCAGGAAATGACAGCAACCAATCTG
AAAATGGAGGCAGCTGAGCCAGCTTTGGCTAGAAGGGCAAGGGGCATTAAGGAAGGGATATTGAAAAGCCGTTCCACTTTCCAATTATTTCTCTCCCTTACACGGTTCTC
TTGGATGGCATTGAATTATCTTATCAAACGAGGTAAGAACTAGAGAAGAAACTTCAAGATTTCAAATGGGAGCAAAATTTAAAAAAATTGGCTGCTATAAGTAATTTCCT
TTCCTTTTGAGCTGATAAGTTTATGATTTTAGAATTATTGTCAAAATTGACCGGTTAGCTTCAAATGGAGGGAATTCTCTATCCCTTTTTTTTTTCTTCCTCCTATTTTG
CTTCCACTAGATTGTATTATGGTTGGTTTCTGTTACTTTTTTTTTTTTTTGATATCTGTGGCTATCTAAATCAATTTGCG
Protein sequenceShow/hide protein sequence
MFRALELPPPCAAAKHNPFHEPPSGVKLCRLLYSLRLPNRRLPLLSLRAQSPSSSPSSDPSSSSRYTETLGYSSPAFLQFSQWTLTQRHILVLNVVACATAISAIWLFCS
AIPTLLAFKRAAESLEKLMDVTREEIPGTMAAVRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLSNMNPTASVQEMTATNLKMEAAEPALARRARGIKE
GILKSRSTFQLFLSLTRFSWMALNYLIKRGKN