; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G032450 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G032450
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionU-box domain-containing protein
Genome locationchrH02:6519979..6521463
RNA-Seq ExpressionChy2G032450
SyntenyChy2G032450
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647483.1 hypothetical protein Csa_002826 [Cucumis sativus]7.65e-16397.57Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLP LSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSST+AVRV EERLRRLTNMSPTASVQEMTITNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        GAEPVLAKRA+DIKEGILKGRSIFQLFLSLTRFS LALNYFSKRGKK
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

TYK12956.1 uncharacterized protein E5676_scaffold255G005520 [Cucumis melo var. makuwa]3.53e-15594.26Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEI PPCPAAKLNVVRALPS+F+FYRLPYNLGLPNR+L LLSIRAQSLSDPSTSSRYTDTIG SSPAFLQFP+CTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLRRLTNMSPTASVQEMTITNLGVK
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKR
        GA+PVLAKRARDIKEGI+KGRSIFQLFLS+TRFSRLALNYFSKR
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKR

XP_004142034.1 uncharacterized protein LOC101204218 isoform X1 [Cucumis sativus]8.84e-16397.57Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLP LSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSST+AVRV EERLRRLTNMSPTASVQEMTITNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        GAEPVLAKRA+DIKEGILKGRSIFQLFLSLTRFS LALNYFSKRGKK
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

XP_008440148.1 PREDICTED: uncharacterized protein LOC103484701 isoform X1 [Cucumis melo]9.77e-15894.33Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEI PPCPAAKLNVVRALPS+F+FYRLPYNLGLPNR+L LLSIRAQSLSDPSTSSRYTDTIG SSPAFLQFP+CTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLRRLTNMSPTASVQEMTITNLGVK
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        GA+PVLAKRARDIKEGI+KGRSIFQLFLS+TRFSRLALNYFSKRGKK
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

XP_011657800.1 uncharacterized protein LOC101204218 isoform X2 [Cucumis sativus]1.30e-15996.76Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLP LSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSST+AVRV EERLRRLTNMSPT  VQEMTITNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        GAEPVLAKRA+DIKEGILKGRSIFQLFLSLTRFS LALNYFSKRGKK
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

TrEMBL top hitse value%identityAlignment
A0A1S3B011 uncharacterized protein LOC103484701 isoform X29.3e-12093.52Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEI PPCPAAKLNVVRALPS+F+FYRLPYNLGLPNR+L LLSIRAQSLSDPSTSSRYTDTIG SSPAFLQFP+CTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLRRLTNMSPT  VQEMTITNLGVK
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        GA+PVLAKRARDIKEGI+KGRSIFQLFLS+TRFSRLALNYFSKRGKK
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

A0A1S3B164 uncharacterized protein LOC103484701 isoform X13.4e-12294.33Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEI PPCPAAKLNVVRALPS+F+FYRLPYNLGLPNR+L LLSIRAQSLSDPSTSSRYTDTIG SSPAFLQFP+CTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLRRLTNMSPTASVQEMTITNLGVK
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        GA+PVLAKRARDIKEGI+KGRSIFQLFLS+TRFSRLALNYFSKRGKK
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

A0A5D3CRC7 Uncharacterized protein3.2e-12094.26Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT
        MFTALEI PPCPAAKLNVVRALPS+F+FYRLPYNLGLPNR+L LLSIRAQSLSDPSTSSRYTDTIG SSPAFLQFP+CTLTQRHILVLNVVACATAISAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK
        WLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLRRLTNMSPTASVQEMTITNLGVK
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVK

Query:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKR
        GA+PVLAKRARDIKEGI+KGRSIFQLFLS+TRFSRLALNYFSKR
Subjt:  GAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKR

A0A6J1GDG4 uncharacterized protein LOC111452981 isoform X15.1e-10281.82Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYR-LPYNLGLPNRQLPLLSIRAQSL----SDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACAT
        MF ALE+ PPCPAAK ++V A PS  +  R  PYNL LPNR+L LLS+RAQSL    SDPSTS RYT+TIG SSPA++QF QCTLTQRH+LVLNVVACAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYR-LPYNLGLPNRQLPLLSIRAQSL----SDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTIT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLR LTNM+PTA VQEMT+ 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTIT

Query:  NLG-VKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        NLG V+ AEPVLAKRARDIK GI+KGRSIFQLFLSLTRFSRLALN+FSKRGKK
Subjt:  NLG-VKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

A0A6J1INQ2 uncharacterized protein LOC111478086 isoform X13.2e-10482.94Show/hide
Query:  MFTALEIPPPCPAAKLNVVRALPSQFRFYR-LPYNLGLPNRQLPLLSIRAQSL----SDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACAT
        MF ALE+ PPCPAAK  +V A PS  +  R  P+NL LPNR+L LLS+RAQSL    SDPSTS RYT+TIG SSPAF+QF QCTLTQRHILVLNVVACAT
Subjt:  MFTALEIPPPCPAAKLNVVRALPSQFRFYR-LPYNLGLPNRQLPLLSIRAQSL----SDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTIT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSDLGQ ITQGVRSST+AVRV EERLR LTNM+PTA VQEMT+ 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTIT

Query:  NLGVKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        NLGV+ AEPVLAKRARDIKEGI+KGRSIFQLFLSLTRFSRLALN+FSKRGKK
Subjt:  NLGVKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08530.1 unknown protein3.4e-2141.43Show/hide
Query:  TLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLR
        +L+ +  L+L  + C T+++ T L  +AIPTL+A  RAA S  KL D  R+E+P T+AA+RLSGMEISDLT+ELSDL Q IT G+  S KAV+  E  ++
Subjt:  TLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLR

Query:  RLTNMSPTASVQEM-TITNLGVKGAEPVLAKRARDIKEGI
        ++  ++   ++  +    NL     +PV+A  A      I
Subjt:  RLTNMSPTASVQEM-TITNLGVKGAEPVLAKRARDIKEGI

AT5G09995.1 unknown protein5.2e-4672.73Show/hide
Query:  DPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELS
        +PS SS+ T ++G      LQ  Q T TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREE+P TMAA+RLSGMEISDLTMELS
Subjt:  DPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELS

Query:  DLGQGITQGVRSSTKAVRVVEERLRRLTNMSP
        DLGQGITQGV+SST+A+RV E+RLRRLTNM+P
Subjt:  DLGQGITQGVRSSTKAVRVVEERLRRLTNMSP

AT5G09995.2 unknown protein3.0e-6263.92Show/hide
Query:  DPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELS
        +PS SS+ T ++G      LQ  Q T TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREE+P TMAA+RLSGMEISDLTMELS
Subjt:  DPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELS

Query:  DLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        DLGQGITQGV+SST+A+RV E+RLRRLTNM+P AS+QE+ +        EP+LAK+AR  +EG++KGRS++QLF ++TRFS++  +Y +KR K+
Subjt:  DLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK

AT5G09995.3 unknown protein5.7e-6163.92Show/hide
Query:  DPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELS
        +PS SS+ T ++G      LQ  Q T TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREE+P TMAA+RLSGMEISDLTMELS
Subjt:  DPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELS

Query:  DLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK
        DLGQGITQGV+SST+A+RV E+RLRRLTNM+P AS+QE+ +        EP+LAK+AR  +EG++KGRS++QLF ++TRFS++  +Y +KR K+
Subjt:  DLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVKGAEPVLAKRARDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTACAGCTTTGGAAATACCACCACCGTGTCCGGCGGCGAAGCTTAATGTCGTTCGAGCACTTCCAAGCCAATTTAGATTCTACCGACTACCTTACAATCTC
GGCCTTCCTAACCGTCAACTTCCTTTGCTTTCAATACGAGCACAATCGCTATCTGATCCATCGACTTCATCGCGTTATACGGACACGATTGGAACCTCCTCTCCA
GCATTTCTTCAATTCCCTCAGTGCACGCTAACTCAACGCCACATCCTTGTTCTCAATGTCGTTGCCTGCGCGACGGCAATTTCTGCAACCTGGCTCTTTTGTTCT
GCGATCCCCACTCTTCTGGCATTCAAGAGAGCAGCCGAATCATTAGAGAAACTCATGGATGTCACAAGGGAGGAAATTCCAGGAACTATGGCAGCCATTCGGTTA
TCTGGCATGGAAATCAGTGATTTGACCATGGAACTCAGTGATCTTGGGCAGGGTATCACCCAAGGTGTGAGAAGTTCCACTAAAGCTGTTCGAGTAGTGGAAGAG
AGATTACGTCGCTTGACAAACATGTCTCCAACAGCCTCAGTGCAGGAAATGACAATAACCAATCTGGGAGTGAAGGGAGCAGAGCCAGTTCTGGCTAAAAGGGCA
AGAGACATTAAGGAAGGGATTCTGAAAGGACGTTCCATCTTCCAATTATTTCTCTCCCTTACAAGATTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGGT
AAGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTACAGCTTTGGAAATACCACCACCGTGTCCGGCGGCGAAGCTTAATGTCGTTCGAGCACTTCCAAGCCAATTTAGATTCTACCGACTACCTTACAATCTC
GGCCTTCCTAACCGTCAACTTCCTTTGCTTTCAATACGAGCACAATCGCTATCTGATCCATCGACTTCATCGCGTTATACGGACACGATTGGAACCTCCTCTCCA
GCATTTCTTCAATTCCCTCAGTGCACGCTAACTCAACGCCACATCCTTGTTCTCAATGTCGTTGCCTGCGCGACGGCAATTTCTGCAACCTGGCTCTTTTGTTCT
GCGATCCCCACTCTTCTGGCATTCAAGAGAGCAGCCGAATCATTAGAGAAACTCATGGATGTCACAAGGGAGGAAATTCCAGGAACTATGGCAGCCATTCGGTTA
TCTGGCATGGAAATCAGTGATTTGACCATGGAACTCAGTGATCTTGGGCAGGGTATCACCCAAGGTGTGAGAAGTTCCACTAAAGCTGTTCGAGTAGTGGAAGAG
AGATTACGTCGCTTGACAAACATGTCTCCAACAGCCTCAGTGCAGGAAATGACAATAACCAATCTGGGAGTGAAGGGAGCAGAGCCAGTTCTGGCTAAAAGGGCA
AGAGACATTAAGGAAGGGATTCTGAAAGGACGTTCCATCTTCCAATTATTTCTCTCCCTTACAAGATTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGGT
AAGAAGTAG
Protein sequenceShow/hide protein sequence
MFTALEIPPPCPAAKLNVVRALPSQFRFYRLPYNLGLPNRQLPLLSIRAQSLSDPSTSSRYTDTIGTSSPAFLQFPQCTLTQRHILVLNVVACATAISATWLFCS
AIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSDLGQGITQGVRSSTKAVRVVEERLRRLTNMSPTASVQEMTITNLGVKGAEPVLAKRA
RDIKEGILKGRSIFQLFLSLTRFSRLALNYFSKRGKK