CuGenDBv2

CSPI03G14820 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G14820
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of unknown function (DUF1218)
Genome location: Chr3:11022563..11024029
RNA-Seq Expression: CSPI03G14820
Synteny: CSPI03G14820
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
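
The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr3:11022563..11024029")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1467 bp on Chr3.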


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049529.1 uncharacterized protein E6C27_scaffold171G007840 [Cucumis melo var. makuwa] | e value: 7.6e-141 | % identity: 96.67
Query:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
        AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAE+TRSQ IWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
Subjt:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS

Query:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
        KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
Subjt:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE

Query:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        QQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHS H Q+RAPFWSLLQSTAPFCKLSA
Subjt:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
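
In these alignments the unlabeled middle row marks each column: a letter for an identical residue, `+` for a similar one, and a space for a mismatch or gap. As a rough sketch of how an identity figure relates to that row, the snippet below counts identical columns for the last block of the alignment above only. The function name `block_identity` is hypothetical, and the page's % identity value covers the whole alignment, not this single block.

```python
# Query and midline rows copied from the final block of the KAA0049529.1 alignment.
query = "QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA"
midline = "QQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHS H Q+RAPFWSLLQSTAPFCKLSA"


def block_identity(query_row: str, midline_row: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for one alignment block."""
    if len(query_row) != len(midline_row):
        raise ValueError("query and midline rows must have the same length")
    # A letter in the midline marks an identical residue; '+' and ' ' do not count.
    identical = sum(ch.isalpha() for ch in midline_row)
    return identical, len(midline_row)


identical, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical = {100 * identical / columns:.2f}%")
```

Summing the two counts over every block of a hit before dividing gives a whole-alignment figure.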

XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus] | e value: 8.1e-151 | % identity: 100
Query:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQH
        MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQH
Subjt:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQH

Query:  LYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTA
        LYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTA
Subjt:  LYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTA

Query:  VRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        VRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
Subjt:  VRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA

XP_008438924.1 PREDICTED: uncharacterized protein LOC103483879 [Cucumis melo] | e value: 8.7e-145 | % identity: 96.42
Query:  MGRRKKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        MGRR+KNM AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAE+TRSQ IWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRRKKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHS HHQ+RAPFWSLLQSTAPFCKLSA
Subjt:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima] | e value: 1.9e-120 | % identity: 85.82
Query:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEK--RRCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGRRK  MAVTH+DL PS +SSELGSKMGTFLIILT+LCGLCCFILCL+AE+TRSQVIWMG DENN +K  +RC YSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEK--RRCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCK
        TAVRAQR+FE Q NVRREVLESYHIHSSPPR   SPPLQPMPPIAREDPVIRHS HH E +PF  LL S+A FCK
Subjt:  TAVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCK

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida] | e value: 1.7e-137 | % identity: 92.78
Query:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKE-KRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        MGRR+KNMAVTH DLLPSP+SSELGSKMGTFL+ILT++CGLCCFILCLIAE+TRSQVIWMG+DENNK   +RCSYSGSGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKE-KRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        AVRAQR+FE+QENVRREVLESYHIHSSPPRS SSPPLQPMPPIAREDPVIRHS HHQE APF+SLLQSTAPFCKLSA
Subjt:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L805 Uncharacterized protein | e value: 5.8e-147 | % identity: 100
Query:  MAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV
        MAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV
Subjt:  MAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV

Query:  SKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
        SKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
Subjt:  SKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF

Query:  EQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        EQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
Subjt:  EQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA

A0A1S3AY82 uncharacterized protein LOC103483879 | e value: 4.2e-145 | % identity: 96.42
Query:  MGRRKKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        MGRR+KNM AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAE+TRSQ IWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRRKKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHS HHQ+RAPFWSLLQSTAPFCKLSA
Subjt:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA

A0A5A7U7S1 Uncharacterized protein | e value: 3.7e-141 | % identity: 96.67
Query:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
        AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAE+TRSQ IWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
Subjt:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS

Query:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
        KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
Subjt:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE

Query:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
        QQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHS H Q+RAPFWSLLQSTAPFCKLSA
Subjt:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA

A0A6J1F8A9 uncharacterized protein LOC111441776 | e value: 5.2e-119 | % identity: 84.93
Query:  RKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEK--RRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL
        R K MAVTH+DL PS +SSELGSKMGTFLIILT+LCGLCCFILCL+AE+TRSQVIW G DENN +K  +RC YSGSGKTPL+CTASAFLGMAVMMVVQHL
Subjt:  RKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEK--RRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL

Query:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCK
        RAQR+FE Q NVRREVLESYHIHSSPPR   SPP+QPMPPIAREDPVIRHS HH E +PF  LL S+A FCK
Subjt:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCK

A0A6J1I898 uncharacterized protein LOC111470898 | e value: 9.4e-121 | % identity: 85.82
Query:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEK--RRCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGRRK  MAVTH+DL PS +SSELGSKMGTFLIILT+LCGLCCFILCL+AE+TRSQVIWMG DENN +K  +RC YSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEK--RRCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCK
        TAVRAQR+FE Q NVRREVLESYHIHSSPPR   SPPLQPMPPIAREDPVIRHS HH E +PF  LL S+A FCK
Subjt:  TAVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G13380.1 Protein of unknown function (DUF1218) | e value: 5.5e-04 | % identity: 25
Query:  KMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKSAPPALIAWDPSFATSK
        K  T + IL +   L  F   + AE  RS  I   I +       C Y     T     A  FL  +  +++     +      AP +  AW        
Subjt:  KMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKSAPPALIAWDPSFATSK

Query:  SLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
             +  +F+S+W++F V E  ++ G +  + H    S+   SC  +++G+F A  VF +AT+ L    YM
Subjt:  SLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

AT5G49320.1 Protein of unknown function (DUF1218) | e value: 2.6e-70 | % identity: 57.02
Query:  RKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYV
        R +  AVTH DL+P+PK+++L SK G F+ +LTI+ GL CF+LCL AE TRSQ  W          + C Y+GSGKTPLLC A AF+G+AV MV  H+Y+
Subjt:  RKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYV

Query:  LIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRA
        LIAV+ S    L+ WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHL NWS PK SCLVI++GLFSAAGVF L TVFLA GLY+TA++A
Subjt:  LIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRA

Query:  QRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIARE
         R+ +  EN  RE++E+  +++SPPRS    P   M  +ARE
Subjt:  QRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIARE


Sequences
CDS sequence
ATGGGAAGAAGAAAAAAAAACATGGCTGTCACACATGATGATCTTCTTCCAAGTCCAAAGAGCTCTGAATTGGGCAGCAAAATGGGTACTTTTCTTATCATTTTGACCAT
TCTCTGTGGTCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGACCACTCGTTCTCAGGTGATATGGATGGGTATAGATGAAAATAATAAGGAAAAAAGGAGATGCTCGT
ATAGCGGCAGCGGGAAGACACCGCTGCTGTGCACGGCGAGCGCGTTTCTGGGGATGGCGGTGATGATGGTGGTGCAACATTTGTATGTGTTGATTGCAGTGAGTAAGTCA
GCTCCTCCTGCTCTCATTGCTTGGGATCCTTCTTTTGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTTGGAGA
AATTTTATTGTTAATAGGATTGAGTGTGGAGTCAGGGCATCTTAACAATTGGTCAACTCCAAAAGAAAGCTGTTTGGTGATCAAAGAAGGTTTGTTCTCAGCTGCCGGAG
TTTTTCAATTGGCCACAGTCTTCCTTGCCGCCGGTCTCTACATGACCGCGGTCCGAGCACAGAGAATGTTTGAACAGCAAGAAAATGTGAGAAGAGAGGTATTGGAAAGT
TACCATATCCACAGTTCACCACCGCGGTCATTGTCGTCGCCACCGCTGCAGCCAATGCCGCCCATCGCAAGAGAGGACCCTGTAATAAGACATAGCCAACACCATCAAGA
GCGGGCTCCTTTCTGGTCTCTGCTTCAATCTACTGCCCCTTTTTGCAAACTCTCTGCTTAA
mRNA sequence
AAACAAACCAAAATTTTTAATGGGAAGAAGAAAAAAAAACATGGCTGTCACACATGATGATCTTCTTCCAAGTCCAAAGAGCTCTGAATTGGGCAGCAAAATGGGTACTT
TTCTTATCATTTTGACCATTCTCTGTGGTCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGACCACTCGTTCTCAGGTGATATGGATGGGTATAGATGAAAATAATAAG
GAAAAAAGGAGATGCTCGTATAGCGGCAGCGGGAAGACACCGCTGCTGTGCACGGCGAGCGCGTTTCTGGGGATGGCGGTGATGATGGTGGTGCAACATTTGTATGTGTT
GATTGCAGTGAGTAAGTCAGCTCCTCCTGCTCTCATTGCTTGGGATCCTTCTTTTGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGA
TAAGTTTTGCAGTTGGAGAAATTTTATTGTTAATAGGATTGAGTGTGGAGTCAGGGCATCTTAACAATTGGTCAACTCCAAAAGAAAGCTGTTTGGTGATCAAAGAAGGT
TTGTTCTCAGCTGCCGGAGTTTTTCAATTGGCCACAGTCTTCCTTGCCGCCGGTCTCTACATGACCGCGGTCCGAGCACAGAGAATGTTTGAACAGCAAGAAAATGTGAG
AAGAGAGGTATTGGAAAGTTACCATATCCACAGTTCACCACCGCGGTCATTGTCGTCGCCACCGCTGCAGCCAATGCCGCCCATCGCAAGAGAGGACCCTGTAATAAGAC
ATAGCCAACACCATCAAGAGCGGGCTCCTTTCTGGTCTCTGCTTCAATCTACTGCCCCTTTTTGCAAACTCTCTGCTTAAAACATTTTTTTTCTTTTTTCTTCTCTCTTT
TTGGAT
Protein sequence
MGRRKKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAETTRSQVIWMGIDENNKEKRRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKS
APPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFEQQENVRREVLES
YHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSQHHQERAPFWSLLQSTAPFCKLSA
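
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code and, to stay short, translates only the first ten codons of the CDS. The names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS sequence listed above, spaced here for readability.
cds_prefix = "ATG GGA AGA AGA AAA AAA AAC ATG GCT GTC".replace(" ", "")
print(translate(cds_prefix))
```

The result can be compared with the first ten residues of the protein sequence. Passing the full CDS to `translate` would extend the same check to the whole record.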