; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009456 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009456
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationchr06:5300869..5302335
RNA-Seq ExpressionPI0009456
SyntenyPI0009456
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049529.1 uncharacterized protein E6C27_scaffold171G007840 [Cucumis melo var. makuwa]1.2e-13896.28Show/hide
Query:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
        AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAESTRSQ IWMGIDENNKEK+RCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
Subjt:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS

Query:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
        KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
Subjt:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE

Query:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        QQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHSHH Q RAP  SLLQSTAPFCKLS
Subjt:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus]1.7e-14597.45Show/hide
Query:  MGRRRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQH
        MGRR+KNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAE+TRSQVIWMGIDENNKEK+RCSYSGSGKTPLLCTASAFLGMAVMMVVQH
Subjt:  MGRRRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQH

Query:  LYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTA
        LYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTA
Subjt:  LYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTA

Query:  VRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        VRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHS HHQ RAP  SLLQSTAPFCKLS
Subjt:  VRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

XP_008438924.1 PREDICTED: uncharacterized protein LOC103483879 [Cucumis melo]6.2e-14396.4Show/hide
Query:  MGRRRKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        MGRRRKNM AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAESTRSQ IWMGIDENNKEK+RCSYSGSGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRRRKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHSHHHQ RAP  SLLQSTAPFCKLS
Subjt:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima]7.9e-12286.4Show/hide
Query:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEK--KRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL
        RRK MAVTH+DL PS +SSELGSKMGTFLIILT+LCGLCCFILCL+AESTRSQVIWMG DENN +K  KRC YSGSGKTPL+CTASAFLGMAVMMVVQHL
Subjt:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEK--KRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL

Query:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCK
        RAQR+FE Q NVRREVLESYHIHSSPPR   SPPLQPMPPIAREDPVIRHSHH +  +P L LL S+A FCK
Subjt:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCK

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida]1.7e-13793.48Show/hide
Query:  MGRRRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKE-KKRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        MGRRRKNMAVTH DLLPSP+SSELGSKMGTFL+ILT++CGLCCFILCLIAESTRSQVIWMG+DENNK   KRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRRRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKE-KKRCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        AVRAQR+FE+QENVRREVLESYHIHSSPPRS SSPPLQPMPPIAREDPVIRHSHHHQ  AP  SLLQSTAPFCKLS
Subjt:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

TrEMBL top hitse value%identityAlignment
A0A0A0L805 Uncharacterized protein5.6e-14297.76Show/hide
Query:  MAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV
        MAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAE+TRSQVIWMGIDENNKEK+RCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV
Subjt:  MAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV

Query:  SKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
        SKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
Subjt:  SKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF

Query:  EQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        EQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHS HHQ RAP  SLLQSTAPFCKLS
Subjt:  EQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

A0A1S3AY82 uncharacterized protein LOC1034838793.0e-14396.4Show/hide
Query:  MGRRRKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        MGRRRKNM AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAESTRSQ IWMGIDENNKEK+RCSYSGSGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRRRKNM-AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHSHHHQ RAP  SLLQSTAPFCKLS
Subjt:  AVRAQRMFEQQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

A0A5A7U7S1 Uncharacterized protein5.8e-13996.28Show/hide
Query:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
        AVTHDDLLPSPKSSELGSK+GTFLIILTILCGLCCFILCLIAESTRSQ IWMGIDENNKEK+RCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS
Subjt:  AVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVS

Query:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
        KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE
Subjt:  KSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFE

Query:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS
        QQENVRREVLESYHIHSSPPRSLS  SPPLQPMPPIAREDPVIRHSHH Q RAP  SLLQSTAPFCKLS
Subjt:  QQENVRREVLESYHIHSSPPRSLS--SPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLS

A0A6J1F8A9 uncharacterized protein LOC1114417762.1e-12085.29Show/hide
Query:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEK--KRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL
        R K MAVTH+DL PS +SSELGSKMGTFLIILT+LCGLCCFILCL+AESTRSQVIW G DENN +K  KRC YSGSGKTPL+CTASAFLGMAVMMVVQHL
Subjt:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEK--KRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL

Query:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCK
        RAQR+FE Q NVRREVLESYHIHSSPPR   SPP+QPMPPIAREDPVIRHSHH +  +P L LL S+A FCK
Subjt:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCK

A0A6J1I898 uncharacterized protein LOC1114708983.8e-12286.4Show/hide
Query:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEK--KRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL
        RRK MAVTH+DL PS +SSELGSKMGTFLIILT+LCGLCCFILCL+AESTRSQVIWMG DENN +K  KRC YSGSGKTPL+CTASAFLGMAVMMVVQHL
Subjt:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEK--KRCSYSGSGKTPLLCTASAFLGMAVMMVVQHL

Query:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCK
        RAQR+FE Q NVRREVLESYHIHSSPPR   SPPLQPMPPIAREDPVIRHSHH +  +P L LL S+A FCK
Subjt:  RAQRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)5.4e-0425Show/hide
Query:  KMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKSAPPALIAWDPSFATSK
        K  T + IL +   L  F   + AE  RS  I   I +       C Y     T     A  FL  +  +++     +      AP +  AW        
Subjt:  KMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKSAPPALIAWDPSFATSK

Query:  SLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
             +  +F+S+W++F V E  ++ G +  + H    S+   SC  +++G+F A  VF +AT+ L    YM
Subjt:  SLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

AT5G49320.1 Protein of unknown function (DUF1218)5.2e-7157.44Show/hide
Query:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYV
        R +  AVTH DL+P+PK+++L SK G F+ +LTI+ GL CF+LCL AE+TRSQ  W          K C Y+GSGKTPLLC A AF+G+AV MV  H+Y+
Subjt:  RRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYV

Query:  LIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRA
        LIAV+ S    L+ WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHL NWS PK SCLVI++GLFSAAGVF L TVFLA GLY+TA++A
Subjt:  LIAVSKSAPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRA

Query:  QRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIARE
         R+ +  EN  RE++E+  +++SPPRS    P   M  +ARE
Subjt:  QRMFEQQENVRREVLESYHIHSSPPRSLSSPPLQPMPPIARE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAAGAAGAAAAAATATGGCTGTCACACATGATGATCTTCTTCCAAGTCCAAAGAGCTCTGAATTGGGCAGCAAAATGGGGACTTTTCTTATCATTTTGACCAT
TCTTTGTGGTCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGTCTACTCGTTCTCAGGTGATATGGATGGGTATAGATGAAAATAATAAGGAAAAAAAGAGATGCTCGT
ACAGCGGCAGCGGGAAAACACCGCTGCTGTGCACGGCGAGCGCGTTTCTGGGGATGGCGGTGATGATGGTGGTGCAACATTTGTATGTGTTGATTGCAGTGAGTAAGTCA
GCTCCTCCTGCTCTCATTGCTTGGGATCCTTCTTTTGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTTGGAGA
AATTTTATTGTTAATAGGATTGAGTGTGGAGTCGGGGCATCTTAACAATTGGTCAACTCCAAAAGAAAGCTGTTTGGTGATCAAAGAAGGTTTGTTCTCAGCTGCCGGAG
TTTTTCAATTGGCCACCGTCTTCCTTGCCGCCGGTCTCTACATGACGGCGGTCCGAGCACAGAGAATGTTTGAACAGCAAGAAAATGTGAGAAGAGAGGTATTGGAAAGT
TACCATATCCACAGTTCCCCGCCACGGTCGTTGTCGTCGCCACCGCTGCAGCCAATGCCACCCATCGCGAGAGAGGACCCTGTAATTAGGCATAGCCACCACCATCAAGG
TCGGGCTCCCCTCTTGTCTCTGCTTCAATCTACTGCCCCTTTTTGCAAACTCTCTCCTTAA
mRNA sequenceShow/hide mRNA sequence
AAACAAACCAAAATTTTTAATGGGAAGAAGAAGAAAAAATATGGCTGTCACACATGATGATCTTCTTCCAAGTCCAAAGAGCTCTGAATTGGGCAGCAAAATGGGGACTT
TTCTTATCATTTTGACCATTCTTTGTGGTCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGTCTACTCGTTCTCAGGTGATATGGATGGGTATAGATGAAAATAATAAG
GAAAAAAAGAGATGCTCGTACAGCGGCAGCGGGAAAACACCGCTGCTGTGCACGGCGAGCGCGTTTCTGGGGATGGCGGTGATGATGGTGGTGCAACATTTGTATGTGTT
GATTGCAGTGAGTAAGTCAGCTCCTCCTGCTCTCATTGCTTGGGATCCTTCTTTTGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGA
TAAGTTTTGCAGTTGGAGAAATTTTATTGTTAATAGGATTGAGTGTGGAGTCGGGGCATCTTAACAATTGGTCAACTCCAAAAGAAAGCTGTTTGGTGATCAAAGAAGGT
TTGTTCTCAGCTGCCGGAGTTTTTCAATTGGCCACCGTCTTCCTTGCCGCCGGTCTCTACATGACGGCGGTCCGAGCACAGAGAATGTTTGAACAGCAAGAAAATGTGAG
AAGAGAGGTATTGGAAAGTTACCATATCCACAGTTCCCCGCCACGGTCGTTGTCGTCGCCACCGCTGCAGCCAATGCCACCCATCGCGAGAGAGGACCCTGTAATTAGGC
ATAGCCACCACCATCAAGGTCGGGCTCCCCTCTTGTCTCTGCTTCAATCTACTGCCCCTTTTTGCAAACTCTCTCCTTAAAACTTTTCTTTTTGATGAAAGGAGATGATT
TTTTTTTTTTTTAAAAAAAAGAAATTTGTATATAATAGGTTAAGTAATATTTTGATGTTTGTACTTCTCAATTTTACTTTTTGTAATTTTTTAGTTTGTTACTTTTAAAA
ATGAATGATGTTTGATTCGTTTGTAAATCTTAACACAAA
Protein sequenceShow/hide protein sequence
MGRRRKNMAVTHDDLLPSPKSSELGSKMGTFLIILTILCGLCCFILCLIAESTRSQVIWMGIDENNKEKKRCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKS
APPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFEQQENVRREVLES
YHIHSSPPRSLSSPPLQPMPPIAREDPVIRHSHHHQGRAPLLSLLQSTAPFCKLSP