; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G006400 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G006400
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF1218)
Genome locationCG_Chr05:6383462..6386798
RNA-Seq ExpressionClCG05G006400
SyntenyClCG05G006400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049529.1 uncharacterized protein E6C27_scaffold171G007840 [Cucumis melo var. makuwa]5.6e-12889.3Show/hide
Query:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV
        AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQHLYVLIAV
Subjt:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV

Query:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
        SKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS P+ESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
Subjt:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF

Query:  EEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA
        E+QENVRREVLESYHIHSSPPRS    SPPLQPMPPIAREDPVIRHSHH Q+  P +SLLQSTAPFCKLSA
Subjt:  EEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA

XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus]8.9e-13490.97Show/hide
Query:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ
        MGRR+KNMAVTH+DLLPS KSSELGSKMGTFL+ILTILCGLCCFILCLIAE+TRSQVIW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQ
Subjt:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ

Query:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTP+ESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEEQENVRREVLESYHIHSSPPR--SSPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA
        AVRAQRMFE+QENVRREVLESYHIHSSPPR  SSPPLQPMPPIAREDPVIRHS HHQE  P +SLLQSTAPFCKLSA
Subjt:  AVRAQRMFEEQENVRREVLESYHIHSSPPR--SSPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA

XP_008438924.1 PREDICTED: uncharacterized protein LOC103483879 [Cucumis melo]4.9e-13289.64Show/hide
Query:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV
        MGRRRKNM AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVV
Subjt:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV

Query:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS P+ESCLVIKEGLFSAAGVFQLATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA
        TAVRAQRMFE+QENVRREVLESYHIHSSPPRS    SPPLQPMPPIAREDPVIRHSHHHQ+  P +SLLQSTAPFCKLSA
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima]2.4e-12386.99Show/hide
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL
        RRK MAVTHEDL PSR+SSELGSKMGTFL+ILT+LCGLCCFILCL+AESTRSQVIW G DE NNKKGEKRC YSGSGKTPL+CTAS+FLGMAV+MVVQHL
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL

Query:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +P+ESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCK
        RAQR+FE+Q NVRREVLESYHIHSSPPR SPPLQPMPPIAREDPVIRHS HH E+P   LL S+A FCK
Subjt:  RAQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCK

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida]1.4e-13993.09Show/hide
Query:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ
        MGRRRKNMAVTH+DLLPS +SSELGSKMGTFLMILT++CGLCCFILCLIAESTRSQVIW G+DENNK G KRCSYSGSGKTPLLCTAS+FLGMAV+MVVQ
Subjt:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ

Query:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS+P+ESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEEQENVRREVLESYHIHSSPPR-SSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCKLSA
        AVRAQR+FEEQENVRREVLESYHIHSSPPR SSPPLQPMPPIAREDPVIRHSHHHQE P FSLLQSTAPFCKLSA
Subjt:  AVRAQRMFEEQENVRREVLESYHIHSSPPR-SSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCKLSA

TrEMBL top hitse value%identityAlignment
A0A0A0L805 Uncharacterized protein1.7e-13091.11Show/hide
Query:  MAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIA
        MAVTH+DLLPS KSSELGSKMGTFL+ILTILCGLCCFILCLIAE+TRSQVIW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQHLYVLIA
Subjt:  MAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIA

Query:  VSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM
        VSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTP+ESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM
Subjt:  VSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM

Query:  FEEQENVRREVLESYHIHSSPPR--SSPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA
        FE+QENVRREVLESYHIHSSPPR  SSPPLQPMPPIAREDPVIRHS HHQE  P +SLLQSTAPFCKLSA
Subjt:  FEEQENVRREVLESYHIHSSPPR--SSPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA

A0A1S3AY82 uncharacterized protein LOC1034838792.4e-13289.64Show/hide
Query:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV
        MGRRRKNM AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVV
Subjt:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV

Query:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS P+ESCLVIKEGLFSAAGVFQLATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA
        TAVRAQRMFE+QENVRREVLESYHIHSSPPRS    SPPLQPMPPIAREDPVIRHSHHHQ+  P +SLLQSTAPFCKLSA
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA

A0A5A7U7S1 Uncharacterized protein2.7e-12889.3Show/hide
Query:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV
        AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQHLYVLIAV
Subjt:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV

Query:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
        SKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS P+ESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
Subjt:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF

Query:  EEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA
        E+QENVRREVLESYHIHSSPPRS    SPPLQPMPPIAREDPVIRHSHH Q+  P +SLLQSTAPFCKLSA
Subjt:  EEQENVRREVLESYHIHSSPPRS----SPPLQPMPPIAREDPVIRHSHHHQE-TPLFSLLQSTAPFCKLSA

A0A6J1F8A9 uncharacterized protein LOC1114417765.8e-12386.25Show/hide
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL
        R K MAVTHEDL PSR+SSELGSKMGTFL+ILT+LCGLCCFILCL+AESTRSQVIW+G DE NNKKGEKRC YSGSGKTPL+CTAS+FLGMAV+MVVQHL
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL

Query:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +P+ESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCK
        RAQR+FE+Q NVRREVLESYHIHSSPPR SPP+QPMPPIAREDPVIRHS HH E+P   LL S+A FCK
Subjt:  RAQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCK

A0A6J1I898 uncharacterized protein LOC1114708981.2e-12386.99Show/hide
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL
        RRK MAVTHEDL PSR+SSELGSKMGTFL+ILT+LCGLCCFILCL+AESTRSQVIW G DE NNKKGEKRC YSGSGKTPL+CTAS+FLGMAV+MVVQHL
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL

Query:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +P+ESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCK
        RAQR+FE+Q NVRREVLESYHIHSSPPR SPPLQPMPPIAREDPVIRHS HH E+P   LL S+A FCK
Subjt:  RAQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)1.9e-0432.81Show/hide
Query:  FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        +F+S+W++F V E  ++ G +  + H    S+   SC  +++G+F A  VF +AT+ L    YM
Subjt:  FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYM

AT5G49320.1 Protein of unknown function (DUF1218)6.0e-7257.26Show/hide
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLY
        R +  AVTH+DL+P+ K+++L SK G F+ +LTI+ GL CF+LCL AE+TRSQ  W         G K C Y+GSGKTPLLC A +F+G+AV MV  H+Y
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLY

Query:  VLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVR
        +LIAV+ SP   L+ WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHL NWS P+ SCLVI++GLFSAAGVF L TVFLA GLY+TA++
Subjt:  VLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVR

Query:  AQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIARE
        A R+ ++ EN  RE++E+  +++SPPRS  P   M  +ARE
Subjt:  AQRMFEEQENVRREVLESYHIHSSPPRSSPPLQPMPPIARE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAAGAAGAAAAAATATGGCAGTGACACATGAAGATCTTCTACCAAGTCGAAAGAGCTCTGAATTAGGCAGCAAAATGGGGACTTTCCTTATGATTTTGACCAT
CCTTTGTGGCCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGTCTACTCGTTCCCAGGTGATATGGAGGGGTATGGATGAAAATAATAAGAAGGGAGAAAAGAGATGCT
CGTACAGCGGCAGCGGGAAGACACCGCTGCTGTGCACGGCGAGCTCGTTTCTCGGGATGGCAGTGATAATGGTGGTGCAACATTTGTATGTGTTGATTGCAGTGAGTAAG
TCACCGCCTCCTGCTCTCATTTCTTGGGATCCTTCTTTAGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTGGG
AGAAATTTTGTTGTTAATAGGATTGAGTGTGGAGTCGGGGCATTTGAACAACTGGTCAACTCCAAGAGAAAGCTGCTTGGTGATCAAAGAAGGTTTGTTTTCAGCTGCCG
GAGTTTTTCAATTGGCCACGGTCTTCCTCGCCGCCGGCCTCTACATGACGGCGGTGCGAGCACAGAGAATGTTTGAAGAGCAAGAAAATGTGAGAAGAGAGGTGCTGGAA
AGTTACCATATCCACAGTTCCCCGCCACGGTCGTCGCCGCCGCTGCAGCCAATGCCGCCGATTGCAAGAGAAGACCCTGTAATCAGACATAGCCACCACCATCAAGAGAC
TCCCCTTTTTTCCCTGCTTCAATCTACTGCCCCTTTTTGCAAGCTCTCTGCTTGA
mRNA sequenceShow/hide mRNA sequence
GTTCTATCTCTTCTTTCTCTCTTAATATAATGTTTTTAAGAACACAAACTGCCATTTTGGCCAAAGCAAACCCACATCTTAATGGGAAGAAGAAGAAAAAATATGGCAGT
GACACATGAAGATCTTCTACCAAGTCGAAAGAGCTCTGAATTAGGCAGCAAAATGGGGACTTTCCTTATGATTTTGACCATCCTTTGTGGCCTATGTTGCTTCATTCTTT
GCCTCATTGCTGAGTCTACTCGTTCCCAGGTGATATGGAGGGGTATGGATGAAAATAATAAGAAGGGAGAAAAGAGATGCTCGTACAGCGGCAGCGGGAAGACACCGCTG
CTGTGCACGGCGAGCTCGTTTCTCGGGATGGCAGTGATAATGGTGGTGCAACATTTGTATGTGTTGATTGCAGTGAGTAAGTCACCGCCTCCTGCTCTCATTTCTTGGGA
TCCTTCTTTAGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTGGGAGAAATTTTGTTGTTAATAGGATTGAGTG
TGGAGTCGGGGCATTTGAACAACTGGTCAACTCCAAGAGAAAGCTGCTTGGTGATCAAAGAAGGTTTGTTTTCAGCTGCCGGAGTTTTTCAATTGGCCACGGTCTTCCTC
GCCGCCGGCCTCTACATGACGGCGGTGCGAGCACAGAGAATGTTTGAAGAGCAAGAAAATGTGAGAAGAGAGGTGCTGGAAAGTTACCATATCCACAGTTCCCCGCCACG
GTCGTCGCCGCCGCTGCAGCCAATGCCGCCGATTGCAAGAGAAGACCCTGTAATCAGACATAGCCACCACCATCAAGAGACTCCCCTTTTTTCCCTGCTTCAATCTACTG
CCCCTTTTTGCAAGCTCTCTGCTTGA
Protein sequenceShow/hide protein sequence
MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAVSK
SPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPRESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFEEQENVRREVLE
SYHIHSSPPRSSPPLQPMPPIAREDPVIRHSHHHQETPLFSLLQSTAPFCKLSA