CuGenDBv2

CaUC05G086540 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC05G086540
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Protein of unknown function (DUF1218)
Genome location: Ciama_Chr05:6644493..6647113
RNA-Seq Expression: CaUC05G086540
Synteny: CaUC05G086540
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
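
The genome location field above packs the sequence name and coordinates into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("Ciama_Chr05:6644493..6647113")
print(seqid, start, end, length)
```

Under that assumption the gene spans 2621 bp on Ciama_Chr05.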


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049529.1 uncharacterized protein E6C27_scaffold171G007840 [Cucumis melo var. makuwa] (e value 3.3e-128, identity 89.67%)
Query:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV
        AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQHLYVLIAV
Subjt:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV

Query:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
        SKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
Subjt:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF

Query:  EEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA
        E+QENVRREVLESYHIHSSPPRS   +SPPLQPMPPIAREDPVIRHSHH Q   P +SLLQSTAPFCKLSA
Subjt:  EEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA

XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus] (e value 6.8e-134, identity 91.34%)
Query:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ
        MGRR+KNMAVTH+DLLPS KSSELGSKMGTFL+ILTILCGLCCFILCLIAE+TRSQVIW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQ
Subjt:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ

Query:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEEQENVRREVLESYHIHSSPPRS-SSPPLQPMPPIAREDPVIRHSHHHQ-GTPLFSLLQSTAPFCKLSA
        AVRAQRMFE+QENVRREVLESYHIHSSPPRS SSPPLQPMPPIAREDPVIRHS HHQ   P +SLLQSTAPFCKLSA
Subjt:  AVRAQRMFEEQENVRREVLESYHIHSSPPRS-SSPPLQPMPPIAREDPVIRHSHHHQ-GTPLFSLLQSTAPFCKLSA

XP_008438924.1 PREDICTED: uncharacterized protein LOC103483879 [Cucumis melo] (e value 2.9e-132, identity 90%)
Query:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV
        MGRRRKNM AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVV
Subjt:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV

Query:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA
        TAVRAQRMFE+QENVRREVLESYHIHSSPPRS   +SPPLQPMPPIAREDPVIRHSHHHQ   P +SLLQSTAPFCKLSA
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima] (e value 4.2e-123, identity 86.67%)
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL
        RRK MAVTHEDL PSR+SSELGSKMGTFL+ILT+LCGLCCFILCL+AESTRSQVIW G DE NNKKGEKRC YSGSGKTPL+CTAS+FLGMAV+MVVQHL
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL

Query:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCK
        RAQR+FE+Q NVRREVLESYHIHSSPPR  SPPLQPMPPIAREDPVIRHSHH + +P   LL S+A FCK
Subjt:  RAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCK

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida] (e value 5.8e-141, identity 93.45%)
Query:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ
        MGRRRKNMAVTH+DLLPS +SSELGSKMGTFLMILT++CGLCCFILCLIAESTRSQVIW G+DENNK G KRCSYSGSGKTPLLCTAS+FLGMAV+MVVQ
Subjt:  MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQ

Query:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLYVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCKLSA
        AVRAQR+FEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQ  P FSLLQSTAPFCKLSA
Subjt:  AVRAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCKLSA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L805 Uncharacterized protein (e value 1.3e-130, identity 91.48%)
Query:  MAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIA
        MAVTH+DLLPS KSSELGSKMGTFL+ILTILCGLCCFILCLIAE+TRSQVIW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQHLYVLIA
Subjt:  MAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIA

Query:  VSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM
        VSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM
Subjt:  VSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM

Query:  FEEQENVRREVLESYHIHSSPPRS-SSPPLQPMPPIAREDPVIRHSHHHQ-GTPLFSLLQSTAPFCKLSA
        FE+QENVRREVLESYHIHSSPPRS SSPPLQPMPPIAREDPVIRHS HHQ   P +SLLQSTAPFCKLSA
Subjt:  FEEQENVRREVLESYHIHSSPPRS-SSPPLQPMPPIAREDPVIRHSHHHQ-GTPLFSLLQSTAPFCKLSA

A0A1S3AY82 uncharacterized protein LOC103483879 (e value 1.4e-132, identity 90%)
Query:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV
        MGRRRKNM AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVV
Subjt:  MGRRRKNM-AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVV

Query:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA
        TAVRAQRMFE+QENVRREVLESYHIHSSPPRS   +SPPLQPMPPIAREDPVIRHSHHHQ   P +SLLQSTAPFCKLSA
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA

A0A5A7U7S1 Uncharacterized protein (e value 1.6e-128, identity 89.67%)
Query:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV
        AVTH+DLLPS KSSELGSK+GTFL+ILTILCGLCCFILCLIAESTRSQ IW G+DENNK+ ++RCSYSGSGKTPLLCTAS+FLGMAV+MVVQHLYVLIAV
Subjt:  AVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAV

Query:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
        SKS PPALI+WDPS ATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF
Subjt:  SKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMF

Query:  EEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA
        E+QENVRREVLESYHIHSSPPRS   +SPPLQPMPPIAREDPVIRHSHH Q   P +SLLQSTAPFCKLSA
Subjt:  EEQENVRREVLESYHIHSSPPRS---SSPPLQPMPPIAREDPVIRHSHHHQG-TPLFSLLQSTAPFCKLSA

A0A6J1F8A9 uncharacterized protein LOC111441776 (e value 1.0e-122, identity 85.93%)
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL
        R K MAVTHEDL PSR+SSELGSKMGTFL+ILT+LCGLCCFILCL+AESTRSQVIW+G DE NNKKGEKRC YSGSGKTPL+CTAS+FLGMAV+MVVQHL
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL

Query:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCK
        RAQR+FE+Q NVRREVLESYHIHSSPPR  SPP+QPMPPIAREDPVIRHSHH + +P   LL S+A FCK
Subjt:  RAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCK

A0A6J1I898 uncharacterized protein LOC111470898 (e value 2.0e-123, identity 86.67%)
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL
        RRK MAVTHEDL PSR+SSELGSKMGTFL+ILT+LCGLCCFILCL+AESTRSQVIW G DE NNKKGEKRC YSGSGKTPL+CTAS+FLGMAV+MVVQHL
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDE-NNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHL

Query:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALI+WDPS ATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCK
        RAQR+FE+Q NVRREVLESYHIHSSPPR  SPPLQPMPPIAREDPVIRHSHH + +P   LL S+A FCK
Subjt:  RAQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G13380.1 Protein of unknown function (DUF1218) (e value 1.9e-04, identity 32.81%)
Query:  FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        +F+S+W++F V E  ++ G +  + H    S+   SC  +++G+F A  VF +AT+ L    YM
Subjt:  FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

AT5G49320.1 Protein of unknown function (DUF1218) (e value 4.6e-72, identity 57.44%)
Query:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLY
        R +  AVTH+DL+P+ K+++L SK G F+ +LTI+ GL CF+LCL AE+TRSQ  W         G K C Y+GSGKTPLLC A +F+G+AV MV  H+Y
Subjt:  RRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLY

Query:  VLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVR
        +LIAV+ SP   L+ WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHL NWS PK SCLVI++GLFSAAGVF L TVFLA GLY+TA++
Subjt:  VLIAVSKSPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVR

Query:  AQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIARE
        A R+ ++ EN  RE++E+  +++SPPRS   P   M  +ARE
Subjt:  AQRMFEEQENVRREVLESYHIHSSPPRSSSPPLQPMPPIARE
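
The % identity values reported for each hit can be cross-checked against the alignment midline. The sketch below assumes the midline follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is the number of identical columns divided by the alignment length. The function name `midline_identity` is hypothetical; the query and midline strings are copied from the AT1G13380.1 alignment above.

```python
def midline_identity(query, midline):
    """Return (identical_columns, percent_identity) for one alignment block.

    Assumption: letters in the midline mark identical residues, and the
    denominator is the full alignment length (gap columns included).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, 100.0 * identities / len(query)

# Query and midline rows of the AT1G13380.1 hit (64 alignment columns).
query = "FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM"
midline = "+F+S+W++F V E  ++ G +  + H    S+   SC  +++G+F A  VF +AT+ L    YM"

identities, percent = midline_identity(query, midline)
print(identities, round(percent, 2))
```

For this block the count is 21 identical columns out of 64, which agrees with the 32.81 % identity listed for AT1G13380.1. Alignments that wrap over several rows would need their rows concatenated before counting.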


Sequences
CDS sequence
ATGGGAAGAAGAAGAAAAAATATGGCAGTGACACATGAAGATCTTCTACCAAGTCGAAAGAGCTCTGAATTAGGCAGCAAAATGGGGACTTTCCTTATGATTTTGACCAT
CCTTTGTGGCCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGTCTACTCGTTCCCAGGTGATATGGAGGGGTATGGATGAAAATAATAAGAAGGGAGAAAAGAGATGCT
CGTACAGCGGCAGCGGGAAGACACCGCTGCTGTGCACGGCGAGCTCGTTTCTCGGGATGGCAGTGATAATGGTGGTGCAACATTTGTATGTGTTGATTGCAGTGAGTAAG
TCACCGCCTCCTGCTCTCATTTCTTGGGATCCTTCTTTAGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTGGG
AGAAATTTTGTTGTTAATAGGATTGAGTGTGGAGTCGGGGCATTTGAACAACTGGTCAACTCCAAAAGAAAGCTGCTTGGTGATCAAAGAAGGTTTGTTTTCAGCTGCCG
GAGTTTTTCAATTGGCCACGGTCTTCCTCGCCGCCGGCCTCTACATGACGGCGGTGCGAGCACAGAGAATGTTTGAAGAGCAAGAAAATGTGAGAAGAGAGGTGCTGGAA
AGTTACCATATCCACAGTTCGCCGCCACGGTCGTCGTCGCCGCCGCTGCAGCCAATGCCGCCGATTGCAAGAGAAGACCCTGTAATCAGACATAGCCACCACCATCAAGG
GACTCCCCTTTTTTCCCTGCTTCAATCTACTGCCCCTTTTTGCAAGCTCTCTGCTTGA
mRNA sequence
ATGGGAAGAAGAAGAAAAAATATGGCAGTGACACATGAAGATCTTCTACCAAGTCGAAAGAGCTCTGAATTAGGCAGCAAAATGGGGACTTTCCTTATGATTTTGACCAT
CCTTTGTGGCCTATGTTGCTTCATTCTTTGCCTCATTGCTGAGTCTACTCGTTCCCAGGTGATATGGAGGGGTATGGATGAAAATAATAAGAAGGGAGAAAAGAGATGCT
CGTACAGCGGCAGCGGGAAGACACCGCTGCTGTGCACGGCGAGCTCGTTTCTCGGGATGGCAGTGATAATGGTGGTGCAACATTTGTATGTGTTGATTGCAGTGAGTAAG
TCACCGCCTCCTGCTCTCATTTCTTGGGATCCTTCTTTAGCAACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTGGG
AGAAATTTTGTTGTTAATAGGATTGAGTGTGGAGTCGGGGCATTTGAACAACTGGTCAACTCCAAAAGAAAGCTGCTTGGTGATCAAAGAAGGTTTGTTTTCAGCTGCCG
GAGTTTTTCAATTGGCCACGGTCTTCCTCGCCGCCGGCCTCTACATGACGGCGGTGCGAGCACAGAGAATGTTTGAAGAGCAAGAAAATGTGAGAAGAGAGGTGCTGGAA
AGTTACCATATCCACAGTTCGCCGCCACGGTCGTCGTCGCCGCCGCTGCAGCCAATGCCGCCGATTGCAAGAGAAGACCCTGTAATCAGACATAGCCACCACCATCAAGG
GACTCCCCTTTTTTCCCTGCTTCAATCTACTGCCCCTTTTTGCAAGCTCTCTGCTTGA
Protein sequence
MGRRRKNMAVTHEDLLPSRKSSELGSKMGTFLMILTILCGLCCFILCLIAESTRSQVIWRGMDENNKKGEKRCSYSGSGKTPLLCTASSFLGMAVIMVVQHLYVLIAVSK
SPPPALISWDPSLATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFEEQENVRREVLE
SYHIHSSPPRSSSPPLQPMPPIAREDPVIRHSHHHQGTPLFSLLQSTAPFCKLSA
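
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGAAGAAGAAGAAAAAATATGGCAGTG"  # first 30 nt of the CDS
cds_end = "TGCAAGCTCTCTGCTTGA"                # last 18 nt, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGRRRKNMAV` and `CKLSA`, matching the first ten and last five residues of the listed protein. Running `translate` on the full CDS is the natural next step for checking the whole sequence.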