; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010751 (gene) of Chayote v1 genome

Gene IDSed0010751
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG02:48633605..48636436
RNA-Seq ExpressionSed0010751
SyntenySed0010751
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus]8.6e-12186.18Show/hide
Query:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQ
        MGRR KNMAVTH+DL PSPKSS+LGSKMGTFLIILTIL GLCCFILCLIAE+TRS VIW   ++ +K  ++RCSYSGSGKTPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQ

Query:  HLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLY+LIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHL NWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRLSEQQENVRREVLESYHIHSSPPQ---SPPLQPMPPIAREDPVIRHS-HHQD-SPFLLLLSSTASFSKL
        AVRAQR+ EQQENVRREVLESYHIHSSPP+   SPPLQPMPPIAREDPVIRHS HHQ+ +PF  LL STA F KL
Subjt:  AVRAQRLSEQQENVRREVLESYHIHSSPPQ---SPPLQPMPPIAREDPVIRHS-HHQD-SPFLLLLSSTASFSKL

XP_022934645.1 uncharacterized protein LOC111441776 [Cucurbita moschata]9.1e-12386.89Show/hide
Query:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHL
        RTK MAVTHEDLHPS +SS+LGSKMGTFLIILT+L GLCCFILCL+AESTRS VIW   ++  +K+GEKRC YSGSGKTPLVCTASAFLGMAVMMVVQHL
Subjt:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHL

Query:  YLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        Y+LIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHLK+W SPKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK
        RAQRL E Q NVRREVLESYHIHSSPP+SPP+QPMPPIAREDPVIRHSHH +SPFL LL S+A+F K
Subjt:  RAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima]5.4e-12387.04Show/hide
Query:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVV
        MGRR K MAVTHEDLHPS +SS+LGSKMGTFLIILT+L GLCCFILCL+AESTRS VIW   ++  +K+GEKRC YSGSGKTPLVCTASAFLGMAVMMVV
Subjt:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVV

Query:  QHLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLY+LIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHLK+W SPKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK
        TAVRAQRL E Q NVRREVLESYHIHSSPP+SPPLQPMPPIAREDPVIRHSHH +SPFL LL S+A+F K
Subjt:  TAVRAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK

XP_023540160.1 uncharacterized protein LOC111800614 [Cucurbita pepo subsp. pepo]2.0e-12286.52Show/hide
Query:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHL
        RTK MAVTHEDLHPS +SS+LGSKMGTFLIILT+L GLCCFILCL+AESTRS VIW   ++  +K+GEKRC YSGSG+TPLVCTASAFLGMAVMMVVQHL
Subjt:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHL

Query:  YLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        Y+LIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHLK+W SPKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK
        RAQRL E Q NVRREVLESYHIHSSPP+SPP+QPMPPIAREDPVIRHSHH +SPFL LL S+A+F K
Subjt:  RAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida]5.7e-12586.81Show/hide
Query:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQ
        MGRR KNMAVTH+DL PSP+SS+LGSKMGTFL+ILT++ GLCCFILCLIAESTRS VIW   ++ +K G KRCSYSGSGKTPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQ

Query:  HLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        HLY+LIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHL NWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
Subjt:  HLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRLSEQQENVRREVLESYHIHSSPPQ--SPPLQPMPPIAREDPVIRHS-HHQDSPFLLLLSSTASFSKL
        AVRAQR+ E+QENVRREVLESYHIHSSPP+  SPPLQPMPPIAREDPVIRHS HHQ++PF  LL STA F KL
Subjt:  AVRAQRLSEQQENVRREVLESYHIHSSPPQ--SPPLQPMPPIAREDPVIRHS-HHQDSPFLLLLSSTASFSKL

TrEMBL top hitse value%identityAlignment
A0A0A0L805 Uncharacterized protein1.2e-11786.19Show/hide
Query:  MAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLYLLIA
        MAVTH+DL PSPKSS+LGSKMGTFLIILTIL GLCCFILCLIAE+TRS VIW   ++ +K  ++RCSYSGSGKTPL+CTASAFLGMAVMMVVQHLY+LIA
Subjt:  MAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLYLLIA

Query:  VSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRL
        VSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHL NWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQR+
Subjt:  VSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRL

Query:  SEQQENVRREVLESYHIHSSPPQ---SPPLQPMPPIAREDPVIRHS-HHQD-SPFLLLLSSTASFSKL
         EQQENVRREVLESYHIHSSPP+   SPPLQPMPPIAREDPVIRHS HHQ+ +PF  LL STA F KL
Subjt:  SEQQENVRREVLESYHIHSSPPQ---SPPLQPMPPIAREDPVIRHS-HHQD-SPFLLLLSSTASFSKL

A0A1S3AY82 uncharacterized protein LOC1034838791.7e-11985.25Show/hide
Query:  MGRRTKNM-AVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVV
        MGRR KNM AVTH+DL PSPKSS+LGSK+GTFLIILTIL GLCCFILCLIAESTRS  IW   ++ +K  ++RCSYSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRTKNM-AVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVV

Query:  QHLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLY+LIAVSKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHL NWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
Subjt:  QHLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRLSEQQENVRREVLESYHIHSSPPQ-----SPPLQPMPPIAREDPVIRHS-HHQD-SPFLLLLSSTASFSKL
        TAVRAQR+ EQQENVRREVLESYHIHSSPP+     SPPLQPMPPIAREDPVIRHS HHQD +PF  LL STA F KL
Subjt:  TAVRAQRLSEQQENVRREVLESYHIHSSPPQ-----SPPLQPMPPIAREDPVIRHS-HHQD-SPFLLLLSSTASFSKL

A0A5A7U7S1 Uncharacterized protein2.1e-11785.13Show/hide
Query:  AVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLYLLIAV
        AVTH+DL PSPKSS+LGSK+GTFLIILTIL GLCCFILCLIAESTRS  IW   ++ +K  ++RCSYSGSGKTPL+CTASAFLGMAVMMVVQHLY+LIAV
Subjt:  AVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLYLLIAV

Query:  SKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRLS
        SKS PPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHL NWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQR+ 
Subjt:  SKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRLS

Query:  EQQENVRREVLESYHIHSSPPQ-----SPPLQPMPPIAREDPVIRHSHHQD--SPFLLLLSSTASFSKL
        EQQENVRREVLESYHIHSSPP+     SPPLQPMPPIAREDPVIRHSHHQ   +PF  LL STA F KL
Subjt:  EQQENVRREVLESYHIHSSPPQ-----SPPLQPMPPIAREDPVIRHSHHQD--SPFLLLLSSTASFSKL

A0A6J1F8A9 uncharacterized protein LOC1114417764.4e-12386.89Show/hide
Query:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHL
        RTK MAVTHEDLHPS +SS+LGSKMGTFLIILT+L GLCCFILCL+AESTRS VIW   ++  +K+GEKRC YSGSGKTPLVCTASAFLGMAVMMVVQHL
Subjt:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHL

Query:  YLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        Y+LIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHLK+W SPKESCLVIKEGLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK
        RAQRL E Q NVRREVLESYHIHSSPP+SPP+QPMPPIAREDPVIRHSHH +SPFL LL S+A+F K
Subjt:  RAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK

A0A6J1I898 uncharacterized protein LOC1114708982.6e-12387.04Show/hide
Query:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVV
        MGRR K MAVTHEDLHPS +SS+LGSKMGTFLIILT+L GLCCFILCL+AESTRS VIW   ++  +K+GEKRC YSGSGKTPLVCTASAFLGMAVMMVV
Subjt:  MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTND-TDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVV

Query:  QHLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLY+LIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHLK+W SPKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYLLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK
        TAVRAQRL E Q NVRREVLESYHIHSSPP+SPPLQPMPPIAREDPVIRHSHH +SPFL LL S+A+F K
Subjt:  TAVRAQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G49320.1 Protein of unknown function (DUF1218)1.9e-7359.17Show/hide
Query:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLY
        RT+  AVTH+DL P+PK++ L SK G F+ +LTI+ GL CF+LCL AE+TRS   W         G K C Y+GSGKTPL+C A AF+G+AV MV  H+Y
Subjt:  RTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLY

Query:  LLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVR
        LLIAV+ SP   L+ WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHLKNWS PK SCLVI++GLFSAAGVF L TVFLA GLY+TA++
Subjt:  LLIAVSKSPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVR

Query:  AQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIARE
        A R+S+  EN  RE++E+  +++SPP+S P   M  +ARE
Subjt:  AQRLSEQQENVRREVLESYHIHSSPPQSPPLQPMPPIARE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAAGGACAAAAAATATGGCTGTGACACATGAGGATCTTCATCCAAGTCCAAAGAGCTCTCAATTGGGTAGCAAAATGGGCACTTTTCTTATCATTTTGACCAT
TCTTTCTGGCCTTTGTTGCTTCATTCTCTGCCTCATCGCCGAGTCCACTCGTTCTCTGGTGATATGGACGGATACCAACGACACTGATAAGAGGGGAGAAAAGAGATGTT
CGTACAGCGGTAGCGGAAAGACGCCGCTGGTATGCACGGCGAGCGCGTTTCTCGGGATGGCGGTGATGATGGTGGTGCAACATTTGTATTTGCTGATTGCAGTGAGTAAG
TCGCCGCCTCCAGCTCTCATTGCTTGGGATCCTTCGTTTGCCACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTTTTCGTCTCAACATGGATAAGTTTTGCAGTGGG
AGAAATATTACTATTAATTGGATTGAGCGTAGAATCAGGGCATCTCAAAAACTGGTCCAGTCCAAAAGAAAGCTGTTTGGTGATCAAAGAAGGTTTGTTTTCGGCCGCCG
GAGTTTTTCAACTCGCGACGGTGTTCCTCGCGGCCGGCCTCTACATGACGGCGGTGCGAGCACAGAGATTGTCTGAACAGCAAGAAAACGTTAGGAGAGAAGTGCTGGAA
AGCTACCATATCCACAGTTCACCGCCCCAGTCACCGCCGCTGCAGCCTATGCCGCCCATTGCGAGAGAGGACCCTGTAATCAGACATAGCCATCACCAAGACTCTCCCTT
TTTGCTGCTGCTCTCATCTACTGCCTCTTTCTCCAAACTCCCTCGTTGA
mRNA sequenceShow/hide mRNA sequence
AAGAATTTTAACAACAAATAAAATCTCATTCATTTGATTGAGATCTTGAATGGGAAGAAGGACAAAAAATATGGCTGTGACACATGAGGATCTTCATCCAAGTCCAAAGA
GCTCTCAATTGGGTAGCAAAATGGGCACTTTTCTTATCATTTTGACCATTCTTTCTGGCCTTTGTTGCTTCATTCTCTGCCTCATCGCCGAGTCCACTCGTTCTCTGGTG
ATATGGACGGATACCAACGACACTGATAAGAGGGGAGAAAAGAGATGTTCGTACAGCGGTAGCGGAAAGACGCCGCTGGTATGCACGGCGAGCGCGTTTCTCGGGATGGC
GGTGATGATGGTGGTGCAACATTTGTATTTGCTGATTGCAGTGAGTAAGTCGCCGCCTCCAGCTCTCATTGCTTGGGATCCTTCGTTTGCCACTTCCAAATCTCTAACCT
TTCAAGCTGCTTTCTTTTTCGTCTCAACATGGATAAGTTTTGCAGTGGGAGAAATATTACTATTAATTGGATTGAGCGTAGAATCAGGGCATCTCAAAAACTGGTCCAGT
CCAAAAGAAAGCTGTTTGGTGATCAAAGAAGGTTTGTTTTCGGCCGCCGGAGTTTTTCAACTCGCGACGGTGTTCCTCGCGGCCGGCCTCTACATGACGGCGGTGCGAGC
ACAGAGATTGTCTGAACAGCAAGAAAACGTTAGGAGAGAAGTGCTGGAAAGCTACCATATCCACAGTTCACCGCCCCAGTCACCGCCGCTGCAGCCTATGCCGCCCATTG
CGAGAGAGGACCCTGTAATCAGACATAGCCATCACCAAGACTCTCCCTTTTTGCTGCTGCTCTCATCTACTGCCTCTTTCTCCAAACTCCCTCGTTGAATGAAAAGGCGT
TCGTAACGGTTATAAACTTTATAGTTTAAGGGTGGTTTATCTAAATTTCTTTTTTTTTTGGTGAAAGGATACAATCATTCTTTATTCTTCTCTGT
Protein sequenceShow/hide protein sequence
MGRRTKNMAVTHEDLHPSPKSSQLGSKMGTFLIILTILSGLCCFILCLIAESTRSLVIWTDTNDTDKRGEKRCSYSGSGKTPLVCTASAFLGMAVMMVVQHLYLLIAVSK
SPPPALIAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLKNWSSPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRLSEQQENVRREVLE
SYHIHSSPPQSPPLQPMPPIAREDPVIRHSHHQDSPFLLLLSSTASFSKLPR