; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G003350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G003350
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUmuC domain-containing protein
Genome locationCmo_Chr16:1551786..1556776
RNA-Seq ExpressionCmoCh16G003350
SyntenyCmoCh16G003350
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576912.1 hypothetical protein SDJN03_24486, partial [Cucurbita argyrosperma subsp. sororia]3.0e-22493.59Show/hide
Query:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
        MSITAFKLPPSMELTVPIP IPQNF PIHRALTKFHV SSSSSKSPNFQLPLPSKSPKPPK LI PRGEEFPTMAEI+AAGEAQNISLRLQTLGPFFRIT
Subjt:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT

Query:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
        AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGC+TAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
Subjt:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS

Query:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSREQNGAKQMSRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHE
        K+GDIGDMLVWGGVGTRMDAPIEALLLKWCSRE+NGAKQMSRREIR+SDSN HRSRFDGE SPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHE
Subjt:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSREQNGAKQMSRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHE

Query:  GQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRGTKHSVNPTEVPRSRSYLQHVKRGNDGQV----GRSFGRSATRERGWWKDSREKEKDNDRASGKAT
        GQDMIPHECSS LDSKPEYGVSKG NRNSDRRD GTKHS+NPTEVPRSRSY QH KRGNDGQV    GRSFGRSATRERGWWKDSREKEKDNDRASGK T
Subjt:  GQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRGTKHSVNPTEVPRSRSYLQHVKRGNDGQV----GRSFGRSATRERGWWKDSREKEKDNDRASGKAT

Query:  AYNSQQRDEMPQAVRDDSHRDESSKQEVDAPTSAWKK
        AYNSQQRDEMPQAVRDDSHRDES K E DAPTSA K+
Subjt:  AYNSQQRDEMPQAVRDDSHRDESSKQEVDAPTSAWKK

KAG7014938.1 hypothetical protein SDJN02_22569, partial [Cucurbita argyrosperma subsp. argyrosperma]8.9e-12096.92Show/hide
Query:  FKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLE
        FKLPPSMELTVPIP IPQNF PIHRALTKFHV SSSSSKSPNFQLPLPSKSPKPPK LI PRGEEFPTMAEI+AAGEAQNISLRLQTLGPFFRITAKSLE
Subjt:  FKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLE

Query:  SQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDI
        SQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSK+GDI
Subjt:  SQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDI

Query:  GDMLVWGGVGTRMDAPIEALLLKWCSR
        GDMLVWGGVGTRMDAPIEALLLKWCSR
Subjt:  GDMLVWGGVGTRMDAPIEALLLKWCSR

XP_022922749.1 uncharacterized protein LOC111430648 [Cucurbita moschata]1.7e-126100Show/hide
Query:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
        MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
Subjt:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT

Query:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
        AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
Subjt:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS

Query:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
        KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
Subjt:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR

XP_022984679.1 uncharacterized protein LOC111482888 [Cucurbita maxima]1.1e-12095.26Show/hide
Query:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
        MSIT FKLPPSMELTVPIPRIPQNF PIHRALTKFHV+SSSSSKSPNFQLPLPSK PKPPK LIAPRGEEFPTMAEI+AAGEAQNISLRLQTLGPFFRIT
Subjt:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT

Query:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
        AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGF+SVYEVSGS
Subjt:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS

Query:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
         +GDIGDMLVWGGVGTRMDAPIE LL KWCSR
Subjt:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR

XP_023521663.1 uncharacterized protein LOC111785501 [Cucurbita pepo subsp. pepo]1.2e-12196.55Show/hide
Query:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
        MSITAFK PPSMELTVPIPRIPQNF PIHRALTKFHV SSSSSKSPNFQLPLPSKS KPPK LIAPRGEEFPTMAEI+AAGEAQNISLRLQTLGPFFRIT
Subjt:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT

Query:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
        AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGC+TAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
Subjt:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS

Query:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
        K+GDIGDMLVWGGVGTRMDAPIEALLLKWCSR
Subjt:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR

TrEMBL top hitse value%identityAlignment
A0A1S3BZR1 uncharacterized protein LOC1034948153.4e-9380.89Show/hide
Query:  MEL-TVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGE---EFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLESQ
        MEL T+ IPRIP    PIHR LTK H  +SSSSK PN QLPLPSKS      LI  RG+     PTMAEIVAAGE+QN+SLRLQTLGPFFRITAKSLE++
Subjt:  MEL-TVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGE---EFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLESQ

Query:  REIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDIGD
        REIGKAEGL+RVWL G+ILHLDSIRL RESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKS+YEVSGSK+ DIGD
Subjt:  REIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDIGD

Query:  MLVWGGVGTRMDAPIEALLLKWCSR
        M+VWGG+GTRMDAPIE+LLLKWC+R
Subjt:  MLVWGGVGTRMDAPIEALLLKWCSR

A0A6J1D616 uncharacterized protein LOC1110176374.1e-9481.9Show/hide
Query:  MELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLESQREIG
        ME T  I RIPQNF PI R L+K H +SSSSSK PN QLP PSKSPKP K+    R +  PTM EI+AAG AQN+ LRLQTLGPFFRITAKS ESQREIG
Subjt:  MELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLESQREIG

Query:  KAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDIGDMLVW
        KAEGLIRVW  GRILHLDSIRL RESLGMEKSIFGLGLFIGAVAIRYG+DCGCKTAELLAINDSDLYHSKLVRFYTRIGFKS+YEVSGSK+GD GDMLVW
Subjt:  KAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDIGDMLVW

Query:  GGVGTRMDAPIEALLLKWCSR
        GGVGTRMDA IE LLLKWC+R
Subjt:  GGVGTRMDAPIEALLLKWCSR

A0A6J1E4B2 uncharacterized protein LOC1114306488.1e-127100Show/hide
Query:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
        MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
Subjt:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT

Query:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
        AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
Subjt:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS

Query:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
        KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
Subjt:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR

A0A6J1E9N7 uncharacterized protein LOC1114306493.1e-102100Show/hide
Query:  MSRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHEGQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRGTKHS
        MSRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHEGQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRGTKHS
Subjt:  MSRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHEGQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRGTKHS

Query:  VNPTEVPRSRSYLQHVKRGNDGQVGRSFGRSATRERGWWKDSREKEKDNDRASGKATAYNSQQRDEMPQAVRDDSHRDESSKQEVDAPTSAWKKACI
        VNPTEVPRSRSYLQHVKRGNDGQVGRSFGRSATRERGWWKDSREKEKDNDRASGKATAYNSQQRDEMPQAVRDDSHRDESSKQEVDAPTSAWKKACI
Subjt:  VNPTEVPRSRSYLQHVKRGNDGQVGRSFGRSATRERGWWKDSREKEKDNDRASGKATAYNSQQRDEMPQAVRDDSHRDESSKQEVDAPTSAWKKACI

A0A6J1JB84 uncharacterized protein LOC1114828885.1e-12195.26Show/hide
Query:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT
        MSIT FKLPPSMELTVPIPRIPQNF PIHRALTKFHV+SSSSSKSPNFQLPLPSK PKPPK LIAPRGEEFPTMAEI+AAGEAQNISLRLQTLGPFFRIT
Subjt:  MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRIT

Query:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS
        AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGF+SVYEVSGS
Subjt:  AKSLESQREIGKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGS

Query:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR
         +GDIGDMLVWGGVGTRMDAPIE LL KWCSR
Subjt:  KIGDIGDMLVWGGVGTRMDAPIEALLLKWCSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G19650.1 cyclin-related6.8e-0932.98Show/hide
Query:  SRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKD--RDQKHGLQHEGQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRG---
        +RRE R+SD  RHRSR D EPSPKRSRRD K   E++       V D  D  ++ +  L+    ++  H                G+ ++S+++  G   
Subjt:  SRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKD--RDQKHGLQHEGQDMIPHECSSPLDSKPEYGVSKGANRNSDRRDRG---

Query:  -TKHSVNPTEVPRSRSYLQHVKRGNDGQVGRSFGRSATRERGWWKDSREKEKDNDRASGKATAYNSQQRDEMPQAVRDDSHRDESSKQ
         TK + + ++VPRSR Y QH  R +DG+V     R  T  RG W+ SR  ++ N RA       +  ++DE   + R D  R+    Q
Subjt:  -TKHSVNPTEVPRSRSYLQHVKRGNDGQVGRSFGRSATRERGWWKDSREKEKDNDRASGKATAYNSQQRDEMPQAVRDDSHRDESSKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATCACAGCCTTCAAACTCCCGCCATCAATGGAGCTCACAGTCCCAATTCCCCGAATCCCCCAAAATTTCTTCCCAATTCACAGAGCCCTAACTAAATTCCACGT
CAATTCCTCATCCTCATCCAAAAGCCCTAATTTCCAATTACCGCTTCCATCAAAATCGCCCAAGCCACCGAAAACCCTAATCGCTCCAAGAGGCGAAGAATTTCCGACGA
TGGCGGAGATCGTGGCGGCCGGCGAGGCCCAGAATATAAGCCTCCGGCTGCAAACGTTGGGGCCGTTTTTCCGAATAACGGCGAAAAGTTTGGAATCGCAGCGAGAGATT
GGGAAGGCCGAGGGATTGATAAGAGTGTGGTTGCGAGGGAGGATTCTTCACCTGGATTCAATTCGATTGAAGCGAGAGAGCTTGGGAATGGAGAAATCGATCTTTGGACT
CGGATTGTTCATCGGCGCTGTTGCGATTCGGTATGGGTATGACTGTGGGTGCAAGACGGCGGAGTTATTGGCCATCAATGACTCCGATCTTTACCATTCTAAGCTCGTTA
GATTCTACACAAGGATCGGGTTCAAGAGTGTGTATGAAGTGAGTGGTTCGAAGATAGGAGACATTGGGGACATGTTGGTGTGGGGAGGGGTTGGGACTAGAATGGATGCT
CCCATTGAAGCCCTTCTACTTAAATGGTGTTCACGAGAACAAAACGGCGCGAAGCAAATGTCGCGTCGTGAAATTCGGGAATCCGATTCCAATCGTCATCGTTCCAGATT
CGATGGAGAACCCAGTCCCAAGAGGTCAAGGAGGGACAGGAAAACATTAGAGGAGAAGTTATCTAGAAACTCTAGTTCTCATGTTGAAGATAATAAAGATCGGGATCAGA
AACATGGGCTTCAACATGAGGGGCAAGATATGATACCCCATGAGTGTTCATCACCACTTGATTCTAAGCCAGAATATGGGGTTAGCAAAGGAGCAAACAGGAACAGTGAT
AGGAGGGATAGAGGGACAAAACATTCAGTGAATCCCACTGAAGTACCACGATCAAGATCTTATTTACAGCATGTCAAACGTGGTAATGACGGGCAAGTTGGTCGAAGCTT
TGGTCGGAGTGCAACTAGGGAGCGTGGTTGGTGGAAGGACTCAAGGGAGAAGGAGAAGGATAACGATAGGGCATCTGGGAAGGCAACAGCTTATAATTCACAGCAAAGAG
ATGAGATGCCACAAGCCGTGAGAGACGATAGCCACCGTGATGAGTCCTCGAAACAGGAGGTTGATGCACCCACATCTGCTTGGAAAAAGGCCTGCATTTAG
mRNA sequenceShow/hide mRNA sequence
CCATGTCGACATGTCAATCACAGCCTTCAAACTCCCGCCATCAATGGAGCTCACAGTCCCAATTCCCCGAATCCCCCAAAATTTCTTCCCAATTCACAGAGCCCTAACTA
AATTCCACGTCAATTCCTCATCCTCATCCAAAAGCCCTAATTTCCAATTACCGCTTCCATCAAAATCGCCCAAGCCACCGAAAACCCTAATCGCTCCAAGAGGCGAAGAA
TTTCCGACGATGGCGGAGATCGTGGCGGCCGGCGAGGCCCAGAATATAAGCCTCCGGCTGCAAACGTTGGGGCCGTTTTTCCGAATAACGGCGAAAAGTTTGGAATCGCA
GCGAGAGATTGGGAAGGCCGAGGGATTGATAAGAGTGTGGTTGCGAGGGAGGATTCTTCACCTGGATTCAATTCGATTGAAGCGAGAGAGCTTGGGAATGGAGAAATCGA
TCTTTGGACTCGGATTGTTCATCGGCGCTGTTGCGATTCGGTATGGGTATGACTGTGGGTGCAAGACGGCGGAGTTATTGGCCATCAATGACTCCGATCTTTACCATTCT
AAGCTCGTTAGATTCTACACAAGGATCGGGTTCAAGAGTGTGTATGAAGTGAGTGGTTCGAAGATAGGAGACATTGGGGACATGTTGGTGTGGGGAGGGGTTGGGACTAG
AATGGATGCTCCCATTGAAGCCCTTCTACTTAAATGGTGTTCACGAGAACAAAACGGCGCGAAGCAAATGTCGCGTCGTGAAATTCGGGAATCCGATTCCAATCGTCATC
GTTCCAGATTCGATGGAGAACCCAGTCCCAAGAGGTCAAGGAGGGACAGGAAAACATTAGAGGAGAAGTTATCTAGAAACTCTAGTTCTCATGTTGAAGATAATAAAGAT
CGGGATCAGAAACATGGGCTTCAACATGAGGGGCAAGATATGATACCCCATGAGTGTTCATCACCACTTGATTCTAAGCCAGAATATGGGGTTAGCAAAGGAGCAAACAG
GAACAGTGATAGGAGGGATAGAGGGACAAAACATTCAGTGAATCCCACTGAAGTACCACGATCAAGATCTTATTTACAGCATGTCAAACGTGGTAATGACGGGCAAGTTG
GTCGAAGCTTTGGTCGGAGTGCAACTAGGGAGCGTGGTTGGTGGAAGGACTCAAGGGAGAAGGAGAAGGATAACGATAGGGCATCTGGGAAGGCAACAGCTTATAATTCA
CAGCAAAGAGATGAGATGCCACAAGCCGTGAGAGACGATAGCCACCGTGATGAGTCCTCGAAACAGGAGGTTGATGCACCCACATCTGCTTGGAAAAAGGCCTGCATTTA
GGGAGAAGAAGATCACAGTAGGCGATGAAAATGGTGAGAAAGCAGCTACTGTATCAGAGATTCAGATATCGAGTGATCCACACCAGCCTCGGGATGGAAGGGAACGAAGA
GAAGAAAGAGGCTGTCATACCCTTTATTTCAAAAATAAGAACTTGAAGGATACTTTTGAAAGAAAATTAACTAATTTCCTTCTAAATTAATGGTAAATGTATTTTTTTTT
TTTAAAATTA
Protein sequenceShow/hide protein sequence
MSITAFKLPPSMELTVPIPRIPQNFFPIHRALTKFHVNSSSSSKSPNFQLPLPSKSPKPPKTLIAPRGEEFPTMAEIVAAGEAQNISLRLQTLGPFFRITAKSLESQREI
GKAEGLIRVWLRGRILHLDSIRLKRESLGMEKSIFGLGLFIGAVAIRYGYDCGCKTAELLAINDSDLYHSKLVRFYTRIGFKSVYEVSGSKIGDIGDMLVWGGVGTRMDA
PIEALLLKWCSREQNGAKQMSRREIRESDSNRHRSRFDGEPSPKRSRRDRKTLEEKLSRNSSSHVEDNKDRDQKHGLQHEGQDMIPHECSSPLDSKPEYGVSKGANRNSD
RRDRGTKHSVNPTEVPRSRSYLQHVKRGNDGQVGRSFGRSATRERGWWKDSREKEKDNDRASGKATAYNSQQRDEMPQAVRDDSHRDESSKQEVDAPTSAWKKACI