; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G006200 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G006200
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionCys-Gly metallodipeptidase DUG1
Genome locationCmo_Chr07:2792263..2793807
RNA-Seq ExpressionCmoCh07G006200
SyntenyCmoCh07G006200
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594940.1 hypothetical protein SDJN03_11493, partial [Cucurbita argyrosperma subsp. sororia]1.7e-16897.11Show/hide
Query:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
        +SRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQP QF+FSLQPFWVQPHPSIA
Subjt:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA

Query:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
        QPQCYPVGYPTYPGFPGSWDAS WGAQTQPLLFPGMSNYSRASYGFVSSQ WSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
Subjt:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM

Query:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
        IGQ+QGELGECKGRLIKLEAEISSFRSA ATDEAAVGVGNGGIMVKRRRSKRA APVCSQHSLQ RTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
Subjt:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME

Query:  MLADEEQQGGA
        MLADEEQQGGA
Subjt:  MLADEEQQGGA

KAG6604092.1 hypothetical protein SDJN03_04701, partial [Cucurbita argyrosperma subsp. sororia]4.0e-9861.21Show/hide
Query:  RNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQ-----SPLSQPHQFVFSLQPFWVQPHP
        RNEKSV DGTD AK+ KSGC  LENAAPQNQ+YT  + RALN +HA E+SS     +AVN+RL PP+NL  LQ      P  QP QFV S QPFWVQP P
Subjt:  RNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQ-----SPLSQPHQFVFSLQPFWVQPHP

Query:  SIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRG
        SI+                      QPQ    CYPVGYPTYPGF GSWDASIW  QT PLLFPG+SNY RASYGF SSQ   MP PNC+TSSS QPL RG
Subjt:  SIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRG

Query:  VIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTR
        VIKPPE+LS+ HQRLW AQSAENVQ+W+MIGQ QGEL +CKGRLIKLEAEISS RS  AT+E AV VGNGGI V     KR RSKRA+APV S    Q+R
Subjt:  VIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTR

Query:  TRIRKPRMGRT----KPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG
        TR RKP +G T    KP +L K+SLNKVDD  +  TP+++   ++ +G
Subjt:  TRIRKPRMGRT----KPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG

KAG7026901.1 hypothetical protein SDJN02_10908, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-13784.24Show/hide
Query:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
        +SRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQP QF+FSLQPFWVQPHPSIA
Subjt:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA

Query:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
        QPQCYPVGYPTYPGFPGSWDAS WGAQTQPLLFPGMSNYSRASYGFVSSQ WSMPAPNCITSSSVQPLSRG                             
Subjt:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM

Query:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
                    GRLIKLEAEISSFRSA ATDEAAVGVGNGGIMVKRRRSKRA APVCSQHSLQ RTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
Subjt:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME

Query:  MLADEEQQGGA
        MLADEEQQGGA
Subjt:  MLADEEQQGGA

XP_022963410.1 uncharacterized protein LOC111463624 [Cucurbita moschata]1.3e-17399.68Show/hide
Query:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
        +SRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
Subjt:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA

Query:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
        QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
Subjt:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM

Query:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
        IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
Subjt:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME

Query:  MLADEEQQGGA
        MLADEEQQGGA
Subjt:  MLADEEQQGGA

XP_023518125.1 uncharacterized protein LOC111781671 [Cucurbita pepo subsp. pepo]3.4e-13792.94Show/hide
Query:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
        + +NEKSVSDGTDEAKSAKSGC SLENAAPQNQRYTT VPRALNQQHARERSSPL VSSAVNDRL PPQNLANLQSPLSQP QFVFS QPFWVQPHPSIA
Subjt:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA

Query:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
        QPQCYPVGYPTYPGFPGSWDAS WGAQTQPLLFPGMSNYSR SYGFVSSQ WSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
Subjt:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM

Query:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRI
        IGQ+QGELGECKGRLIKLEAEISSFRSA ATDEAAVGVGNGGIMVK RRSKRA APVCSQHSLQ R  I
Subjt:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRI

TrEMBL top hitse value%identityAlignment
A0A6J1GCT8 uncharacterized protein LOC111453031 isoform X22.8e-9760.58Show/hide
Query:  RNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPL-SQPHQFVFSLQPFWVQPHPSIA-
        +NEKSV DGTD AK+ KSGC  LENAAPQNQ+YT  + RALN +HA E+SS     +AVN+RL PP+NL  LQ  L  QP QFV S QPFWVQP PSI+ 
Subjt:  RNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPL-SQPHQFVFSLQPFWVQPHPSIA-

Query:  ---------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKP
                             QPQ    CYPVGYPTYPGF GSWDASIW  QT PLLFPG+SNY RASYG  SSQ   MP PNC+TSSS QPL RGVIKP
Subjt:  ---------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKP

Query:  PEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTRTRIR
        PE+LS+ HQRLW AQSAENVQ+W+MIGQ+QGEL +CKGRLIKLEAEIS  RS  AT+E AV VGNGGI V     KR RSKRA+APV S    Q+RTR R
Subjt:  PEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTRTRIR

Query:  KP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG
        KP     ++G  KP +L K+SLNKVDD  ++ TP+++   ++ +G
Subjt:  KP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG

A0A6J1GDK6 uncharacterized protein LOC111453031 isoform X12.2e-9760.74Show/hide
Query:  GMSR-NEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPL-SQPHQFVFSLQPFWVQPHP
        G+SR NEKSV DGTD AK+ KSGC  LENAAPQNQ+YT  + RALN +HA E+SS     +AVN+RL PP+NL  LQ  L  QP QFV S QPFWVQP P
Subjt:  GMSR-NEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPL-SQPHQFVFSLQPFWVQPHP

Query:  SIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRG
        SI+                      QPQ    CYPVGYPTYPGF GSWDASIW  QT PLLFPG+SNY RASYG  SSQ   MP PNC+TSSS QPL RG
Subjt:  SIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRG

Query:  VIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTR
        VIKPPE+LS+ HQRLW AQSAENVQ+W+MIGQ+QGEL +CKGRLIKLEAEIS  RS  AT+E AV VGNGGI V     KR RSKRA+APV S    Q+R
Subjt:  VIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTR

Query:  TRIRKP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG
        TR RKP     ++G  KP +L K+SLNKVDD  ++ TP+++   ++ +G
Subjt:  TRIRKP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG

A0A6J1HHX5 uncharacterized protein LOC1114636246.3e-17499.68Show/hide
Query:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
        +SRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA
Subjt:  MSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIA

Query:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
        QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM
Subjt:  QPQCYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNM

Query:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
        IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME
Subjt:  IGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPME

Query:  MLADEEQQGGA
        MLADEEQQGGA
Subjt:  MLADEEQQGGA

A0A6J1IS95 uncharacterized protein LOC111478840 isoform X22.4e-9659.89Show/hide
Query:  RNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQ-----SPLSQPHQFVFSLQPFWVQPHP
        +NEKSV DGTD AK+AKSGC  LENAAPQNQ+YT  + RALN +HA E+SS     +AVN+RL PP+NL   Q      P  QP QFV S QPFWVQP  
Subjt:  RNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQ-----SPLSQPHQFVFSLQPFWVQPHP

Query:  SIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRG
        SI+                      QPQ    CYPVGYPTYPGF GSWDASIW  QT PLLFPG+SNY RASYGF SSQ   MP P+C+ SSS QPL RG
Subjt:  SIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRG

Query:  VIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTR
        VIKPPE+LS+ HQRLW AQSAENVQ+W+MIGQ+Q EL +CKGRLIKLEAEISS RS  ATDEAAV VGNGGI V     KR RSKRA+APV S    Q+R
Subjt:  VIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHSLQTR

Query:  TRIRKP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG
        TR RKP     ++G  KP +L K+SLNKVDD  +  TP+++   ++ +G
Subjt:  TRIRKP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG

A0A6J1IVR4 uncharacterized protein LOC111478840 isoform X11.8e-9660.06Show/hide
Query:  GMSR-NEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQ-----SPLSQPHQFVFSLQPFWV
        G+SR NEKSV DGTD AK+AKSGC  LENAAPQNQ+YT  + RALN +HA E+SS     +AVN+RL PP+NL   Q      P  QP QFV S QPFWV
Subjt:  GMSR-NEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQ-----SPLSQPHQFVFSLQPFWV

Query:  QPHPSIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQP
        QP  SI+                      QPQ    CYPVGYPTYPGF GSWDASIW  QT PLLFPG+SNY RASYGF SSQ   MP P+C+ SSS QP
Subjt:  QPHPSIA----------------------QPQ----CYPVGYPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQP

Query:  LSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHS
        L RGVIKPPE+LS+ HQRLW AQSAENVQ+W+MIGQ+Q EL +CKGRLIKLEAEISS RS  ATDEAAV VGNGGI V     KR RSKRA+APV S   
Subjt:  LSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKLEAEISSFRSAAATDEAAVGVGNGGIMV-----KRRRSKRAVAPVCSQHS

Query:  LQTRTRIRKP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG
         Q+RTR RKP     ++G  KP +L K+SLNKVDD  +  TP+++   ++ +G
Subjt:  LQTRTRIRKP-----RMGRTKPNVLEKESLNKVDDKQQS-TPMEMLADEEQQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATGAGTCGGAATGAGAAAAGCGTATCTGATGGGACAGATGAAGCAAAAAGCGCCAAATCTGGGTGCCACTCTTTGGAGAATGCGGCTCCGCAAAATCAGCGCTA
CACTACGTTTGTTCCAAGGGCACTCAACCAGCAACATGCAAGGGAAAGATCTTCACCGTTACCCGTTTCTTCCGCCGTGAATGACCGGCTTCACCCACCGCAAAACCTTG
CCAATCTTCAGAGCCCGTTGTCACAGCCGCATCAATTTGTATTCTCCTTGCAACCCTTTTGGGTACAGCCGCACCCGAGCATTGCCCAACCTCAATGTTACCCTGTTGGA
TATCCAACATACCCTGGCTTTCCTGGTTCCTGGGATGCCTCAATTTGGGGGGCTCAAACACAACCATTACTGTTTCCTGGGATGTCCAATTATTCAAGAGCATCATATGG
TTTTGTCTCTTCTCAATGTTGGTCTATGCCAGCTCCTAATTGTATTACATCTTCCTCTGTACAACCCCTTTCAAGAGGAGTCATCAAGCCCCCTGAAAAGCTTTCTAAGA
CTCATCAAAGACTCTGGGGAGCACAGTCTGCAGAAAATGTTCAAATGTGGAATATGATTGGGCAGGTGCAAGGGGAATTAGGCGAGTGTAAGGGCAGATTGATCAAGCTT
GAAGCTGAAATTTCATCTTTCAGGTCAGCAGCAGCTACGGATGAGGCTGCCGTGGGAGTTGGAAATGGTGGCATTATGGTGAAGCGGAGACGGTCGAAACGAGCAGTAGC
CCCAGTTTGTTCACAACATTCATTGCAAACTCGGACTCGGATACGAAAGCCAAGAATGGGAAGAACAAAACCAAATGTTCTTGAAAAAGAAAGCTTGAATAAGGTGGATG
ATAAACAACAATCTACACCGATGGAAATGTTAGCAGACGAGGAACAACAAGGCGGAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATGAGTCGGAATGAGAAAAGCGTATCTGATGGGACAGATGAAGCAAAAAGCGCCAAATCTGGGTGCCACTCTTTGGAGAATGCGGCTCCGCAAAATCAGCGCTA
CACTACGTTTGTTCCAAGGGCACTCAACCAGCAACATGCAAGGGAAAGATCTTCACCGTTACCCGTTTCTTCCGCCGTGAATGACCGGCTTCACCCACCGCAAAACCTTG
CCAATCTTCAGAGCCCGTTGTCACAGCCGCATCAATTTGTATTCTCCTTGCAACCCTTTTGGGTACAGCCGCACCCGAGCATTGCCCAACCTCAATGTTACCCTGTTGGA
TATCCAACATACCCTGGCTTTCCTGGTTCCTGGGATGCCTCAATTTGGGGGGCTCAAACACAACCATTACTGTTTCCTGGGATGTCCAATTATTCAAGAGCATCATATGG
TTTTGTCTCTTCTCAATGTTGGTCTATGCCAGCTCCTAATTGTATTACATCTTCCTCTGTACAACCCCTTTCAAGAGGAGTCATCAAGCCCCCTGAAAAGCTTTCTAAGA
CTCATCAAAGACTCTGGGGAGCACAGTCTGCAGAAAATGTTCAAATGTGGAATATGATTGGGCAGGTGCAAGGGGAATTAGGCGAGTGTAAGGGCAGATTGATCAAGCTT
GAAGCTGAAATTTCATCTTTCAGGTCAGCAGCAGCTACGGATGAGGCTGCCGTGGGAGTTGGAAATGGTGGCATTATGGTGAAGCGGAGACGGTCGAAACGAGCAGTAGC
CCCAGTTTGTTCACAACATTCATTGCAAACTCGGACTCGGATACGAAAGCCAAGAATGGGAAGAACAAAACCAAATGTTCTTGAAAAAGAAAGCTTGAATAAGGTGGATG
ATAAACAACAATCTACACCGATGGAAATGTTAGCAGACGAGGAACAACAAGGCGGAGCTTGA
Protein sequenceShow/hide protein sequence
MGMSRNEKSVSDGTDEAKSAKSGCHSLENAAPQNQRYTTFVPRALNQQHARERSSPLPVSSAVNDRLHPPQNLANLQSPLSQPHQFVFSLQPFWVQPHPSIAQPQCYPVG
YPTYPGFPGSWDASIWGAQTQPLLFPGMSNYSRASYGFVSSQCWSMPAPNCITSSSVQPLSRGVIKPPEKLSKTHQRLWGAQSAENVQMWNMIGQVQGELGECKGRLIKL
EAEISSFRSAAATDEAAVGVGNGGIMVKRRRSKRAVAPVCSQHSLQTRTRIRKPRMGRTKPNVLEKESLNKVDDKQQSTPMEMLADEEQQGGA