CuGenDBv2

Pay0007768 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0007768
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: YqgFc domain-containing protein
Genome location: chr06:174042..178474
RNA-Seq Expression: Pay0007768
Synteny: Pay0007768
Gene Ontology terms: GO:0000967 - rRNA 5'-end processing (biological process)
InterPro domains:
IPR005227 - Putative pre-16S rRNA nuclease
IPR006641 - YqgF/RNase H-like domain
IPR012337 - Ribonuclease H-like superfamily
IPR037027 - YqgF/RNase H-like domain superfamily
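
The genome location in this record uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be split into its parts and the gene span computed. `parse_location` and `span_length` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re

# Hypothetical helpers; the 1-based, inclusive convention is an assumption.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a string such as 'chr06:174042..178474' into (chrom, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length in bp of an interval whose start and end are both included."""
    return end - start + 1


chrom, start, end = parse_location("chr06:174042..178474")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4433 bp on chr06, which includes introns and untranslated regions and is therefore longer than the CDS.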


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048825.1 UPF0081 domain-containing protein [Cucumis melo var. makuwa] (e value: 1.2e-95; % identity: 98.34)
Query:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
        MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
Subjt:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE

Query:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLN
        LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMIS  +N
Subjt:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLN

XP_004133768.1 uncharacterized protein LOC101219287 isoform X1 [Cucumis sativus] (e value: 1.1e-120; % identity: 94.04)
Query:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
        MFT   GQCFQFQAFQFPLQ+H+LLHSLHPHLPPPS PAL SKSHNNNP SSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFS RPLTVLE
Subjt:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE

Query:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLER
        LRGQKLEAKLI+IAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMIS+GLNKSTRQKKIDAYAAMMVLER
Subjt:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLER

Query:  YFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
        YF  SGQGTELLVPKSLVLQDKLIEGPPTDPDF+D
Subjt:  YFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD

XP_008437782.1 PREDICTED: uncharacterized protein LOC103483114 isoform X1 [Cucumis melo] (e value: 4.2e-152; % identity: 99.63)
Query:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
        MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILP MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
Subjt:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK

Query:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
        LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
Subjt:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD

Query:  EHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
        EHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
Subjt:  EHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD

XP_011650631.1 uncharacterized protein LOC101219287 isoform X2 [Cucumis sativus] (e value: 9.2e-99; % identity: 93.88)
Query:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
        MFT   GQCFQFQAFQFPLQ+H+LLHSLHPHLPPPS PAL SKSHNNNP SSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFS RPLTVLE
Subjt:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE

Query:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMM
        LRGQKLEAKLI+IAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMIS+GLNKSTRQKKIDAYAAM+
Subjt:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMM

XP_016898999.1 PREDICTED: uncharacterized protein LOC103483114 isoform X2 [Cucumis melo] (e value: 4.0e-118; % identity: 99.53)
Query:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
        MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILP MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
Subjt:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK

Query:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
        LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
Subjt:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD

Query:  EHGTTAEAESHMIS
        EHGTTAEAESHMIS
Subjt:  EHGTTAEAESHMIS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3L5 YqgFc domain-containing protein (e value: 5.4e-121; % identity: 94.04)
Query:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
        MFT   GQCFQFQAFQFPLQ+H+LLHSLHPHLPPPS PAL SKSHNNNP SSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFS RPLTVLE
Subjt:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE

Query:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLER
        LRGQKLEAKLI+IAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMIS+GLNKSTRQKKIDAYAAMMVLER
Subjt:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLER

Query:  YFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
        YF  SGQGTELLVPKSLVLQDKLIEGPPTDPDF+D
Subjt:  YFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD

A0A1S3AUT7 uncharacterized protein LOC103483114 isoform X1 (e value: 2.0e-152; % identity: 99.63)
Query:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
        MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILP MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
Subjt:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK

Query:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
        LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
Subjt:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD

Query:  EHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
        EHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
Subjt:  EHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD

A0A1S4DSN6 uncharacterized protein LOC103483114 isoform X2 (e value: 1.9e-118; % identity: 99.53)
Query:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
        MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILP MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK
Subjt:  MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRK

Query:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
        LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD
Subjt:  LDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYD

Query:  EHGTTAEAESHMIS
        EHGTTAEAESHMIS
Subjt:  EHGTTAEAESHMIS

A0A5A7U5C1 UPF0081 domain-containing protein (e value: 6.0e-96; % identity: 98.34)
Query:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
        MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE
Subjt:  MFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLE

Query:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLN
        LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMIS  +N
Subjt:  LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLN

A0A6J1CTB8 uncharacterized protein LOC111014097 isoform X1 (e value: 3.4e-91; % identity: 92.97)
Query:  SSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAA
        SSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFS RPLTVLELRG KLE KLI+IAEQEEADEFIIGLPKS DGKETPQSNKIRSIAGRVAA
Subjt:  SSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAA

Query:  RAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
        RAAERGWRVYL+DEHGTT+EAE+HMI KGLNKSTRQKKIDAYAAMMVLERY+ MSGQGTEL+VPKSLVLQ+KLIEGPPTDPDFKD
Subjt:  RAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD

SwissProt top hits (e value, % identity, alignment)
A1SJC8 Putative pre-16S rRNA nuclease (e value: 1.2e-11; % identity: 36.5)
Query:  RGGFSLGVDLGTSRTGLALS--KGFSIRPL-TVLELRGQKLEAKLIQIAEQEEAD---EFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLY
        R G  +G+D G +R G+A S   GF   P+ TV   +G       I  AE++E     E ++GLP+S  G+E P + K+R  AGR+AAR A     V L 
Subjt:  RGGFSLGVDLGTSRTGLALS--KGFSIRPL-TVLELRGQKLEAKLIQIAEQEEAD---EFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLY

Query:  DEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLE
        DE  TT  AE+ +  +G     R+  +D  AA+++L+
Subjt:  DEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLE

A5D3C3 Putative pre-16S rRNA nuclease (e value: 6.4e-10; % identity: 31.65)
Query:  LGVDLGTSRTGLALS--KGFSIRPLTVLELRGQKLEAKLIQIAE---QEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTT
        +G+DLG  + G+ALS   G++ + L V+ ++G   EA + +I+E   Q    + ++GLP++ +G   P++ + R+ AG +A         V L+DE  TT
Subjt:  LGVDLGTSRTGLALS--KGFSIRPLTVLELRGQKLEAKLIQIAE---QEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTT

Query:  AEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQ
         EAE  +I   L+++ R++ ID  AA+++L+ +    G+
Subjt:  AEAESHMISKGLNKSTRQKKIDAYAAMMVLERYFLMSGQ

B8DIJ5 Putative pre-16S rRNA nuclease (e value: 1.4e-09; % identity: 29.77)
Query:  LGVDLGTSRTGLALSK---GFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAE
        LG+D GT RTG+A S      +    T++     +  A+L+ +AE+E A+ +++GLP   DG +T  + ++R+   R+  R       VYL +E  ++ E
Subjt:  LGVDLGTSRTGLALSK---GFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAE

Query:  AESHMISKGLNKSTRQKKIDAYAAMMVLERY
        AE  +   GL+    +  +D  AA+ +L+ +
Subjt:  AESHMISKGLNKSTRQKKIDAYAAMMVLERY

C6C125 Putative pre-16S rRNA nuclease (e value: 1.9e-09; % identity: 32.06)
Query:  LGVDLGTSRTGLALSKGFSI--RPLTVLE-LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAE
        L +D GT R GLA+S    I   P  V+E      + ++L++I E E+  + +IGLP S DG++T  + ++R+ A  +  R       ++L DE  ++  
Subjt:  LGVDLGTSRTGLALSKGFSI--RPLTVLE-LRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAE

Query:  AESHMISKGLNKSTRQKKIDAYAAMMVLERY
        AE  +   GL    R+K +D+ AA ++LE +
Subjt:  AESHMISKGLNKSTRQKKIDAYAAMMVLERY

Q3SR28 Putative pre-16S rRNA nuclease (e value: 8.3e-10; % identity: 33.58)
Query:  HWRG-GFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKL---EAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLY
        HW   G  +G+DLGT   G+A+S         V  +R +      A+L+ IA +  A+ FI+GLP + DG E P++   R+ A R  AR  +  + + L+
Subjt:  HWRG-GFSLGVDLGTSRTGLALSKGFSIRPLTVLELRGQKL---EAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLY

Query:  DEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLE
        DE  +TA  E  +I   ++++ R K ID +AA+ +L+
Subjt:  DEHGTTAEAESHMISKGLNKSTRQKKIDAYAAMMVLE

Arabidopsis top hits (e value, % identity, alignment)
AT1G12244.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.4e-68; % identity: 56.91)
Query:  ILPIMFTLLCGQC----FQFQAFQFPLQSH---RLLHSLHP--HLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLAL
        IL + F L    C    F   A  F  Q +    L+H L P  +   P P A+ S           E+PPNA+RRK+D +WRGGFSLGVDLG SRTG+A+
Subjt:  ILPIMFTLLCGQC----FQFQAFQFPLQSH---RLLHSLHP--HLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFSLGVDLGTSRTGLAL

Query:  SKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQK
        SKG++++PLTVL+ RGQKLE +L++IAE+EEADEFIIGLP+S DGKET QSNKIRS+AGR+A +AAERGWRVY++DEHGTT+EA   MI  GL+KS RQ 
Subjt:  SKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKSTRQK

Query:  KIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDF
        + DAYAA+++LERYF   G G E+++PKSL LQ K+  G P DPDF
Subjt:  KIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDF
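
In the alignments above, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, identities and positives for one displayed segment can be counted from that line. `midline_stats` is a hypothetical helper, and the %identity values reported on the page may be computed over the whole alignment rather than a single segment.

```python
def midline_stats(query, midline):
    """Return (identities, positives, length) for one alignment segment.

    The alignment length is taken from the query line, because trailing
    spaces of the midline may have been lost when the page was copied.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, length


# Last displayed segment of the XP_004133768.1 alignment, copied from the page.
query = "YFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD"
midline = "YF  SGQGTELLVPKSLVLQDKLIEGPPTDPDF+D"

identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

Summing the counts over every segment of a hit and dividing by the summed lengths gives an estimate that can be compared with the reported figure.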


Sequences
CDS sequence
ATGGAAAAATTGAGGATTTTGGCGGGGGGAGGGGTTGTGATTTGTGAAAGAGCTCCGATCCATTGTTTATGTTGGGTTTTGGAATCCCCAATTCCCCCAATTCTCCCAAT
AATGTTTACGTTGTTGTGTGGACAATGTTTCCAATTCCAAGCCTTTCAGTTTCCGTTGCAAAGCCACAGATTACTTCACTCCCTCCATCCTCATCTCCCTCCTCCATCTC
CACCAGCCCTATTTTCCAAATCCCACAACAACAACCCATTTTCTTCCATTGAACTTCCTCCCAACGCCCTCCGCCGCAAGCTCGATCCTCACTGGAGAGGAGGTTTCAGT
CTAGGTGTCGACCTCGGAACCTCTCGCACTGGACTTGCTCTTAGTAAAGGCTTCTCCATTCGTCCTCTTACCGTTCTAGAGTTGCGAGGACAAAAGCTTGAGGCTAAGCT
CATTCAGATTGCTGAACAGGAAGAGGCTGATGAATTTATTATTGGACTTCCTAAATCATGCGATGGGAAAGAGACACCTCAGTCAAACAAAATTCGTAGTATTGCCGGAA
GGGTGGCAGCCCGAGCAGCTGAAAGGGGCTGGAGAGTTTATTTGTATGATGAACATGGGACAACAGCAGAAGCGGAAAGCCATATGATTTCCAAGGGTCTCAATAAATCT
ACTAGGCAGAAGAAGATTGATGCCTATGCTGCCATGATGGTACTCGAGAGATATTTTCTCATGTCAGGCCAGGGAACCGAACTTTTAGTGCCCAAGAGTCTAGTCTTACA
AGATAAACTTATCGAGGGACCACCTACAGACCCAGACTTTAAGGATTGA
mRNA sequence
GATAAAGGTTTGAATAAATAAAGAGTAATGGAAAAATTGAGGATTTTGGCGGGGGGAGGGGTTGTGATTTGTGAAAGAGCTCCGATCCATTGTTTATGTTGGGTTTTGGA
ATCCCCAATTCCCCCAATTCTCCCAATAATGTTTACGTTGTTGTGTGGACAATGTTTCCAATTCCAAGCCTTTCAGTTTCCGTTGCAAAGCCACAGATTACTTCACTCCC
TCCATCCTCATCTCCCTCCTCCATCTCCACCAGCCCTATTTTCCAAATCCCACAACAACAACCCATTTTCTTCCATTGAACTTCCTCCCAACGCCCTCCGCCGCAAGCTC
GATCCTCACTGGAGAGGAGGTTTCAGTCTAGGTGTCGACCTCGGAACCTCTCGCACTGGACTTGCTCTTAGTAAAGGCTTCTCCATTCGTCCTCTTACCGTTCTAGAGTT
GCGAGGACAAAAGCTTGAGGCTAAGCTCATTCAGATTGCTGAACAGGAAGAGGCTGATGAATTTATTATTGGACTTCCTAAATCATGCGATGGGAAAGAGACACCTCAGT
CAAACAAAATTCGTAGTATTGCCGGAAGGGTGGCAGCCCGAGCAGCTGAAAGGGGCTGGAGAGTTTATTTGTATGATGAACATGGGACAACAGCAGAAGCGGAAAGCCAT
ATGATTTCCAAGGGTCTCAATAAATCTACTAGGCAGAAGAAGATTGATGCCTATGCTGCCATGATGGTACTCGAGAGATATTTTCTCATGTCAGGCCAGGGAACCGAACT
TTTAGTGCCCAAGAGTCTAGTCTTACAAGATAAACTTATCGAGGGACCACCTACAGACCCAGACTTTAAGGATTGAGGATGTGCTTCAGGCCAGGAAAAAAATAATCTGG
GAGAGGCCCTTGGGGTATTTTTCTCTTTGACGGATAACAAAATTGCTATCAAACAAAGGGTATTTATTACTGTTGAAGTGGAAGAGAGCCGCAACCGACTTAGAAATGGA
GGGTTACTTCCTCAAAGCATGGTCACGGAATTAGGTGAAAAGAGATACCGCTGTTCCTAATGAAGGCTCAATTGAAGGTTGCGGGTTGTGAGATGATGACGACGACGACG
CTTTTGACATTATATCTTTATAACAATCTTGTCAGAATCTGTCCTCACATATTGAATTCACCTACACACTTCGGTTAGTTAAGGATGGGATTTGCTTTGGTCCGTCCTAT
GCACTTGTGCTTCTCCTTGATCGCCGAATCGCGTATGTACTTGGTTGAAATTATTTCTAAACGAAGAAATTGATTAGATGCTGCAGTCATAATGAGATTTAAAAATGGCG
CTTTGCTAAGTTAATACTGTGGAAAAGCGTCTAAAGGCCTAGACAGTCTCCTAATTTCATGCTTGAGGACTCTTCGTCGGCATCTCCATGGGTTAGAGTCTTTTACCTTT
AGATGCCATTTTGCGTTGGCTAATTTTTATGGGAGATCTTTACTATTTTATAAACAAAGATCTTAGGAATCCCTGAAATATCAATAATCCTATATGATCCATCATTCGTT
TTAAAATTATTGTGATTTTGGGTAATAATGGGCAAGTAGAGATGATCTAAACATTGGTAGTAATAGTTGCTATCTGTGATGGTGCCTCTTGCACTCCATTTACTTTTC
Protein sequence
MEKLRILAGGGVVICERAPIHCLCWVLESPIPPILPIMFTLLCGQCFQFQAFQFPLQSHRLLHSLHPHLPPPSPPALFSKSHNNNPFSSIELPPNALRRKLDPHWRGGFS
LGVDLGTSRTGLALSKGFSIRPLTVLELRGQKLEAKLIQIAEQEEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLYDEHGTTAEAESHMISKGLNKS
TRQKKIDAYAAMMVLERYFLMSGQGTELLVPKSLVLQDKLIEGPPTDPDFKD
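
The protein sequence corresponds to the CDS read codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly, and checks only the first and last few codons of the CDS. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and last 18 bases of the CDS shown on this page.
print(translate("ATGGAAAAATTGAGGATTTTGGCG"))
print(translate("CCAGACTTTAAGGATTGA"))
```

The two fragments translate to the start (`MEKLRILA`) and the end (`PDFKD`, followed by the stop codon) of the protein sequence listed above.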