; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0005546 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0005546
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionUnknown protein
Genome locationchr02:1003439..1007400
RNA-Seq ExpressionIVF0005546
SyntenyIVF0005546
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10198.1 uncharacterized protein E5676_scaffold16G003430 [Cucumis melo var. makuwa]5.29e-127100Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGE
        ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGE
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGE

XP_004135797.2 uncharacterized protein LOC101213254 isoform X2 [Cucumis sativus]8.14e-13389.04Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLT AVYDFTF+LEFH RVPVTG VVSSA+RRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIK+RLELDESEVN MD+LSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
        ILDSVQQCLQIS LEEILS EK ESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPK YQVPS+QAL QNRFAEGKVSFVGFEFLGEIDS KIL
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL

Query:  GQNADIKKFNKRANKGTISSKQGLKYRC
        G+NADI+ FN RANKGTISSK   ++ C
Subjt:  GQNADIKKFNKRANKGTISSKQGLKYRC

XP_008450723.1 PREDICTED: uncharacterized protein LOC103492218 [Cucumis melo]1.42e-14797.37Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
        ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL

Query:  GQNADIKKFNKRANKGTISSKQGLKYRC
        GQNADIKKFNKRANKGTISSK   ++ C
Subjt:  GQNADIKKFNKRANKGTISSKQGLKYRC

XP_011659921.1 uncharacterized protein LOC101213254 isoform X3 [Cucumis sativus]3.63e-14088.38Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLT AVYDFTF+LEFH RVPVTG VVSSA+RRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIK+RLELDESEVN MD+LSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
        ILDSVQQCLQIS LEEILS EK ESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPK YQVPS+QAL QNRFAEGKVSFVGFEFLGEIDS KIL
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL

Query:  GQNADIKKFNKRANKGTISSK-----QGLKYRCRYRDINFS
        G+NADI+ FN RANKGTISSK       LKYRCRYRD NF+
Subjt:  GQNADIKKFNKRANKGTISSK-----QGLKYRCRYRDINFS

XP_031735971.1 uncharacterized protein LOC101213254 isoform X1 [Cucumis sativus]1.65e-13988.38Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLT AVYDFTF+LEFH RVPVTG VVSSA+RRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIK+RLELDESEVN MD+LSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
        ILDSVQQCLQIS LEEILS EK ESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPK YQVPS+QAL QNRFAEGKVSFVGFEFLGEIDS KIL
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL

Query:  GQNADIKKFNKRANKGTISSK-----QGLKYRCRYRDINFS
        G+NADI+ FN RANKGTISSK       LKYRCRYRD NF+
Subjt:  GQNADIKKFNKRANKGTISSK-----QGLKYRCRYRDINFS

TrEMBL top hitse value%identityAlignment
A0A0A0LZ50 Uncharacterized protein5.2e-11088.38Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLT AVYDFTF+LEFH RVPVTG VVSSA+RRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIK+RLELDESEVN MD+LSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
        ILDSVQQCLQIS LEEILS EK ESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPK YQVPS+QAL QNRFAEGKVSFVGFEFLGEIDS KIL
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL

Query:  GQNADIKKFNKRANKGTISS-----KQGLKYRCRYRDINFS
        G+NADI+ FN RANKGTISS     K  LKYRCRYRD NF+
Subjt:  GQNADIKKFNKRANKGTISS-----KQGLKYRCRYRDINFS

A0A1S3BP83 uncharacterized protein LOC1034922182.9e-11697.37Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
        ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKIL

Query:  GQNADIKKFNKRANKGTISSKQGLKYRC
        GQNADIKKFNKRANKGTISSK   ++ C
Subjt:  GQNADIKKFNKRANKGTISSKQGLKYRC

A0A5D3CG48 Uncharacterized protein4.9e-100100Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
        MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDS

Query:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGE
        ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGE
Subjt:  ILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGE

A0A6J1HDU1 uncharacterized protein LOC1114619602.6e-7768.85Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMD
        MF TAA  DFTF+LEFHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKSA+SLVKQLQGKPYGLR FGAAKQI KR         + MD
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMD

Query:  MLSLQPLVDSILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEF
         LSLQPLVDSILDS+Q CLQI       SAE+ ES +AEGR+ SRCEE+EH ICAQHEAGHFLVGYLMGVLPK+Y+VPS+QAL QNRFAEG VSFVGFEF
Subjt:  MLSLQPLVDSILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEF

Query:  LGEIDSVKILGQNADIKKFNKR------ANKGTISSKQGLKYRC
        LGEIDS+KIL +NADI   +KR       NKGTISS +  ++ C
Subjt:  LGEIDSVKILGQNADIKKFNKR------ANKGTISSKQGLKYRC

A0A6J1HY40 uncharacterized protein LOC111467900 isoform X11.1e-8071.86Show/hide
Query:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMD
        MF TAA  DFT +LEFHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKSA+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D
Subjt:  MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMD

Query:  MLSLQPLVDSILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEF
        +LSLQPLVDSILDS+Q CLQI       SAE+ ES +AEGR+ SRCEE+EH ICAQHEAGHFLVGYLMGVLPK+Y+VPS+QAL QNRFAEG VSFVGFEF
Subjt:  MLSLQPLVDSILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEF

Query:  LGEIDSVKILGQNADIKKFNKRANKGTISSK
        LG+IDS+KIL +NADIK  ++R NKG   +K
Subjt:  LGEIDSVKILGQNADIKKFNKRANKGTISSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54680.1 unknown protein1.4e-1434.62Show/hide
Query:  SILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQN-RFAEGKVSFVGFEFLGEIDSVK
        S++DSV + ++  +++E       E  +          E++ F   QHE+GHFLVGYL+GVLP+ Y++P+++A+ QN     G+V FVGFEFL ++ +  
Subjt:  SILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQN-RFAEGKVSFVGFEFLGEIDSVK

Query:  ILGQNADIKKFNKRANKGTISSKQGLKYRC
         L ++      + + N+G ISSK    + C
Subjt:  ILGQNADIKKFNKRANKGTISSKQGLKYRC

AT1G54680.2 unknown protein1.4e-1434.62Show/hide
Query:  SILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQN-RFAEGKVSFVGFEFLGEIDSVK
        S++DSV + ++  +++E       E  +          E++ F   QHE+GHFLVGYL+GVLP+ Y++P+++A+ QN     G+V FVGFEFL ++ +  
Subjt:  SILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQN-RFAEGKVSFVGFEFLGEIDSVK

Query:  ILGQNADIKKFNKRANKGTISSKQGLKYRC
         L ++      + + N+G ISSK    + C
Subjt:  ILGQNADIKKFNKRANKGTISSKQGLKYRC

AT1G54680.3 unknown protein1.4e-1435.38Show/hide
Query:  SILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQN-RFAEGKVSFVGFEFLGEIDSVK
        S++DSV + ++  +++E       E  +          E++ F   QHE+GHFLVGYL+GVLP+ Y++P+++A+ QN     G+V FVGFEFL ++    
Subjt:  SILDSVQQCLQISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQN-RFAEGKVSFVGFEFLGEIDSVK

Query:  ILGQNADIKKFNKRANKGTISSKQGLKYRC
        + GQ           N+G ISSK    + C
Subjt:  ILGQNADIKKFNKRANKGTISSKQGLKYRC

AT5G27290.1 unknown protein1.5e-2132.58Show/hide
Query:  FTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCL
        F F   +  R  V       + RR+AL+ VD  LS    ++A+SLVK LQGKP GLR FGAA+Q+ +R   L+E ++NG++  SL    D+ L S+++ L
Subjt:  FTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCL

Query:  QISFLE-----------------------------EILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQ--NRF
        QI+ +                              +++S      S+          ++ H    QHEAGHFLV YL+G+LP+ Y + S++AL +  +  
Subjt:  QISFLE-----------------------------EILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQ--NRF

Query:  AEGKVSFVGFEFLGEIDSVKI
         +   +FV +EFL E++S K+
Subjt:  AEGKVSFVGFEFLGEIDSVKI

AT5G27290.2 unknown protein1.5e-2132.58Show/hide
Query:  FTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCL
        F F   +  R  V       + RR+AL+ VD  LS    ++A+SLVK LQGKP GLR FGAA+Q+ +R   L+E ++NG++  SL    D+ L S+++ L
Subjt:  FTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCL

Query:  QISFLE-----------------------------EILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQ--NRF
        QI+ +                              +++S      S+          ++ H    QHEAGHFLV YL+G+LP+ Y + S++AL +  +  
Subjt:  QISFLE-----------------------------EILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQ--NRF

Query:  AEGKVSFVGFEFLGEIDSVKI
         +   +FV +EFL E++S K+
Subjt:  AEGKVSFVGFEFLGEIDSVKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCTCACTGCTGCTGTTTACGATTTCACTTTCGACCTAGAGTTTCACCGGAGGGTTCCGGTGACCGGATACGTCGTTTCTTCAGCGGAACGACGTCGTGCATTGAA
GCTTGTGGATCGAGCACTCTCAAAGCGGCAATACAAATCAGCTGTTTCATTGGTTAAGCAATTGCAAGGAAAACCATATGGACTTCGTGGTTTTGGCGCTGCCAAACAGA
TAATCAAGAGGCGTTTAGAACTGGATGAATCTGAGGTCAATGGGATGGATATGTTATCCCTTCAACCATTAGTGGATTCGATTCTGGATTCAGTTCAACAATGTCTTCAG
ATATCTTTCCTTGAGGAGATTCTCTCTGCTGAAAAGCCAGAGAGTTCAATGGCTGAAGGTAGACATTCTTCAAGATGTGAAGAACAAGAACACTTCATTTGTGCTCAACA
TGAAGCTGGGCATTTTCTTGTTGGATATTTGATGGGTGTTCTTCCAAAAGAATATCAAGTGCCAAGCGTTCAAGCTTTGAGCCAAAACAGATTTGCTGAAGGAAAAGTTT
CATTTGTTGGCTTTGAATTTCTTGGGGAAATTGATTCGGTAAAGATTTTGGGGCAAAATGCTGATATCAAAAAGTTTAATAAGAGAGCAAATAAAGGCACCATTTCCTCG
AAGCAGGGATTAAAATATCGATGTCGATATCGAGATATTAATTTTTCCTTTTTGTACATCATTGAATATTTATCCAAAAATAAGTTGCATATCAAGAAAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCCTCACTGCTGCTGTTTACGATTTCACTTTCGACCTAGAGTTTCACCGGAGGGTTCCGGTGACCGGATACGTCGTTTCTTCAGCGGAACGACGTCGTGCATTGAA
GCTTGTGGATCGAGCACTCTCAAAGCGGCAATACAAATCAGCTGTTTCATTGGTTAAGCAATTGCAAGGAAAACCATATGGACTTCGTGGTTTTGGCGCTGCCAAACAGA
TAATCAAGAGGCGTTTAGAACTGGATGAATCTGAGGTCAATGGGATGGATATGTTATCCCTTCAACCATTAGTGGATTCGATTCTGGATTCAGTTCAACAATGTCTTCAG
ATATCTTTCCTTGAGGAGATTCTCTCTGCTGAAAAGCCAGAGAGTTCAATGGCTGAAGGTAGACATTCTTCAAGATGTGAAGAACAAGAACACTTCATTTGTGCTCAACA
TGAAGCTGGGCATTTTCTTGTTGGATATTTGATGGGTGTTCTTCCAAAAGAATATCAAGTGCCAAGCGTTCAAGCTTTGAGCCAAAACAGATTTGCTGAAGGAAAAGTTT
CATTTGTTGGCTTTGAATTTCTTGGGGAAATTGATTCGGTAAAGATTTTGGGGCAAAATGCTGATATCAAAAAGTTTAATAAGAGAGCAAATAAAGGCACCATTTCCTCG
AAGCAGGGATTAAAATATCGATGTCGATATCGAGATATTAATTTTTCCTTTTTGTACATCATTGAATATTTATCCAAAAATAAGTTGCATATCAAGAAAATGTAA
Protein sequenceShow/hide protein sequence
MFLTAAVYDFTFDLEFHRRVPVTGYVVSSAERRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCLQ
ISFLEEILSAEKPESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEIDSVKILGQNADIKKFNKRANKGTISS
KQGLKYRCRYRDINFSFLYIIEYLSKNKLHIKKM