CuGenDBv2

MC01g1482 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1482
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: 50S Ribosomal protein
Genome location: MC01:19237693..19243863
RNA-Seq Expression: MC01g1482
Synteny: MC01g1482
Gene Ontology terms: GO:0005525 - GTP binding (molecular function)
InterPro domains: IPR011719 - Conserved hypothetical protein CHP02058
                  IPR037103 - Tubulin/FtsZ-like, C-terminal domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_008439752.1 PREDICTED: uncharacterized protein LOC103484457 isoform X1 [Cucumis melo] | e value: 6.19e-104 | % identity: 82.05
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGSI++N+ F    + RFRSY  SN    SS  H +CSLTRP ISRLIK TA SSMEVE+GG SA V ST  PMKLLFVEMGVGYDQHGQD+TAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGV+F +MKLQIKLGVP +LQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

XP_022142213.1 uncharacterized protein LOC111012382 isoform X1 [Momordica charantia] | e value: 2.26e-134 | % identity: 100
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

XP_022925764.1 uncharacterized protein LOC111433074 [Cucurbita moschata] | e value: 2.90e-109 | % identity: 85.13
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGS V+N+ FP   +LRFRSY  SN++ SSSSAH +CS TRPPISRL+K TAASSMEVE+GG+SA VGST  PMKLLFVEMGVGYDQHGQD+TAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGVTF +MKLQIKLGVP +LQQSLDIEKVKSVFPYGKI+NVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

XP_023544561.1 uncharacterized protein LOC111804102 [Cucurbita pepo subsp. pepo] | e value: 5.85e-109 | % identity: 85.13
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGSIV+N+ FP   +LRFRSY  SN++ SSSSAH +CS TRP ISRL+K TAASSMEVE+GG+SA VGST  PMKLLFVEMGVGYDQHGQD+TAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGVTF +MKLQIKLGVP +LQQSLDIEKVKSVFPYGKI+NVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

XP_038883708.1 uncharacterized protein LOC120074614 isoform X1 [Benincasa hispida] | e value: 4.21e-107 | % identity: 83.16
Query:  QMTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMR
        QMTGSIV+N  F    + RFRSY  S  N  S SAH +CSLTRP ISRL+K TA SSMEVE+GG+SA  GST  PMKLLFVEMGVGYDQHGQD+TAAAMR
Subjt:  QMTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMR

Query:  ACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        ACRDAISSNSIPAFRRGSIPGVTF +MKLQIKLGVP +LQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  ACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNJ4 Uncharacterized protein | e value: 3.00e-104 | % identity: 82.05
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MT SI++N+ F    + RFRSY  S  N  SSS H +CSLTRP ISRLIK TA SSMEVE+GG SA V ST  PMKLLFVEMGVGYDQHGQDVTAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGV+F +MKLQIKLGVP +LQQSLD+EKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

A0A1S3B079 uncharacterized protein LOC103484457 isoform X1 | e value: 3.00e-104 | % identity: 82.05
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGSI++N+ F    + RFRSY  SN    SS  H +CSLTRP ISRLIK TA SSMEVE+GG SA V ST  PMKLLFVEMGVGYDQHGQD+TAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGV+F +MKLQIKLGVP +LQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

A0A6J1CKY1 uncharacterized protein LOC111012382 isoform X2 | e value: 2.71e-96 | % identity: 96.69
Query:  ISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDI
        + +L  ATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDI
Subjt:  ISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDI

Query:  EKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        EKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  EKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

A0A6J1CMQ6 uncharacterized protein LOC111012382 isoform X1 | e value: 1.09e-134 | % identity: 100
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

A0A6J1ED44 uncharacterized protein LOC111433074 | e value: 1.40e-109 | % identity: 85.13
Query:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA
        MTGS V+N+ FP   +LRFRSY  SN++ SSSSAH +CS TRPPISRL+K TAASSMEVE+GG+SA VGST  PMKLLFVEMGVGYDQHGQD+TAAAMRA
Subjt:  MTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRA

Query:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        CRDAISSNSIPAFRRGSIPGVTF +MKLQIKLGVP +LQQSLDIEKVKSVFPYGKI+NVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  CRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G29040.1 unknown protein | e value: 1.1e-58 | % identity: 68.42
Query:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFD
        S++++  S  HF      P ISR IK  A S+ME      +   G   + MKLLFVEMGVGYDQHGQDVT+AAM+AC++AISSNSIPAFRRGSIPGV+F 
Subjt:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFD

Query:  QMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        +MKLQIKLGVP +L Q LD++KVKS+FPYGKI+NVEVVDGGLICSSGV VEEMGD N+DCYIVN AVYVGY
Subjt:  QMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

AT1G29040.2 unknown protein | e value: 5.1e-48 | % identity: 60.82
Query:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFD
        S++++  S  HF      P ISR IK  A S+ME      +   G   + MKLLFVEMGVGYDQHGQDVT+AAM+AC               SIPGV+F 
Subjt:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFD

Query:  QMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        +MKLQIKLGVP +L Q LD++KVKS+FPYGKI+NVEVVDGGLICSSGV VEEMGD N+DCYIVN AVYVGY
Subjt:  QMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY

AT1G29040.3 unknown protein | e value: 3.8e-19 | % identity: 56.99
Query:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGS
        S++++  S  HF      P ISR IK  A S+ME      +   G   + MKLLFVEMGVGYDQHGQDVT+AAM+AC++AISSNSIPAFRRG+
Subjt:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGS

AT1G29040.4 unknown protein | e value: 1.1e-58 | % identity: 68.42
Query:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFD
        S++++  S  HF      P ISR IK  A S+ME      +   G   + MKLLFVEMGVGYDQHGQDVT+AAM+AC++AISSNSIPAFRRGSIPGV+F 
Subjt:  SNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFD

Query:  QMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
        +MKLQIKLGVP +L Q LD++KVKS+FPYGKI+NVEVVDGGLICSSGV VEEMGD N+DCYIVN AVYVGY
Subjt:  QMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY


Sequences
CDS sequence
TTTTCTAAAAGAGATATATATTTTATAAAATTTGTTCCAACCTTCGGAAGCCGCTCCCGAGTCCGGACATGGATCTGTAGCTTTTTGACGCATTTAGCAACGCAGATGAC
AGGCTCCATAGTTGTAAATGTGCCATTTCCTTGGGCAATTGAACTCAGGTTTCGTTCTTACCTTCATTCCAATAAGAATCGATCATCATCATCTGCGCATTTTCTCTGCT
CTCTCACACGGCCTCCAATTTCTCGCCTTATCAAAGCTACTGCTGCGTCGTCCATGGAGGTTGAGGAAGGTGGAAGATCTGCAGCTGTTGGCAGCACAGCTTCGCCCATG
AAGCTCTTGTTCGTCGAGATGGGTGTCGGATACGATCAACATGGCCAAGATGTCACGGCAGCTGCTATGCGGGCCTGCAGGGATGCCATATCTTCCAATTCGATTCCAGC
ATTCCGTAGAGGTTCCATTCCTGGAGTCACATTTGATCAGATGAAACTTCAGATCAAACTTGGAGTACCACAGACACTTCAACAATCCTTGGATATTGAAAAAGTCAAGT
CTGTCTTCCCATATGGAAAGATTCTGAATGTTGAGGTCGTTGATGGTGGCTTAATATGCTCCAGCGGTGTGCACGTGGAAGAAATGGGAGACAAAAATGATGACTGTTAT
ATAGTAAATGCTGCTGTATATGTTGGCTATTAA
mRNA sequence
ATTTTTCTAAAAGAGATATATATTTTATAAAATTTGTTCCAACCTTCGGAAGCCGCTCCCGAGTCCGGACATGGATCTGTAGCTTTTTGACGCATTTAGCAACGCAGATG
ACAGGCTCCATAGTTGTAAATGTGCCATTTCCTTGGGCAATTGAACTCAGGTTTCGTTCTTACCTTCATTCCAATAAGAATCGATCATCATCATCTGCGCATTTTCTCTG
CTCTCTCACACGGCCTCCAATTTCTCGCCTTATCAAAGCTACTGCTGCGTCGTCCATGGAGGTTGAGGAAGGTGGAAGATCTGCAGCTGTTGGCAGCACAGCTTCGCCCA
TGAAGCTCTTGTTCGTCGAGATGGGTGTCGGATACGATCAACATGGCCAAGATGTCACGGCAGCTGCTATGCGGGCCTGCAGGGATGCCATATCTTCCAATTCGATTCCA
GCATTCCGTAGAGGTTCCATTCCTGGAGTCACATTTGATCAGATGAAACTTCAGATCAAACTTGGAGTACCACAGACACTTCAACAATCCTTGGATATTGAAAAAGTCAA
GTCTGTCTTCCCATATGGAAAGATTCTGAATGTTGAGGTCGTTGATGGTGGCTTAATATGCTCCAGCGGTGTGCACGTGGAAGAAATGGGAGACAAAAATGATGACTGTT
ATATAGTAAATGCTGCTGTATATGTTGGCTATTAATTTTTGTGCCTCATATTCCTCTTCCGGAGATCTTAACTCTTAAGGGTTATTCACCCCTCCCCACAAACCACTGAA
CAGTGTATGCATGCTCGAGGAGCTTAGTCATCTATGATTGTGATTATACTGTAAGCTAAGCTTGATGAATACTTCGACATTTCAGTTTCCTTGGAGATATTGAGAGAACC
ACAGGGTTATTTCCTGTATTGATTAAGCTCTCAAATTTAGTTTTTCTTGATTTTAACATGCTGCAATATACTTTCACTATGTTTCAAACTCATGTTTTAGTCATCAACAT
CTGTAATTACTAAATGAACTGCCATATCAGTTCGTATTCAGTGGAATGATCTTTGAGTGCTTGGTGGAGTTGATGTCAGGTACTTAGTTATTGCATTATTTATTGACGAT
TGAGGTCATGCTTCTTAACTTGGCCATTGGTCAGATATTTTCATTTTTTCTGGTCTCAGTAACTATTCAGGAGGATAGGGGTTGACAAATGACAAGATGTACAGTTAGCC
TGACATTCAGATTCTTTGCAAATATAAACTGATCTCAAAGCCAAACCCAAATTCAATTGGACTCATTTTACAAAGATGCGCAATAGAGCATGTTAATCCAGTGTTGCCAT
CTCACCTACATTCTTTCCCATCAATATGAAACTATAGGCACAATTACTTTATACAATTGCTTTGTTGCTATTACATTTCTTGCTTCCAGTTGCCGTTAAGCTGGTTAATG
AAATGAATGAAGTTCTAACATTCCCTGAGAAGAATTGCAGAGACGAAGGGAAGTTCAACTGGCCTTCACATGGTGAAAACTGCATAGAGAAGCTGGTTCCAAGTGTCAGG
CAAACCGGGGAGGCACCAGTGGCTGCAATCTGTGCCTGCATGCTCGCCGCTGTAAGCAGAGGGATGTGCATCTTTGCGCAGTTGCGACAATTTTGTGATGTCCAGCAAGA
AAACTGGCTTCCTCATTCTAGCTAGCACTCTCTGCACGATTTCTGCTGCGGGTGGCGTTCCCGCTGGATACAGTGACCCCGTAAGCGGTACGCTTTCTCCATAGCAACTC
CTCCTTGGTTGGTTCCAATCCTTTCCCCTGTTAACTTATTTTTCAAGGTCAATTTCCATTTTCATCTATACACTCTGAAAAGTCTTATTCCTATCACTAACTATGAGAAA
ATTACATAAGAAAATATATTTTTACTGTGAATATCGATTATTAAAGGAATTAATGTCACACTATCCGACATGGATACGTGACAGTGCTGCTCTTTCTGTCGACAGAAATT
TATAGATGTTCGGAGCTTGCCAAACTGTTGGAGAAGTATTAAAAGGGATACATAGCCGAATAACCAAATTTATACTTTAGCCTCTCATGATGTGTGTATATATTTATAAA
TAAACAGATCACAATTATGGAGCGCATATATATTATATATGAACAGATCACAACTAGGGAGGGCAGCGTTGGGATGTATTCTTTGTCCTTGACCTTGTAGGAATAAAATA
TTCTGCCTTTCCTTAGTTTTTCAATTTTCATATTTATTTTGTATAATTAAATATTGATGAATTCTTTAAATATATATTTTAGAGTTAAAATAAAAAATAAAAAAAATGAT
AGAAAACCGATCACAGCCGTTGATTTTTACTTACTCATAATGAGTGGGGGAGATTCCCTGGAAAATGACCCTGGTTTTACTGGGATCAACATTCATGTCCACCCACCTAG
CCCAGGTGGTCAGCCCTTGATAAAATGCCTCCAGACGGTCCATATCTTTCTTCACTGTATTTCCTATTTGCACATAATCCCACCTGAACAACACTCACAACCGTCAGATT
TTACCATCTTCTACCGCATAAGCATGATCTAGAAAAAAGAGATTGATCATACGGCTGGGATCTTCCAGTGTGGGTCCACCAGTGCCAGGAGTTGAAAATTAACACATCCA
TCCCCTTCCACACACTTCCTCCTTCAATGGAATCAAGCTTTAGTACTCTTCCAACTCTCTCTTTCACTATGTCCACAAGGTAGTGCGTTCTGTGAAGAAGCAAGCTCACT
CCATAGTCCTGCACACACCGGATTGTTCCCTCTTATTCAATGGTTTGGTCCATTATTAGAAATAATTGTACAGAGAAAGTTATTATTGGTGTTTACTATTATTTGTATTT
GAATATGGTTTTCTTATATTATTTGTTTTGTACTGTTTCATTTTGCCGAGCTCAGCATAACCCAAAAAACAGCTTTTGGAAAAGTGAATAAAGAACAGAGAATAAAGCAA
GCAGACAATTGGTATACTGTGAAGTGGGAGATTTTGAACGAAGAAATTTGAGGAAAAGGAAAAGGAGAAGGAACTCGCCTGGAAAATCACAGTGGAGAGTGATTCCCTCA
TGACTATGGAGGTTTTGGCTTTTGGTGCCGACGCCAGAATCATACATGTTAAAGATTGCCACATGTTCAGACTCAGTGAGTCACCTACAAACATTATCTTCTTCCCTTTC
CACCTCCTCAGAAGCTCCAACCCATCAAACCTACAAATAAATGAACACATTTTCTCAAAGAGAGAGAGAAGGCTGCTCTATTTTGGCACCCACATTCTCAGAGAAGGAGA
GCATGAGATTTGCATAACAATGTCGGATTATGAGGGAAAGCAAGAAGCCTCAAATGGGAGATCAGAGTGGAAAACTTTACAAACAAAAGAAAGTGAAGGAGAAGTACCTT
GGAAGATCACATAAGTCAGGTTTCCAGGTGTACTTGAGGTAGGATCGATCAGGCCTGCCATACTTTTGGCAATTAAACTCAGGGTCAATGAAGGGACAGCTTGAAGATTC
ATAGAGAGGAAAAGAAGCATCAAAAACCCATTTGCCCTGAAACAAATTGCACTCCCTCTCTTGCTTTCTACTTCCCACATTGCTTATGTTATAGAAGTCTTCAGCTTTTG
CAGTTTCCAGTAAAAACAGTAACACAACTTGTAAAAGCAGAAGGGACAGAGGAACTCTGAATCGAAAACCCATCTCAGATTTTGGGCTGTGACTGAAAAGAAAGAGGAGT
TTGGGGAATAGAAGTAGACAGAAGCAAGTGGGTCTATATAGAGACAAACAAGAGTGAGAGAGAGAAAGCCCTGATTTTTATTATTGTTATTTGTTTAATCTTTTAAAATT
TTTATGACATGAGCTGGGAATTGGGTAATCTTGGCTGGATTGATTAGGGGAAATGCAGGGGATTGTGGGGAGTTGAAATTGAGAGTTGTGTTTTAGTATTGCCTGGTTTG
CACATCGAGGGCCATATCATATTGCCCACCAAGCAATGTTGCACATGGCTCAATGATGTGTTGGTTATCTAACTACC
Protein sequence
FSKRDIYFIKFVPTFGSRSRVRTWICSFLTHLATQMTGSIVVNVPFPWAIELRFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPM
KLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCY
IVNAAVYVGY