CuGenDBv2

Lsi05G011590 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G011590
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein LOL1-like
Genome location: chr05:19476239..19480156
RNA-Seq Expression: Lsi05G011590
Synteny: Lsi05G011590
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
InterPro domains:
    IPR005735 - Zinc finger, LSD1-type
    IPR036280 - Multiheme cytochrome superfamily
    IPR040319 - LSD1-like
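
The genome location field above packs a chromosome name and two coordinates into one string. As a minimal sketch of how such a string could be split and the locus span computed, the Python below assumes the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr05:19476239..19480156")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.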


Homology
GenBank top hits (e value, % identity, alignment)
TXG54321.1 hypothetical protein EZV62_019577 [Acer yangbiense] (e value: 6.9e-69; identity: 91.55%)
Query:  PVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA
        PVPLAPYPTPP PYT PAN  QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA
Subjt:  PVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA

Query:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSK
        HV+CGNCRMLLMYQYGARSVKCAVCNFVTSVGV++N+R R++
Subjt:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSK

XP_004148490.1 protein LOL1 [Cucumis sativus] (e value: 8.2e-70; identity: 92.96%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYTQP+NATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG++ +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

XP_022952054.1 protein LOL1-like [Cucurbita moschata] (e value: 1.8e-69; identity: 92.25%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PY+QPANATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTS+G+  +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

XP_038887481.1 protein LOL1 isoform X1 [Benincasa hispida] (e value: 6.2e-78; identity: 94.27%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVT VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSKSLIAKAIVKHGCP
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGV+VNE DRSKSLIAK IV    P
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSKSLIAKAIVKHGCP

XP_038887483.1 protein LOL1 isoform X2 [Benincasa hispida] (e value: 2.8e-70; identity: 93.66%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVT VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG++ +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LEE4 Uncharacterized protein (e value: 3.9e-70; identity: 92.96%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYTQP+NATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG++ +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

A0A1S3CQ35 protein LOL1 (e value: 3.3e-69; identity: 92.25%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYTQP+NATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVP PGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG++ +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

A0A5C7HBP8 Uncharacterized protein (e value: 3.3e-69; identity: 91.55%)
Query:  PVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA
        PVPLAPYPTPP PYT PAN  QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA
Subjt:  PVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA

Query:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSK
        HV+CGNCRMLLMYQYGARSVKCAVCNFVTSVGV++N+R R++
Subjt:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSK

A0A6J1ET02 protein LOL1 (e value: 1.3e-68; identity: 91.55%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYT PAN  QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG++ +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

A0A6J1GKN9 protein LOL1-like (e value: 8.8e-70; identity: 92.25%)
Query:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PY+QPANATQSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTS+G+  +  D+
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDR

SwissProt top hits (e value, % identity, alignment)
P94077 Protein LSD1 (e value: 3.4e-34; identity: 58.06%)
Query:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML
        Q QLVC GCRNLL+YP GA++V CA+CN +  VPPP    +MA ++CGGC T+LMY RGA+SV+CSCC T NL    +NQVAH        ++CG+CR  
Subjt:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML

Query:  LMYQYGARSVKCAVCNFVTSVGVN
        LMY YGA SVKCAVC FVT+V ++
Subjt:  LMYQYGARSVKCAVCNFVTSVGVN

Q0J7V9 Protein LSD1 (e value: 3.1e-64; identity: 88.81%)
Query:  PVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA
        PVPLAPYPTPP P+T P N  QSQLVCSGCRNLL+YP GATSVCCAVC+ VTAVP PGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA+EANQVA
Subjt:  PVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA

Query:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVN
        HV+CGNCRMLLMYQYGARSVKCAVCNFVTSVG +
Subjt:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVN

Q2QMB3 Protein LOL2 (e value: 1.0e-38; identity: 63.16%)
Query:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTA-VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARSV
        QSQ+VC GCRN+L+YP GA SVCCAVC+AV++  P PG ++A L+CGGC TLLMY R ATSV+CSCC TVNL    + +AH++CG C+ +LMY YGA SV
Subjt:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTA-VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARSV

Query:  KCAVCNFVTSVGVN
        KCA+CNF+T+ G+N
Subjt:  KCAVCNFVTSVGVN

Q6ASS2 Protein LOL3 (e value: 2.2e-38; identity: 63.48%)
Query:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPG--TEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARS
        QSQ+VC GCR++L YP GA SVCCA+C A+T VPPP    EMA L+CGGC TLLMY R A +V+CSCC TVNL    N +AHVSCG CR  LMY YGA S
Subjt:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPG--TEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARS

Query:  VKCAVCNFVTSVGVN
        VKCA+C+++T+ G+N
Subjt:  VKCAVCNFVTSVGVN

Q93ZB1 Protein LOL1 (e value: 3.1e-64; identity: 84.46%)
Query:  PVPLAPYPTPPTP------YTQPANAT---QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
        PVPLAPYPTPP P       T PAN +   QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
Subjt:  PVPLAPYPTPPTP------YTQPANAT---QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN

Query:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERD
        LALEANQVAHV+CGNC MLLMYQYGARSVKCAVCNFVTSVG + +  D
Subjt:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERD

Arabidopsis top hits (e value, % identity, alignment)
AT1G32540.1 lsd one like 1 (e value: 4.8e-60; identity: 92.37%)
Query:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARSVK
        QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHV+CGNC MLLMYQYGARSVK
Subjt:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARSVK

Query:  CAVCNFVTSVGVNVNERD
        CAVCNFVTSVG + +  D
Subjt:  CAVCNFVTSVGVNVNERD

AT1G32540.2 lsd one like 1 (e value: 2.2e-65; identity: 84.46%)
Query:  PVPLAPYPTPPTP------YTQPANAT---QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
        PVPLAPYPTPP P       T PAN +   QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
Subjt:  PVPLAPYPTPPTP------YTQPANAT---QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN

Query:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERD
        LALEANQVAHV+CGNC MLLMYQYGARSVKCAVCNFVTSVG + +  D
Subjt:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERD

AT1G32540.3 lsd one like 1 (e value: 2.2e-65; identity: 84.46%)
Query:  PVPLAPYPTPPTP------YTQPANAT---QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
        PVPLAPYPTPP P       T PAN +   QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
Subjt:  PVPLAPYPTPPTP------YTQPANAT---QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN

Query:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERD
        LALEANQVAHV+CGNC MLLMYQYGARSVKCAVCNFVTSVG + +  D
Subjt:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERD

AT4G20380.1 LSD1 zinc finger family protein (e value: 2.4e-35; identity: 58.06%)
Query:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML
        Q QLVC GCRNLL+YP GA++V CA+CN +  VPPP    +MA ++CGGC T+LMY RGA+SV+CSCC T NL    +NQVAH        ++CG+CR  
Subjt:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML

Query:  LMYQYGARSVKCAVCNFVTSVGVN
        LMY YGA SVKCAVC FVT+V ++
Subjt:  LMYQYGARSVKCAVCNFVTSVGVN

AT4G20380.2 LSD1 zinc finger family protein (e value: 2.4e-35; identity: 58.06%)
Query:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML
        Q QLVC GCRNLL+YP GA++V CA+CN +  VPPP    +MA ++CGGC T+LMY RGA+SV+CSCC T NL    +NQVAH        ++CG+CR  
Subjt:  QSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML

Query:  LMYQYGARSVKCAVCNFVTSVGVN
        LMY YGA SVKCAVC FVT+V ++
Subjt:  LMYQYGARSVKCAVCNFVTSVGVN
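
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch of how a percent identity can be recovered from such a block, the Python below counts the letters in the middle line and divides by the alignment length. The function name `percent_identity` is hypothetical, and the sketch assumes the query and middle-line strings have already been stripped of their `Query:` prefix and leading indentation and joined across wrapped rows.

```python
def percent_identity(query, midline):
    """Percent identity from a BLAST-style query line and its midline.

    Letters in the midline are identities; '+' and spaces are not.
    The alignment length is taken from the query line (gap characters
    included), because trailing spaces in the midline are easily lost.
    """
    if not query:
        raise ValueError("empty alignment")
    # Restore trailing spaces that a scraper may have stripped.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Second row of one of the alignments above, used on its own here.
query = "HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSK"
midline = "HV+CGNCRMLLMYQYGARSVKCAVCNFVTSVGV++N+R R++"
print(round(percent_identity(query, midline), 2))
```

The figure for a whole hit comes from concatenating every row of its alignment before calling the function; a single row, as in the usage line, only describes that row.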


Sequences
CDS sequence
ATGCCACCAGTTCCTCTTGCACCATATCCAACCCCTCCAACACCTTATACACAACCTGCTAATGCTACACAGAGCCAACTTGTGTGCTCAGGATGCAGAAACCTCTTAGT
TTATCCAGTTGGGGCAACCTCTGTTTGCTGTGCAGTTTGTAATGCAGTTACTGCTGTACCGCCTCCCGGCACAGAAATGGCACAACTGGTGTGTGGAGGCTGCCATACTC
TTCTCATGTACATCCGCGGTGCCACGAGTGTACAATGTTCTTGTTGCCACACTGTCAACTTAGCTTTGGAAGCGAATCAGGTGGCGCACGTTAGCTGCGGGAACTGCAGG
ATGCTACTGATGTATCAATATGGAGCAAGGTCGGTGAAATGTGCAGTATGCAATTTTGTGACATCAGTTGGGGTAAATGTCAACGAGCGTGATCGATCAAAAAGCTTAAT
AGCTAAAGCTATCGTAAAACATGGTTGTCCTAAATTCGACAACTGCGGCTTCGTCCAATATTTGCAACGCCTTCCAGGGTGTTTGTCCGACTCGACAAACGGAGGTTCCT
CTTCATATTCACTTGTATAG
mRNA sequence
CCCTTCCCCTTCCCTTCTTCTATCTACTGCAAGTTGCAAATGTATGAATTTTGCCCTCTCACTACTACTCATTGCCTTTTCAAGCTTTCTTCCCCAGCTTCTTCCTTGCT
GGGTTTCTCCATTTTTGGCTCATCTCCACTAACTTGTTTCCATGGGGGAAGTAGCTGGTTTTGTCTGTTGCTAAGGCAGTGAGCTTTTAAGAGCTTGGTTAGAAGACAGA
GAGAAAGAGAGAGAGAAAGAAAAGTAGGGAAGTTTTTTCCTTTATCAAGCTGCAAAATGCCACCAGTTCCTCTTGCACCATATCCAACCCCTCCAACACCTTATACACAA
CCTGCTAATGCTACACAGAGCCAACTTGTGTGCTCAGGATGCAGAAACCTCTTAGTTTATCCAGTTGGGGCAACCTCTGTTTGCTGTGCAGTTTGTAATGCAGTTACTGC
TGTACCGCCTCCCGGCACAGAAATGGCACAACTGGTGTGTGGAGGCTGCCATACTCTTCTCATGTACATCCGCGGTGCCACGAGTGTACAATGTTCTTGTTGCCACACTG
TCAACTTAGCTTTGGAAGCGAATCAGGTGGCGCACGTTAGCTGCGGGAACTGCAGGATGCTACTGATGTATCAATATGGAGCAAGGTCGGTGAAATGTGCAGTATGCAAT
TTTGTGACATCAGTTGGGGTAAATGTCAACGAGCGTGATCGATCAAAAAGCTTAATAGCTAAAGCTATCGTAAAACATGGTTGTCCTAAATTCGACAACTGCGGCTTCGT
CCAATATTTGCAACGCCTTCCAGGGTGTTTGTCCGACTCGACAAACGGAGGTTCCTCTTCATATTCACTTGTATAGAGCTGTGTTTCTGCGCGTGAACTTTAAATGTTGC
TTTTTTTCAGTCTTCACAGTGCTATATATATATGTGTGTGTGGTCTGAGAGACGACTTTTTATATTGTGCTGAGTGTAATATAACTGTTGGATAAAGATGGATTTTGGTT
TTGGTTTTTGTTTTTGTTTTTTTTCCTTTTTTTTCCCTCTCTTTGTGAACTGCATAAAAAGATTTATATCAGATATATCAGCCTTTTGAAATGTCAACCTACTTGCTTCA
TTGTCACTACATAATGCTTTGTAGTCAATTCAGATTCTAAATGAACTGATCCATGTGAGTGATGTTCAATTGGGACCAAGAGAGAGAGAGATAGATAGAGAATATGACTA
AGCTACAACCTTTGCATTTAAAGAAACATTCATCAACTCTTCAAGATTTGAGAATAATACTCATTTAAGTTCTAATCTGCCTTGCTCTTAAAAGTTATTTTGTAGTCTTT
AGAAAGATTTAACTTCTTGTTGCAACCTTGTGCTTTATGCGTTCAACGCAAGGTGGTGAACGCCTGAGGGTCAACTCAACGCCTGATGTGCGATGAACTGGACTGGTACT
ATGCAGTGAACTGGATGGGTACTATGCGGTGAACTGGACGCCTACTATGCGGTGAACCTGCAAAATAAAAGCTCGTTATGGCTGAGTCGCGGAGCTCGCTCTCTTTAAGA
CGTTTTGCGGCTCTGCCTAAGGGGCTTGCAGTTCTGACTTAATGCCATGTCGTTCCCCGGATAAAACAGCCCAGATAACACTGTCTCGGTCGAAACTCGCACAAGAATCG
ATCGGTACTCACACTCTTTCTAAATTGCTCGGTATTCAGAGATGAAAATATTTTTGCGCTTTTTATTTCAACTGAAGATGGCAGGAATTTATAGTATGCGAGGCTCAAGG
GTCGCATGAGGGACGAGCGCTCGTCCACACCACGTGCGCCGAAAAAGACGGCGCGGACGCGAAGACTACGCGACGCTACGCGAACCATGCAGTGAGGGGCAAGACCGTAA
TTTCTAAAAAAAATTATAATTCAAAAATTATCTTTATTGCACCTTACCCCACCATTGCCTATCTCCTCACCCTTTCCAAAACATGGGTACCCCTTGGGACCCAAGTCCAT
CACTAAAGGGAATGGTAAATAGAGTGAGATCCCAATGCTAAACTAAGCAATATGGGATTCTCACTTTTCCAAAAAATTCATTTTTTTTCTCTTTTGGTTTTAAAAACTAT
TTTTTCAATCTTTTTCACCAACACTTCTAAAACCAACACGATAGCCTTTGTAGATTTCTTTTTCTTGGGTAAGAAACCGAAATTTCATTGAGAAAAATGAAAGAAAAACA
AGAAGGATAAAAAAAAAGTGAGTGGGAGCGCATATCTAACTGAAAAAAGAACTCCACACTCCAACCAATATCATAATTATGAAAGAGCTTATTGACTAACGCTCACAAGG
AGGCATTAAATAAACCTAGCCCCCTCCTGGACGTCCTTAATCAGTCTTTCCACCTCTCTAAAAATCATATCATGTTATTTCTCTCAATTCAGATAGGCTTTGTAGATCGA
AAACCATTGTATAAAAAACGTTAAGGACCAAAATGAAAAGTAACTAGGCGGTCCCTGTGACGGGTCGAGTAAGCCTAAAGAGCTATGAATAAAGAGATGAAAGCATCCAA
AGTTAATAACACAATGATCATTATCTCTCTGCTCATCCACATGGTGAAAGTTATTATCTCATCAGATGAATTGGCCCCCTGTGGGCGCTGTGATTGTCCAATGTTTTGCC
TTCAAATTTTAAGCAAATCAGTACTTATTTGATACTCCAACCAAACTGCTTCTACCCCAC
Protein sequence
MPPVPLAPYPTPPTPYTQPANATQSQLVCSGCRNLLVYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCR
MLLMYQYGARSVKCAVCNFVTSVGVNVNERDRSKSLIAKAIVKHGCPKFDNCGFVQYLQRLPGCLSDSTNGGSSSYSLV
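
The CDS and protein sequences listed above are related by translation. A minimal sketch of that step, assuming the standard genetic code (the record does not name a translation table) and using the hypothetical helper name `translate`, could look like this:

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop line breaks and spaces so wrapped sequence text can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCCACCAGTTCCTCTTGCACCA"))
```

Pasting the full CDS, line breaks included, into `translate` gives a string that can be compared against the listed protein sequence as a consistency check.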