CuGenDBv2

Tan0005967 (gene) of Snake gourd v1 genome

Gene ID: Tan0005967
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Integral membrane protein hemolysin-III homolog
Genome location: LG02:65048391..65051912
RNA-Seq Expression: Tan0005967
Synteny: Tan0005967
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo] | e value: 3.6e-113 | % identity: 85.49
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGM SCET+L +YQ+KQSPIA KKVALRDVQNDNRS +YNYPETSCSLGGKL+NGSK  GSKRSNPTCSPSSAI QSFKG+GVNEH +YA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRA GGSTSCAPAFSS LA SPMAFSPVRSS PIFTEKPGNFLAV GSNL GIP GLEIL S DSNGITDE+RTERLFNLQKLLKH D+SDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KG IE+LHGLPPSELS  AINLEKRSMHLSVEE     GKEIQRMKALNILSNLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

XP_022945996.1 uncharacterized protein LOC111450215 [Cucurbita moschata] | e value: 1.0e-120 | % identity: 90.98
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGMGSCETHLSMY++KQSPIAMKKVALRDVQNDNRS +YNYPETSCSLGGKLVNGSK  GSKRSNPTCSPSSAI QSFKG+GVNEHIIYA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRALGGSTSCAPAFSS LAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNL GI  GLEILHSVDSNGITDE+RTERLFNLQ LLKHCDESDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KGFIELLHGLPPSELS+LAINLEK+SMHLSVEE     GKEIQRMKALNIL NLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

XP_022999755.1 uncharacterized protein LOC111494010 [Cucurbita maxima] | e value: 1.2e-119 | % identity: 89.41
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLN+GGMGSCETHLSMY++KQSPIAMKKVALRDVQNDNRS +YNYPETSCSLGGKLVNGSK  GSKRSNPTCSP+SAI QSFKG+GVNEHIIYA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRALGGSTSCAP FSS LAASP+AFSPVRSSLPIFTEKPGNFLAV GSNL GIP GLEILHSVDSNGITDE+RTERLFNLQ LLKHCDESDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KGFIELLHGLPPSELS+LAINLEK+SMHLSVEE     GKEIQRMKALNIL NLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

XP_023545576.1 uncharacterized protein LOC111804962 [Cucurbita pepo subsp. pepo] | e value: 1.2e-119 | % identity: 90.2
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGMGSCETHLSMY++KQSPIAMKKVALRDVQNDNRS +YNY ETSCSLGGKLVNGSK  GSKRSNPTCSPSSAI QSFKG+GV EHIIYA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRALGGSTSCAPAFSS LAASPMAFSPVRSSLPIFTEKPGNFLAV GSNL GIP GLEILHSVDSNGITDE+RTERLFNLQ LLKHCDESDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KGFIELLHGLPPSELS+LAINLEK+SMHLSVEE     GKEIQRMKALNIL NLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida] | e value: 1.3e-115 | % identity: 88.24
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGMGSCETHLSMYQ KQSPIA KKVALRDVQNDNRS MYNYPETSCSLGGKLVNGSK  GSKRSNPT SPSSAI QSFKG+GVNEH IYASG
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRA GGSTSCAPAFSSLLAASPMA SPVRSSLPIFTEKPGNFLAVAGS+L GIP G EIL SVDSNGITDE+RTERLFNLQK LKHCDESD+
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KG IE LHGLPPSELS  AINLEKRSM+LSVEE     GKEIQRMKALNIL NLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CME9 uncharacterized protein LOC103502625 isoform X1 | e value: 4.3e-112 | % identity: 85.16
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGM SCET+L +YQ+KQSPIA KKVALRDVQNDNRS +YNYPETSCSLGGKL+NGSK  GSKRSNPTCSPSSAI QSFKG+GVNEH +YA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRA GGSTSCAPAFSS LA SPMAFSPVRSS PIFTEKPGNFLAV GSNL GIP GLEIL S DSNGITDE+RTERLFNLQKLLKH D+SDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIEL-LHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KG IE+ LHGLPPSELS  AINLEKRSMHLSVEE     GKEIQRMKALNILSNLQ
Subjt:  KGFIEL-LHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 | e value: 1.7e-113 | % identity: 85.49
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGM SCET+L +YQ+KQSPIA KKVALRDVQNDNRS +YNYPETSCSLGGKL+NGSK  GSKRSNPTCSPSSAI QSFKG+GVNEH +YA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRA GGSTSCAPAFSS LA SPMAFSPVRSS PIFTEKPGNFLAV GSNL GIP GLEIL S DSNGITDE+RTERLFNLQKLLKH D+SDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KG IE+LHGLPPSELS  AINLEKRSMHLSVEE     GKEIQRMKALNILSNLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

A0A6J1DG32 uncharacterized protein LOC111020739 | e value: 1.9e-107 | % identity: 83.2
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSN-PTCSPSSAIRQSFKGVGVNEHIIYAS
        MIDSKL +GGMGS ETHLSMYQNKQSPIA+KKVALRDV NDNR+ MYNYPE SCSLGGKLVNGSK  G+KRSN PTCSP S   QSFKGVGVNEHI YAS
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSN-PTCSPSSAIRQSFKGVGVNEHIIYAS

Query:  GEVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRALG  TSCAPAFSSLLAASPM FSPVR+SLPIFTEK GNFL V+GSNL  IP   E+LHSV SNGITDERRTERLFNLQKLLKHCDESD
Subjt:  GEVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        QKG+IE LHGLPPSELSKLAINLEKRSM+LSVEE     GKEIQRMKALNIL N+Q
Subjt:  QKGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

A0A6J1G2G3 uncharacterized protein LOC111450215 | e value: 5.1e-121 | % identity: 90.98
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLNSGGMGSCETHLSMY++KQSPIAMKKVALRDVQNDNRS +YNYPETSCSLGGKLVNGSK  GSKRSNPTCSPSSAI QSFKG+GVNEHIIYA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRALGGSTSCAPAFSS LAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNL GI  GLEILHSVDSNGITDE+RTERLFNLQ LLKHCDESDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KGFIELLHGLPPSELS+LAINLEK+SMHLSVEE     GKEIQRMKALNIL NLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

A0A6J1KE02 uncharacterized protein LOC111494010 | e value: 5.6e-120 | % identity: 89.41
Query:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG
        MIDSKLN+GGMGSCETHLSMY++KQSPIAMKKVALRDVQNDNRS +YNYPETSCSLGGKLVNGSK  GSKRSNPTCSP+SAI QSFKG+GVNEHIIYA+G
Subjt:  MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSK-QGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASG

Query:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ
        EVDVKPGKKRALGGSTSCAP FSS LAASP+AFSPVRSSLPIFTEKPGNFLAV GSNL GIP GLEILHSVDSNGITDE+RTERLFNLQ LLKHCDESDQ
Subjt:  EVDVKPGKKRALGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
        KGFIELLHGLPPSELS+LAINLEK+SMHLSVEE     GKEIQRMKALNIL NLQ
Subjt:  KGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G45250.1 Integral membrane protein hemolysin-III homolog | e value: 9.7e-08 | % identity: 43.48
Query:  ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNIL
        ER  +LQ LL   ++SD+   +++L  L  +ELSK A++LEKRS+  S+EE      +E+QR+ ALN+L
Subjt:  ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNIL

AT2G45250.2 Integral membrane protein hemolysin-III homolog | e value: 3.5e-05 | % identity: 45.1
Query:  ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEE
        ER  +LQ LL   ++SD+   +++L  L  +ELSK A++LEKRS+  S+EE
Subjt:  ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEE

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) | e value: 1.7e-07 | % identity: 43.48
Query:  ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNIL
        ER  +LQ LL   ++SD+   +++L  L  +ELSK A++LEKRS+  S+EE      +E+QR+ ALN+L
Subjt:  ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEEGLCSSGKEIQRMKALNIL
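The % identity figures reported for the hits above can be roughly cross-checked against the middle line of each alignment. The sketch below assumes the usual BLAST-style convention, in which the middle line shows a letter where query and subject are identical, a "+" for a conservative substitution, and a space otherwise. The function name percent_identity is hypothetical and is not part of CuGenDB. For alignments split across several blocks, the query and middle lines would need to be concatenated first.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity from one BLAST-style alignment block.

    Assumes the midline carries a letter at every identical column,
    '+' at conservative substitutions, and a space elsewhere.
    """
    length = len(query)  # alignment length, gap columns included
    if length == 0:
        raise ValueError("empty alignment")
    # Trailing spaces of the midline are often lost when copying text,
    # so pad (or trim) it to the alignment length before counting.
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# Query and middle line of the AT2G45250.2 alignment listed above.
query = "ERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAINLEKRSMHLSVEE"
midline = "ER  +LQ LL   ++SD+   +++L  L  +ELSK A++LEKRS+  S+EE"
print(round(percent_identity(query, midline), 2))  # 45.1
```

For this block the result is 23 identical columns out of 51, which agrees with the 45.1 listed for AT2G45250.2.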


Sequences
CDS sequence
ATGATTGATAGCAAGTTGAACAGCGGTGGGATGGGTAGCTGTGAGACACATTTGTCTATGTATCAGAACAAACAGTCACCAATTGCAATGAAGAAGGTTGCTTTAAGGGA
TGTGCAGAATGATAATAGGAGTGCCATGTATAACTATCCCGAAACTTCTTGTTCTTTGGGTGGAAAACTTGTGAATGGGAGTAAGCAAGGAAGTAAGAGATCCAATCCTA
CGTGCTCACCGAGCTCTGCAATCCGTCAATCCTTCAAGGGGGTTGGTGTAAATGAGCACATCATTTATGCCAGTGGAGAAGTTGATGTGAAGCCCGGAAAAAAAAGAGCA
CTGGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTCTTCTTGCAGCTTCTCCGATGGCATTTTCACCGGTTAGGTCTTCACTTCCCATTTTCACAGAAAAGCCTGG
TAATTTTCTGGCAGTGGCTGGATCCAATCTTCCTGGAATCCCTTCTGGTTTGGAGATTCTTCACTCTGTTGATTCAAATGGGATTACTGATGAGCGGAGAACAGAACGTT
TATTCAATCTGCAGAAGCTCCTAAAACATTGTGATGAGTCGGATCAAAAAGGCTTCATCGAGTTGCTTCATGGTTTACCTCCATCTGAGCTCAGCAAACTTGCCATTAAT
CTGGAAAAGAGATCCATGCATCTGTCAGTAGAGGAAGGTTTATGTTCTTCAGGGAAAGAGATCCAACGGATGAAGGCTCTGAATATTCTGAGCAACCTTCAGTGA
mRNA sequence
CCCATCTTCTTCGAGCCAAAGTTTTCCCTTTTGAACTCGCCTCTCCTTCCCTCTCTCTCTCTCTCTCGTCGGTTGCAGACCAAGCTTCCCGCTCAGATGGACTTTTCTAG
GAGATAATTTGAAATTGTTGCTTGGTTGAAGCTCATCAAGTTTTCCCAATGATTGATAGCAAGTTGAACAGCGGTGGGATGGGTAGCTGTGAGACACATTTGTCTATGTA
TCAGAACAAACAGTCACCAATTGCAATGAAGAAGGTTGCTTTAAGGGATGTGCAGAATGATAATAGGAGTGCCATGTATAACTATCCCGAAACTTCTTGTTCTTTGGGTG
GAAAACTTGTGAATGGGAGTAAGCAAGGAAGTAAGAGATCCAATCCTACGTGCTCACCGAGCTCTGCAATCCGTCAATCCTTCAAGGGGGTTGGTGTAAATGAGCACATC
ATTTATGCCAGTGGAGAAGTTGATGTGAAGCCCGGAAAAAAAAGAGCACTGGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTCTTCTTGCAGCTTCTCCGATGGC
ATTTTCACCGGTTAGGTCTTCACTTCCCATTTTCACAGAAAAGCCTGGTAATTTTCTGGCAGTGGCTGGATCCAATCTTCCTGGAATCCCTTCTGGTTTGGAGATTCTTC
ACTCTGTTGATTCAAATGGGATTACTGATGAGCGGAGAACAGAACGTTTATTCAATCTGCAGAAGCTCCTAAAACATTGTGATGAGTCGGATCAAAAAGGCTTCATCGAG
TTGCTTCATGGTTTACCTCCATCTGAGCTCAGCAAACTTGCCATTAATCTGGAAAAGAGATCCATGCATCTGTCAGTAGAGGAAGGTTTATGTTCTTCAGGGAAAGAGAT
CCAACGGATGAAGGCTCTGAATATTCTGAGCAACCTTCAGTGACATCTGAAAGAAGTTTTGTTAGATTAATGAATAACGATTTTGGATACGAACAAGATGAGAAGGCCAT
TTGCCATTTCTGCCATGATTGCACGAGAGGAGGATGAACTGGTTGACTGGTTGCTGATATTGAGCAAATCGGCATTCCTTGAGAGCTGGCAGGGTAGTCTTTCTATGGAG
CTTTCAGTTATGAAGTTTGATTTGCGCAACAAAAGAAGTTGAGTGTTGCTGTTGCCAGCATGAAACTCTTCCTGCAGGTTCGTTTGGATGAGAAAAAAAAAAAAACCTAA
AAATCCAATCAACTTTGAACAATTTGGAAGGGCTTTTGAAAGTTTTCTTTTCAGAAGAGTTTAGTGAGAGATGTGTTGAGTTGATTACACTTAGAAATCTGGCATCCCTA
TCAGTCCAATTACCATGAAATTGTTGTCTCAATCATCCATTTTTCTTTTAGAAGTTTATAACAGACGTTAGTTTATTAACCCAACTTTACTCTTTCAACTCAACTCTATC
CACCAAACGTCCACTCACTCTATGTTCTCTATGTTTTTTGTTCTTATTATATTTGTGTTCTAATTATAGATGTTTTTTTTTCTAAGTTCAATACATGTGGG
Protein sequence
MIDSKLNSGGMGSCETHLSMYQNKQSPIAMKKVALRDVQNDNRSAMYNYPETSCSLGGKLVNGSKQGSKRSNPTCSPSSAIRQSFKGVGVNEHIIYASGEVDVKPGKKRA
LGGSTSCAPAFSSLLAASPMAFSPVRSSLPIFTEKPGNFLAVAGSNLPGIPSGLEILHSVDSNGITDERRTERLFNLQKLLKHCDESDQKGFIELLHGLPPSELSKLAIN
LEKRSMHLSVEEGLCSSGKEIQRMKALNILSNLQ
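
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names CODON_TABLE and translate_cds are hypothetical helpers, not part of any CuGenDB tooling.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate_cds("ATGATTGATAGCAAGTTGAACAGCGGTGGG"))  # MIDSKLNSGG
```

Passing the full CDS, with its line breaks, to translate_cds should give a string that can be compared directly with the protein sequence listed above.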