CuGenDBv2

Spg038075 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg038075
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: PH domain-containing protein
Genome location: scaffold12:40111035..40120007
RNA-Seq Expression: Spg038075
Synteny: Spg038075
Gene Ontology terms:
GO:0016043 - cellular component organization (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031090 - organelle membrane (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR031642 - VPS13, repeated coiled region
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
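
The genome location in this record is written as scaffold:start..end. A minimal Python sketch for splitting that string into its parts is given here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold12:40111035..40120007")
# Assumption: coordinates are 1-based and inclusive at both ends.
span = end - start + 1
print(scaffold, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` line needs to change.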


Homology
GenBank top hits    e value    %identity    Alignment
KAA0038522.1 hypothetical protein E6C27_scaffold92G00460 [Cucumis melo var. makuwa]    4.0e-81    87.57
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDDLENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VTEINI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ EDN+NSDV Q WNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

KAE8651464.1 hypothetical protein Csa_002584 [Cucumis sativus]    2.6e-80    86.44
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD ENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VT+INI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ ED++NSDV QLWNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

XP_008465914.1 PREDICTED: uncharacterized protein LOC103503494 isoform X1 [Cucumis melo]    4.0e-81    87.57
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDDLENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VTEINI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ EDN+NSDV Q WNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

XP_008465915.1 PREDICTED: uncharacterized protein LOC103503494 isoform X2 [Cucumis melo]    4.0e-81    87.57
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDDLENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VTEINI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ EDN+NSDV Q WNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

XP_011652678.1 uncharacterized protein LOC101212417 [Cucumis sativus]    2.6e-80    86.44
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD ENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VT+INI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ ED++NSDV QLWNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

TrEMBL top hits    e value    %identity    Alignment
A0A0A0LEI9 PH domain-containing protein    1.3e-80    86.44
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD ENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VT+INI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ ED++NSDV QLWNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

A0A1S3CPY5 uncharacterized protein LOC103503494 isoform X1    1.9e-81    87.57
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDDLENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VTEINI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ EDN+NSDV Q WNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

A0A1S3CQB7 uncharacterized protein LOC103503494 isoform X2    1.9e-81    87.57
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDDLENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VTEINI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ EDN+NSDV Q WNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

A0A5A7T4X4 PH domain-containing protein    1.9e-81    87.57
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDDLENVS QELDMYLQFDVVLSDV+AFLVDGDY+WNQIFGKDT KS  VTEINI+PVIDKCG+ILKLQQIRLENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ EDN+NSDV Q WNQADF G LSVLIRKGVGNREAEWQ+RYCCLVGPYLYLIESP SKSY  YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

A0A6J1DYN3 uncharacterized protein LOC111024248    4.0e-79    85.31
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD +NVSPQELDMYL FDVVLSDV+AFLVDGDYSWNQIFGKDTDKSSH T INILP+IDKCGVILKLQQIR ENPSYPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK
        RLLKILKIFQ ED++NSD LQ WNQADF G LSVL RKGVGNREA WQ++YCCLVGPYLYLIESP SKSY+ YL  +
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYLRFK

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT1G48090.1 calcium-dependent lipid-binding family protein    3.4e-22    32.22
Query:  QELDMYLQFDVVLSDVAAFLVD--GDYSWNQIFGKDTDKSSHVTEI-----NILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYHRL
        Q  ++Y +F +   D+AAF  D   D     +  +D      ++ I     N+  +ID+CG+ + + QI++ +PSYPSTR+++++P++  HFSP RY R+
Subjt:  QELDMYLQFDVVLSDVAAFLVD--GDYSWNQIFGKDTDKSSHVTEI-----NILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYHRL

Query:  LKILKIFQDEDNSNS--------DVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYL
        +++  I      + S        D +Q W+  D      +L+ KG+GN  A WQ     L G YLY  ES +S  Y+ YL
Subjt:  LKILKIFQDEDNSNS--------DVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHYL

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    4.2e-12    26.97
Query:  IFKLGNDSQVVFWHDFWIGDLPFY-LKFPRLFRIASLP-NAPVNDLWNGETFSWNISFRRLLKEEEVLEFQQLMGILSDAIISDLSDSRTWSLE---SSG
        + ++G+     FWHD WIG  P   +  P   R   LP +A V D   G   SW I+  R  +   +++ + L+      +     DS  W  +    S 
Subjt:  IFKLGNDSQVVFWHDFWIGDLPFY-LKFPRLFRIASLP-NAPVNDLWNGETFSWNISFRRLLKEEEVLEFQQLMGILSDAIISDLSDSRTWSLE---SSG

Query:  LFTVKPLYNHLAASS---DMHKDVFRALWKTKCPKRINILCWIMIFGSLNSSEVLQRMLPSHVLSPSICPLCSSASESLQHLFFDCVFSCQCWGKFMSIF
         F+    ++ L   S     HK V+   +K   PK    +CW++ +  L++ + LQ      +  P+ C LC++  +S  HLFF+C FS   W  F +  
Subjt:  LFTVKPLYNHLAASS---DMHKDVFRALWKTKCPKRINILCWIMIFGSLNSSEVLQRMLPSHVLSPSICPLCSSASESLQHLFFDCVFSCQCWGKFMSIF

Query:  K----------LHWVFDPSFKENVLQLLIGPVIQPASKLLWLNDVKAILSEIWFERNQRVFQGTSLS
                   L+W+  PS ++N+  ++         +L + + V A    IW ERNQR+  G S S
Subjt:  K----------LHWVFDPSFKENVLQLLIGPVIQPASKLLWLNDVKAILSEIWFERNQRVFQGTSLS

AT4G17140.1 pleckstrin homology (PH) domain-containing protein    1.2e-59    62.43
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD ++   +E+DMYLQFD+VLSDV+A LVDGDYSW Q+  K    S   + +  LPVIDKCGV+LKLQQIR  NP+YPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHY
        RL+++ +IFQ +D+ +S +L+ W +ADF G LS+L  KG   REA WQ+RY CLVGP++Y++ESP SKSYK Y
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHY

AT4G17140.2 pleckstrin homology (PH) domain-containing protein    1.2e-59    62.43
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD ++   +E+DMYLQFD+VLSDV+A LVDGDYSW Q+  K    S   + +  LPVIDKCGV+LKLQQIR  NP+YPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHY
        RL+++ +IFQ +D+ +S +L+ W +ADF G LS+L  KG   REA WQ+RY CLVGP++Y++ESP SKSYK Y
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHY

AT4G17140.3 pleckstrin homology (PH) domain-containing protein    1.2e-59    62.43
Query:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH
        QDD ++   +E+DMYLQFD+VLSDV+A LVDGDYSW Q+  K    S   + +  LPVIDKCGV+LKLQQIR  NP+YPSTRLAVRLPSL FHFSPARYH
Subjt:  QDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVIDKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYH

Query:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHY
        RL+++ +IFQ +D+ +S +L+ W +ADF G LS+L  KG   REA WQ+RY CLVGP++Y++ESP SKSYK Y
Subjt:  RLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKSYKHY


Sequences
CDS sequence
ATGCACTGCCGTGGGGAGAAAGAAAATAGCTTTGGCGTGGAGAAAGGAAATGGCGTTGGCACTTGGCAGACGGAAGAAGAAGACAGAAGGAAGGCGTGGGCAAAATATGT
GCATGCAATTTTGATTATTGGTCTTATACATGATTTATGTCTACAGGATGACCTTGAAAACGTGTCTCCTCAAGAGTTGGATATGTATTTGCAATTTGATGTGGTCTTGA
GTGATGTCGCTGCCTTTTTGGTTGATGGTGATTACAGCTGGAATCAAATTTTTGGCAAAGATACTGATAAATCTTCCCATGTAACAGAAATTAATATTTTGCCTGTTATT
GACAAGTGTGGGGTTATCTTAAAGCTGCAACAGATTCGATTGGAGAATCCGTCTTACCCTTCAACAAGACTTGCAGTGCGATTGCCTTCTTTAGTGTTTCACTTTTCACC
AGCAAGGTATCACCGACTGTTAAAAATTCTGAAGATCTTTCAGGATGAGGATAATTCAAATTCAGATGTTCTTCAACTGTGGAATCAAGCTGACTTTGTGGGGCGGCTAT
CTGTTCTTATTAGGAAGGGTGTTGGAAACAGGGAAGCTGAATGGCAGCAGCGATACTGTTGTCTAGTAGGGCCCTATCTCTACTTAATTGAAAGCCCAAGATCCAAATCT
TACAAGCACTATCTCAGGTTCAAATACTCTCTAATGCAAATTAGATCATTATCGGGCAAGGCTTATGATGAATGCGTGATTCGTCTAGTTAAATGGAGGGTCTGTATTGA
GAAAGGAGGCCTTAACAAAAATAAAATTTCTAAAGGTCATGTTTCTTCTAATTCTTATGCTGAAGTTGTTAGGCGTGGTGTTTTGATGAAGAAATCATTCTCCTTGAAAA
ATTCAGTCAGAAATGATAAGCTTGTTAATAAGGAAGCTTACTGGGTTCAAAAGAACTGGGATGTGCTGAAAATAGATTTGGAAAGCTCTCGTGTTGTTTCTAGATTGACG
ACCCATTATTCTTGGAAGGAGGTTAAGCTTGTCCTTGAGGATTTCTTTAAATCTTCAGTCTTGATCAATCATTTTATGGATGATAAAGCTTTGATTCAAGTGGCTGATTG
TAGTTTGGATCCTTCTGTGAATGGTAAGTGGAAGCAATTCGGGAACCTTCATTTGAAATTGGAATTTTGGTCCTCTGATCTTCATTCCCAGCCAAAATTTATAAAAAGTT
ATGGAGGATGGATTGCAATCAAAAACCTACCTTTGGATTTGTGGCACCGGGACTCCTTTGAAGCTATTGGAAAAAACCTTGATGTTGTGGAAAGGATGAATGAATTGCCT
TCTTGTTCTCGGTCTCAGGGAAAAATTAATGAGGTGTTGGGTTCTCCAAAGGGTGCTTTGTTGCATGATGAGGGCATTAATAACATTGGTTGGGTGAAAAATCAATTTGT
CAAAGGAATTGCTTGTTCTTCCAACCCTAAAGTTCATTCTTCTATAGATTCAGATGATGAGTCTTCGGCTGGTTTGAGTAGTGTTGATTCTGGGTCTTTGATTGCTGAAG
AAGATTGTGTTGGGCTCCTCCAAAATGATCAAATTGATGAGTCTTTGGCTTCTTTTTTTTCCAGGGGAGGATATTGGTTTATACAATCAAATTCAAGATACCTTGCTGAA
ACCAAACAATCGAAAATTGATTTAAACTTCATTAAATCTTTATGGAGTTCAAAGGAAATTGGATGGACTTTTGTGGAAGCTTATGGGAAATCAGGAGGACTTCTTATTAT
GTGGGATGAGAGCAAATTATCAGTGCTGGAATTCTTAAAGGGTGGTTATACTCTTTCAATTAAATGTTTTACTCTTTGTAAAAAAGTTTGTTGGGTCACCAATGTTTATG
GTCTGAATGACTACAAGGAAAGGAGATTCTTATGGCCTGAATTGCGTTCCCTCTCTTACTATTGCACGGATCCATGGTGTATTGGTGGGGACTTTAATATTACTCGATGG
GTTCATGAGAGATCTCCTATGGGAAGACAAACTAAAGAGGCTGGTTCATTTATGTGGGGACCATCTCCTTTCCGGTTTTATAACAGTTGGCTTTCTCAAATGGCTTGTGA
TAAAATTATCTTGGATTCTCTTTCGCTTGATCGCTATCAAGGATGGGCTGGTTTTGCTTTGAGCTCTAGACTCAGAAATCTGAAGGTAGCTATTAAAAAGTGGTATGCTG
ATTTTGAAGTTGGCAGGAAAAGGAAAGAGCAATGTTTGCTCTCGGAACTTGTATTTCTTTATGCAAAAGCTGAAAGTATATCTAAGGTGTGGAATCAAATTGAGAGTTTG
GCTATTTTTAAACTCGGAAATGACTCTCAAGTAGTTTTTTGGCATGACTTTTGGATTGGAGATCTTCCTTTTTATTTAAAATTTCCAAGATTATTTCGAATTGCTTCTCT
TCCAAATGCTCCCGTTAATGATCTTTGGAATGGGGAGACTTTTTCATGGAATATTTCTTTTCGTCGACTTCTTAAAGAGGAAGAAGTTCTTGAGTTTCAGCAGCTCATGG
GCATTTTAAGTGATGCCATAATTTCAGATCTTTCGGATTCCCGCACTTGGTCTTTAGAAAGTTCGGGTTTGTTCACGGTTAAACCTCTCTATAATCACTTGGCTGCTTCT
TCAGATATGCACAAGGATGTGTTTAGAGCTCTTTGGAAGACTAAATGTCCAAAGCGTATTAACATTCTTTGTTGGATCATGATCTTTGGTTCTTTAAACAGCTCGGAGGT
TCTTCAAAGGATGCTACCATCTCATGTTTTATCTCCTTCCATTTGCCCTTTATGTTCAAGTGCCAGTGAATCTTTGCAGCACCTTTTCTTTGATTGTGTTTTTTCTTGCC
AGTGTTGGGGGAAGTTTATGTCGATTTTCAAGCTTCATTGGGTTTTCGATCCGTCATTTAAAGAAAACGTGCTACAACTTTTAATTGGTCCAGTTATTCAGCCGGCTTCC
AAATTATTATGGTTGAATGATGTTAAAGCTATTTTATCAGAAATTTGGTTTGAAAGAAATCAGAGAGTTTTTCAGGGCACTTCTCTTTCTTGGTTGGATCGTTTCAACTC
AGCTCTCCTCAAAGCCTCGTCATGTTGTCCAAGCTCTTTTCAGCCTTCTCTATTGAGGATATTTGCCTTAATTGCGATGCTTTCATTTTTCCAGCCTAAGTCTTGA
mRNA sequence
ATGCACTGCCGTGGGGAGAAAGAAAATAGCTTTGGCGTGGAGAAAGGAAATGGCGTTGGCACTTGGCAGACGGAAGAAGAAGACAGAAGGAAGGCGTGGGCAAAATATGT
GCATGCAATTTTGATTATTGGTCTTATACATGATTTATGTCTACAGGATGACCTTGAAAACGTGTCTCCTCAAGAGTTGGATATGTATTTGCAATTTGATGTGGTCTTGA
GTGATGTCGCTGCCTTTTTGGTTGATGGTGATTACAGCTGGAATCAAATTTTTGGCAAAGATACTGATAAATCTTCCCATGTAACAGAAATTAATATTTTGCCTGTTATT
GACAAGTGTGGGGTTATCTTAAAGCTGCAACAGATTCGATTGGAGAATCCGTCTTACCCTTCAACAAGACTTGCAGTGCGATTGCCTTCTTTAGTGTTTCACTTTTCACC
AGCAAGGTATCACCGACTGTTAAAAATTCTGAAGATCTTTCAGGATGAGGATAATTCAAATTCAGATGTTCTTCAACTGTGGAATCAAGCTGACTTTGTGGGGCGGCTAT
CTGTTCTTATTAGGAAGGGTGTTGGAAACAGGGAAGCTGAATGGCAGCAGCGATACTGTTGTCTAGTAGGGCCCTATCTCTACTTAATTGAAAGCCCAAGATCCAAATCT
TACAAGCACTATCTCAGGTTCAAATACTCTCTAATGCAAATTAGATCATTATCGGGCAAGGCTTATGATGAATGCGTGATTCGTCTAGTTAAATGGAGGGTCTGTATTGA
GAAAGGAGGCCTTAACAAAAATAAAATTTCTAAAGGTCATGTTTCTTCTAATTCTTATGCTGAAGTTGTTAGGCGTGGTGTTTTGATGAAGAAATCATTCTCCTTGAAAA
ATTCAGTCAGAAATGATAAGCTTGTTAATAAGGAAGCTTACTGGGTTCAAAAGAACTGGGATGTGCTGAAAATAGATTTGGAAAGCTCTCGTGTTGTTTCTAGATTGACG
ACCCATTATTCTTGGAAGGAGGTTAAGCTTGTCCTTGAGGATTTCTTTAAATCTTCAGTCTTGATCAATCATTTTATGGATGATAAAGCTTTGATTCAAGTGGCTGATTG
TAGTTTGGATCCTTCTGTGAATGGTAAGTGGAAGCAATTCGGGAACCTTCATTTGAAATTGGAATTTTGGTCCTCTGATCTTCATTCCCAGCCAAAATTTATAAAAAGTT
ATGGAGGATGGATTGCAATCAAAAACCTACCTTTGGATTTGTGGCACCGGGACTCCTTTGAAGCTATTGGAAAAAACCTTGATGTTGTGGAAAGGATGAATGAATTGCCT
TCTTGTTCTCGGTCTCAGGGAAAAATTAATGAGGTGTTGGGTTCTCCAAAGGGTGCTTTGTTGCATGATGAGGGCATTAATAACATTGGTTGGGTGAAAAATCAATTTGT
CAAAGGAATTGCTTGTTCTTCCAACCCTAAAGTTCATTCTTCTATAGATTCAGATGATGAGTCTTCGGCTGGTTTGAGTAGTGTTGATTCTGGGTCTTTGATTGCTGAAG
AAGATTGTGTTGGGCTCCTCCAAAATGATCAAATTGATGAGTCTTTGGCTTCTTTTTTTTCCAGGGGAGGATATTGGTTTATACAATCAAATTCAAGATACCTTGCTGAA
ACCAAACAATCGAAAATTGATTTAAACTTCATTAAATCTTTATGGAGTTCAAAGGAAATTGGATGGACTTTTGTGGAAGCTTATGGGAAATCAGGAGGACTTCTTATTAT
GTGGGATGAGAGCAAATTATCAGTGCTGGAATTCTTAAAGGGTGGTTATACTCTTTCAATTAAATGTTTTACTCTTTGTAAAAAAGTTTGTTGGGTCACCAATGTTTATG
GTCTGAATGACTACAAGGAAAGGAGATTCTTATGGCCTGAATTGCGTTCCCTCTCTTACTATTGCACGGATCCATGGTGTATTGGTGGGGACTTTAATATTACTCGATGG
GTTCATGAGAGATCTCCTATGGGAAGACAAACTAAAGAGGCTGGTTCATTTATGTGGGGACCATCTCCTTTCCGGTTTTATAACAGTTGGCTTTCTCAAATGGCTTGTGA
TAAAATTATCTTGGATTCTCTTTCGCTTGATCGCTATCAAGGATGGGCTGGTTTTGCTTTGAGCTCTAGACTCAGAAATCTGAAGGTAGCTATTAAAAAGTGGTATGCTG
ATTTTGAAGTTGGCAGGAAAAGGAAAGAGCAATGTTTGCTCTCGGAACTTGTATTTCTTTATGCAAAAGCTGAAAGTATATCTAAGGTGTGGAATCAAATTGAGAGTTTG
GCTATTTTTAAACTCGGAAATGACTCTCAAGTAGTTTTTTGGCATGACTTTTGGATTGGAGATCTTCCTTTTTATTTAAAATTTCCAAGATTATTTCGAATTGCTTCTCT
TCCAAATGCTCCCGTTAATGATCTTTGGAATGGGGAGACTTTTTCATGGAATATTTCTTTTCGTCGACTTCTTAAAGAGGAAGAAGTTCTTGAGTTTCAGCAGCTCATGG
GCATTTTAAGTGATGCCATAATTTCAGATCTTTCGGATTCCCGCACTTGGTCTTTAGAAAGTTCGGGTTTGTTCACGGTTAAACCTCTCTATAATCACTTGGCTGCTTCT
TCAGATATGCACAAGGATGTGTTTAGAGCTCTTTGGAAGACTAAATGTCCAAAGCGTATTAACATTCTTTGTTGGATCATGATCTTTGGTTCTTTAAACAGCTCGGAGGT
TCTTCAAAGGATGCTACCATCTCATGTTTTATCTCCTTCCATTTGCCCTTTATGTTCAAGTGCCAGTGAATCTTTGCAGCACCTTTTCTTTGATTGTGTTTTTTCTTGCC
AGTGTTGGGGGAAGTTTATGTCGATTTTCAAGCTTCATTGGGTTTTCGATCCGTCATTTAAAGAAAACGTGCTACAACTTTTAATTGGTCCAGTTATTCAGCCGGCTTCC
AAATTATTATGGTTGAATGATGTTAAAGCTATTTTATCAGAAATTTGGTTTGAAAGAAATCAGAGAGTTTTTCAGGGCACTTCTCTTTCTTGGTTGGATCGTTTCAACTC
AGCTCTCCTCAAAGCCTCGTCATGTTGTCCAAGCTCTTTTCAGCCTTCTCTATTGAGGATATTTGCCTTAATTGCGATGCTTTCATTTTTCCAGCCTAAGTCTTGA
Protein sequence
MHCRGEKENSFGVEKGNGVGTWQTEEEDRRKAWAKYVHAILIIGLIHDLCLQDDLENVSPQELDMYLQFDVVLSDVAAFLVDGDYSWNQIFGKDTDKSSHVTEINILPVI
DKCGVILKLQQIRLENPSYPSTRLAVRLPSLVFHFSPARYHRLLKILKIFQDEDNSNSDVLQLWNQADFVGRLSVLIRKGVGNREAEWQQRYCCLVGPYLYLIESPRSKS
YKHYLRFKYSLMQIRSLSGKAYDECVIRLVKWRVCIEKGGLNKNKISKGHVSSNSYAEVVRRGVLMKKSFSLKNSVRNDKLVNKEAYWVQKNWDVLKIDLESSRVVSRLT
THYSWKEVKLVLEDFFKSSVLINHFMDDKALIQVADCSLDPSVNGKWKQFGNLHLKLEFWSSDLHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLDVVERMNELP
SCSRSQGKINEVLGSPKGALLHDEGINNIGWVKNQFVKGIACSSNPKVHSSIDSDDESSAGLSSVDSGSLIAEEDCVGLLQNDQIDESLASFFSRGGYWFIQSNSRYLAE
TKQSKIDLNFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSIKCFTLCKKVCWVTNVYGLNDYKERRFLWPELRSLSYYCTDPWCIGGDFNITRW
VHERSPMGRQTKEAGSFMWGPSPFRFYNSWLSQMACDKIILDSLSLDRYQGWAGFALSSRLRNLKVAIKKWYADFEVGRKRKEQCLLSELVFLYAKAESISKVWNQIESL
AIFKLGNDSQVVFWHDFWIGDLPFYLKFPRLFRIASLPNAPVNDLWNGETFSWNISFRRLLKEEEVLEFQQLMGILSDAIISDLSDSRTWSLESSGLFTVKPLYNHLAAS
SDMHKDVFRALWKTKCPKRINILCWIMIFGSLNSSEVLQRMLPSHVLSPSICPLCSSASESLQHLFFDCVFSCQCWGKFMSIFKLHWVFDPSFKENVLQLLIGPVIQPAS
KLLWLNDVKAILSEIWFERNQRVFQGTSLSWLDRFNSALLKASSCCPSSFQPSLLRIFALIAMLSFFQPKS