CuGenDBv2

Clc06G06300 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G06300
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat (PPR) superfamily protein
Genome location: ClcChr06:6521599..6522797
RNA-Seq Expression: Clc06G06300
Synteny: Clc06G06300
Gene Ontology terms: NA
InterPro domains: NA
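The genome location above follows a chrom:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split and the locus span computed as follows. The function name parse_location is hypothetical and is not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string and compute the span length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return chrom, start, end, end - start + 1


print(parse_location("ClcChr06:6521599..6522797"))
# → ('ClcChr06', 6521599, 6522797, 1199)
```

Under that assumption the locus spans 1199 bp on ClcChr06.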


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004140154.1 uncharacterized protein LOC101216953 [Cucumis sativus] (e value 6.7e-113, identity 97.11%)
Query:  MVSIGSGG--SVQAS-PPSSP-PSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIGSGG  SVQAS PPSSP P+TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGG--SVQAS-PPSSP-PSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        EGKEGTTG+KEKNVKRIKK LERTRSASIRIRPMINVPICTQ
Subjt:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

XP_008449565.1 PREDICTED: uncharacterized protein LOC103491410 [Cucumis melo] (e value 1.5e-112, identity 95.87%)
Query:  MVSIGSGG--SVQASPP--SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GG  SVQASPP  S PP+TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGG--SVQASPP--SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        EGKEGTTG+KEKNV+RIKK LERTRSASIRIRPMINVPICTQ
Subjt:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

XP_022138704.1 uncharacterized protein LOC111009802 [Momordica charantia] (e value 4.5e-109, identity 91.36%)
Query:  MVSIGSGGSVQASPPS-----SPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFE
        MVSIG   SVQA+PPS      PP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKLAWSADFEFLSNKVSSHSM TADELFFE
Subjt:  MVSIGSGGSVQASPPS-----SPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFE

Query:  GKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT
        GKLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATT
Subjt:  GKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT

Query:  EEGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        EEGKEGT G+KEKN+KRIKKGLERTRSASIRIRPMINVPICTQ
Subjt:  EEGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

XP_022930387.1 uncharacterized protein LOC111436849 [Cucurbita moschata] (e value 1.3e-103, identity 91.18%)
Query:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLP
        MVSI    SVQ SPPSS  S EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEGKLLP
Subjt:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLP

Query:  FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKE
        FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   E
Subjt:  FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKE

Query:  GTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        GTTG+KEKN+KRIKKGLERTRSASIRIRPMINVPICTQ
Subjt:  GTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

XP_038876013.1 uncharacterized protein LOC120068347 [Benincasa hispida] (e value 1.1e-115, identity 96.67%)
Query:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL
        MVSIGSGGSVQASPPSSP  TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEIC+RQK  KDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL
Subjt:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL

Query:  LPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEG
        LPFWQMQQAERLNKISLKSPKDVDEED+VEIEVNKEA+NKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEG
Subjt:  LPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEG

Query:  KEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        KEGTTG+KEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
Subjt:  KEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BMY8 uncharacterized protein LOC103491410 (e value 7.2e-113, identity 95.87%)
Query:  MVSIGSGG--SVQASPP--SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GG  SVQASPP  S PP+TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGG--SVQASPP--SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        EGKEGTTG+KEKNV+RIKK LERTRSASIRIRPMINVPICTQ
Subjt:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

A0A5D3DC08 SEY1 (e value 7.2e-113, identity 95.87%)
Query:  MVSIGSGG--SVQASPP--SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GG  SVQASPP  S PP+TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGG--SVQASPP--SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        EGKEGTTG+KEKNV+RIKK LERTRSASIRIRPMINVPICTQ
Subjt:  EGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

A0A6J1CAV2 uncharacterized protein LOC111009802 (e value 2.2e-109, identity 91.36%)
Query:  MVSIGSGGSVQASPPS-----SPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFE
        MVSIG   SVQA+PPS      PP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKLAWSADFEFLSNKVSSHSM TADELFFE
Subjt:  MVSIGSGGSVQASPPS-----SPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFE

Query:  GKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT
        GKLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATT
Subjt:  GKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT

Query:  EEGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        EEGKEGT G+KEKN+KRIKKGLERTRSASIRIRPMINVPICTQ
Subjt:  EEGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

A0A6J1EQC3 uncharacterized protein LOC111436849 (e value 6.1e-104, identity 91.18%)
Query:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLP
        MVSI    SVQ SPPSS  S EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEGKLLP
Subjt:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLP

Query:  FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKE
        FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   E
Subjt:  FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKE

Query:  GTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        GTTG+KEKN+KRIKKGLERTRSASIRIRPMINVPICTQ
Subjt:  GTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

A0A6J1KCD5 uncharacterized protein LOC111494331 (e value 6.1e-104, identity 91.18%)
Query:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLP
        MVSI    SVQ SPPSS  S EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEGKLLP
Subjt:  MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLP

Query:  FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKE
        FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   E
Subjt:  FWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKE

Query:  GTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ
        GTTG+KEKN+KRIKKGLERTRSASIRIRPMINVPICTQ
Subjt:  GTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQ

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G05980.1 unknown protein (e value 4.1e-44, identity 52.54%)
Query:  SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKIS
        + PP   P   PRISFSS+  D  +FI ITP         +C+      S K+   +DFEFLS++ VS   M+TADELF EGKLLPFWQ++ +E+L  I+
Subjt:  SSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSPKDVDEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKEG
        LK+ ++ +E +  ++EV K+       +N+V WF+D+DPSPRPPKCTVLWKELLRLKKQR  S+ SP       S S SSS+SSS S+ DAA  EE    
Subjt:  LKSPKDVDEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKEG

Query:  TTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICT
            KEK  KR KKGLERTRSAS+RIRPMI+VPICT
Subjt:  TTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICT

AT3G12970.1 unknown protein (e value 3.8e-05, identity 33.91%)
Query:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS
        DFEFL       +M++ADELF +GKL+P        + + ++    K +       ++  +  E +++  +D    SPR P+CTV W+ELL LK+   + 
Subjt:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS

Query:  ALSPSSSSSSSSSSS
          + +SSSS  SSSS
Subjt:  ALSPSSSSSSSSSSS

AT5G19340.1 unknown protein (e value 2.3e-42, identity 46.39%)
Query:  ASPPSSPPSTEPNSSPRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAE
        A+   + PST   + PRISFS++      + +FI I P   +     I  R++KD+S   A   DFEFLS      +M++ADELF EGKLLPFWQ++ +E
Subjt:  ASPPSSPPSTEPNSSPRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAE

Query:  RLNKISLKSPKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSS
        +L  ++LK   +V +E+     VN+E                   N+ +WFLDDDPSPRPPKCTVLWKELLRLKKQR +         S+LSPSSSSSS+
Subjt:  RLNKISLKSPKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSS

Query:  SSSSRSMADAATTEEGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQFDVTCRM
        SSSS S+ DA   EE        +EK  KR KKGLERTRS ++RIRPMI+VP+CT    + R+
Subjt:  SSSSRSMADAATTEEGKEGTTGSKEKNVKRIKKGLERTRSASIRIRPMINVPICTQFDVTCRM


Sequences
CDS sequence
ATGGTTTCAATAGGCAGTGGTGGTAGTGTTCAAGCATCACCGCCATCATCACCACCATCCACAGAACCAAATTCCAGTCCTAGGATATCTTTCTCTTCTGAATTT
CTTGATGAAAGCAACTTCATTTCCATTACCCCAAATTCCCAGATAGAGAGAGATCAAGAAATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGCATGG
AGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGTCACTCCATGATTACAGCCGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCCTTTTGGCAAATGCAG
CAAGCTGAGCGGCTTAACAAAATCAGTCTGAAATCTCCAAAAGATGTAGATGAAGAAGACTTGGTGGAAATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAAT
TGGTTTCTCGACGACGACCCGTCTCCGAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTTTTGAGGTTGAAGAAGCAACGCGCTTCATCTGCGCTATCGCCA
TCTTCTTCTTCATCCTCGTCATCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGAAAGGAAGGGACAACAGGAAGCAAAGAGAAGAACGTA
AAGAGGATAAAGAAGGGTCTGGAAAGGACAAGATCAGCAAGTATTAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGTTCGACGTCACATGCCGGATG
CAGCAGCCTCGGATCAGTGGGTCACCCACATTTCAGTACAATACAATAGCAATTTTCAGAATGGAACCTGGGTCCAAAATTATCGTCAATTGTTAA
mRNA sequence
ATGGTTTCAATAGGCAGTGGTGGTAGTGTTCAAGCATCACCGCCATCATCACCACCATCCACAGAACCAAATTCCAGTCCTAGGATATCTTTCTCTTCTGAATTT
CTTGATGAAAGCAACTTCATTTCCATTACCCCAAATTCCCAGATAGAGAGAGATCAAGAAATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGCATGG
AGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGTCACTCCATGATTACAGCCGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCCTTTTGGCAAATGCAG
CAAGCTGAGCGGCTTAACAAAATCAGTCTGAAATCTCCAAAAGATGTAGATGAAGAAGACTTGGTGGAAATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAAT
TGGTTTCTCGACGACGACCCGTCTCCGAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTTTTGAGGTTGAAGAAGCAACGCGCTTCATCTGCGCTATCGCCA
TCTTCTTCTTCATCCTCGTCATCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGAAAGGAAGGGACAACAGGAAGCAAAGAGAAGAACGTA
AAGAGGATAAAGAAGGGTCTGGAAAGGACAAGATCAGCAAGTATTAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGTTCGACGTCACATGCCGGATG
CAGCAGCCTCGGATCAGTGGGTCACCCACATTTCAGTACAATACAATAGCAATTTTCAGAATGGAACCTGGGTCCAAAATTATCGTCAATTGTTAA
Protein sequence
MVSIGSGGSVQASPPSSPPSTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQ
QAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGSKEKNV
KRIKKGLERTRSASIRIRPMINVPICTQFDVTCRMQQPRISGSPTFQYNTIAIFRMEPGSKIIVNC
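
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own tooling: the helper name translate is hypothetical, and the example inputs are the first 30 and last 15 nucleotides of the CDS listed in this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS in this record.
print(translate("ATGGTTTCAATAGGCAGTGGTGGTAGTGTT"))  # → MVSIGSGGSV
# Last 15 nt of the CDS in this record (ends with the TAA stop codon).
print(translate("ATCGTCAATTGTTAA"))  # → IVNC
```

Both fragments agree with the start (MVSIGSGGSV) and end (IVNC) of the protein sequence listed above. Passing the full CDS, with line breaks removed, should reproduce the whole protein if the record is internally consistent.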