CuGenDBv2

Sed0021101 (gene) of Chayote v1 genome

Gene ID: Sed0021101
Organism: Sechium edule (Chayote v1)
Description: Mit_KHE1 domain-containing protein
Genome location: LG04:37061811..37066082
RNA-Seq Expression: Sed0021101
Synteny: Sed0021101
Gene Ontology terms:
GO:0006813 - potassium ion transport (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:1902600 - proton transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031305 - integral component of mitochondrial inner membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR018786 - Protein of unknown function DUF2343


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593588.1 hypothetical protein SDJN03_13064, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.1e-87 | % identity: 72.96
Query:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI
        K+   SS    +++ S SS ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEIAYPSSLNPRLVRRRLRHI
Subjt:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI

Query:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS
        ALRGTAIHKK+FYGSV LLPL S F +LPLPNI FFWVLFR YS+WRALKGSEILLQLVSDRSYSCNSSTD +KT N VQ++ GS L++QPSKELDKF+S
Subjt:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS

Query:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        +MEG     T IK+ICK+FDLNMN+VLKYKD M
Subjt:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

KAG7025932.1 hypothetical protein SDJN02_12430 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.7e-86 | % identity: 72.53
Query:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI
        K+   SS    +++ S SS ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEIAYPSSLNPRLVRRRLRHI
Subjt:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI

Query:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS
        ALRGTAIHKK+FYGSV LLPL S F +LPLPNI FFWVLFR YS+WRALKGSEILLQLVSDRSYSCNSSTD + T N VQ++ GS L++QPSKELDKF+S
Subjt:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS

Query:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        +MEG     T IK+ICK+FDLNMN+VLKYKD M
Subjt:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

XP_022964630.1 uncharacterized protein LOC111464584 [Cucurbita moschata] | e value: 1.1e-87 | % identity: 73.39
Query:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI
        K+   SS    +++ S SS ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEIAYPSSLNPRLVRRRLRHI
Subjt:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI

Query:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS
        ALRGTAIHKK+FYGSV LLPL S F +LPLPNI FFWVLFR YS+WRALKGSEILLQLVSDRSYSCNSSTD +KT N VQQ+ GS L++QPSKELDKF+S
Subjt:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS

Query:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        +MEG     T IK+ICK+FDLNMN+VLKYKD M
Subjt:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

XP_023000133.1 uncharacterized protein LOC111494421 [Cucurbita maxima] | e value: 1.1e-87 | % identity: 73.39
Query:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI
        K+   SS    +++ S SS ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEIAYPSSLNPRLVRRRLRHI
Subjt:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI

Query:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS
        ALRGTAIHKK+FYGSV LLPL S F +LPLPNI FFWVLFR YS+WRALKGSEILLQLVSDRSYSCNSSTD +KT N VQQ+ GS L++QPSKELDKF+S
Subjt:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS

Query:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        +MEG     T IK+ICK+FDLNMN+VLKYKD M
Subjt:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

XP_038896482.1 uncharacterized protein LOC120084733 isoform X1 [Benincasa hispida] | e value: 1.6e-86 | % identity: 75.77
Query:  SLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTA
        S S SS + +VSS +  V  D   ++MNK+WT +EKAP GSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEI YPSSLNPRLVRRRLRHIALRG A
Subjt:  SLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTA

Query:  IHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEGES
        IH+KYFYGSV +LPLTS F VLPLPNI FFWVLFR YS+WRAL+GSE LLQLVSDRSY C+SS+DD KTE+KVQQY GS L+M+PSKELDKF+S+ME  S
Subjt:  IHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEGES

Query:  GDVTAIKEICKMFDLNMNDVLKYKDTM
        GD+TAIK+ICKMFDLNM +VLKYKDT+
Subjt:  GDVTAIKEICKMFDLNMNDVLKYKDTM

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CGF0 uncharacterized protein LOC103500639 | e value: 5.1e-83 | % identity: 70.09
Query:  KLLPISSLSRS-SMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRH
        K+   SS S+S +++ + +S ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEI YPSSLNPRLVRRRLRH
Subjt:  KLLPISSLSRS-SMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRH

Query:  IALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFV
        IA RGT IH+KYFYGSV L PL S F +LPLPNI FFWVLFR YS+WRAL+GSE LLQLVSDRSY  NSS+D  K E+KVQQYSG  L+MQPSKELDKF+
Subjt:  IALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFV

Query:  SRMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        S+ME  SGD+TAI++ICKMFDLN+ +VLKYKD +
Subjt:  SRMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

A0A5D3BYB6 Uncharacterized protein | e value: 8.6e-83 | % identity: 70.61
Query:  SSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGT
        SS    +++ + +S ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVE  YPSSLNPRLVRRRLRHIA RGT
Subjt:  SSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGT

Query:  AIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEGE
         IH+KYFYGSV LLPL S F +LPLPNI FFWVLFR YS+WRAL+GSE LLQLVSDRSY  NSS+D  K E+KVQQYSG  L+MQPSKELDKF+S+ME  
Subjt:  AIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEGE

Query:  SGDVTAIKEICKMFDLNMNDVLKYKDTM
        SGD+TAI++ICKMFDLN+ +VLKYKD +
Subjt:  SGDVTAIKEICKMFDLNMNDVLKYKDTM

A0A6J1CZZ6 uncharacterized protein LOC111016104 isoform X1 | e value: 1.2e-84 | % identity: 72.93
Query:  SSLSRSSMNFSVSSRSSG-VRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRG
        SS S+S  + S +S S+  +  D   ++MNK+WT +E AP+GSFKNKLHGIG KLLSR+KP EIFLKSITK+V +VEI YPSSLNPRLVRRRLRHIALRG
Subjt:  SSLSRSSMNFSVSSRSSG-VRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRG

Query:  TAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEG
        TAIHKKYFYGSV LLP+TS F VLPLPNI FFWVLFR YS+WRALKGSE LLQLVSDRSYS NSSTD +KT +KV+Q+ GS L++QPSKELDK +S+MEG
Subjt:  TAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEG

Query:  ESGDVTAIKEICKMFDLNMNDVLKYKDTM
          GD TAIK+ICK+FDLN N+VLKYKD M
Subjt:  ESGDVTAIKEICKMFDLNMNDVLKYKDTM

A0A6J1HLD1 uncharacterized protein LOC111464584 | e value: 5.2e-88 | % identity: 73.39
Query:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI
        K+   SS    +++ S SS ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEIAYPSSLNPRLVRRRLRHI
Subjt:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI

Query:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS
        ALRGTAIHKK+FYGSV LLPL S F +LPLPNI FFWVLFR YS+WRALKGSEILLQLVSDRSYSCNSSTD +KT N VQQ+ GS L++QPSKELDKF+S
Subjt:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS

Query:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        +MEG     T IK+ICK+FDLNMN+VLKYKD M
Subjt:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

A0A6J1KLS4 uncharacterized protein LOC111494421 | e value: 5.2e-88 | % identity: 73.39
Query:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI
        K+   SS    +++ S SS ++ +  D   ++MNK+WT +EKAPDGSFKNKLHGIG KLLSR+KPSEIFLKSITK+V SVEIAYPSSLNPRLVRRRLRHI
Subjt:  KLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHI

Query:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS
        ALRGTAIHKK+FYGSV LLPL S F +LPLPNI FFWVLFR YS+WRALKGSEILLQLVSDRSYSCNSSTD +KT N VQQ+ GS L++QPSKELDKF+S
Subjt:  ALRGTAIHKKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVS

Query:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM
        +MEG     T IK+ICK+FDLNMN+VLKYKD M
Subjt:  RMEGESGDVTAIKEICKMFDLNMNDVLKYKDTM

SwissProt top hits (e value, % identity, alignment)
O13942 Uncharacterized protein C23H3.12c | e value: 5.7e-07 | % identity: 31.91
Query:  VYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSI--------TKEVKSVEIAYPSSLNPRLVRRRL-RHIALRGTAIHKKYFYGSVWLLP
        +Y+   SW+    A     K K+  +G+++L      E FL++I        T+  +++ I +P +L    +   L R   L+ T  H  Y  G++  LP
Subjt:  VYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSI--------TKEVKSVEIAYPSSLNPRLVRRRL-RHIALRGTAIHKKYFYGSVWLLP

Query:  LTSVFIVLPL-PNISFFWVLFRAYSNWRALKGSEILLQLVS
        LT  FI++PL PNI  F++ +RAY N+RA++GS  L +++S
Subjt:  LTSVFIVLPL-PNISFFWVLFRAYSNWRALKGSEILLQLVS

Arabidopsis top hits (e value, % identity, alignment)
AT1G53760.1 unknown protein | e value: 5.4e-61 | % identity: 59.22
Query:  QMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTAIHKKYFYGSVWLLPLTSVFIVLPLP
        +MNK+W  +EKAPDGS KNK+HG G KLL+R+KPSEIFLKSI+KEV SV++ YP SL+PRLVRRRLRHIA+ GT +HKKY  GSV LLPLTS F+VLPLP
Subjt:  QMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTAIHKKYFYGSVWLLPLTSVFIVLPLP

Query:  NISFFWVLFRAYSNWRALKGSEILLQLVS-----DRSYSCNSSTDDSKTENKVQQYSGS-VLEMQPSKELDKFVSRMEGESGDVTAIKEICKMFDLNMND
        NI FFWVLFR YS+WRAL+GSE LL+L+S     D+  S + + +   +  K +Q S S    + PS+EL + +     E  D   I EICK FDLN ND
Subjt:  NISFFWVLFRAYSNWRALKGSEILLQLVS-----DRSYSCNSSTDDSKTENKVQQYSGS-VLEMQPSKELDKFVSRMEGESGDVTAIKEICKMFDLNMND

Query:  VLKYKD
        VLKY++
Subjt:  VLKYKD

AT1G53760.2 unknown protein | e value: 2.6e-47 | % identity: 73.95
Query:  QMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTAIHKKYFYGSVWLLPLTSVFIVLPLP
        +MNK+W  +EKAPDGS KNK+HG G KLL+R+KPSEIFLKSI+KEV SV++ YP SL+PRLVRRRLRHIA+ GT +HKKY  GSV LLPLTS F+VLPLP
Subjt:  QMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTAIHKKYFYGSVWLLPLTSVFIVLPLP

Query:  NISFFWVLFRAYSNWRALK
        NI FFWVLFR YS+WRAL+
Subjt:  NISFFWVLFRAYSNWRALK


Sequences
CDS sequence
ATGCCGAAATTGTTGCCGATTTCGTCTCTTTCAAGGTCCTCCATGAACTTTTCTGTTTCGTCTCGTTCTTCTGGTGTTCGTTTTGACTGTTCTGTGTATCAGATGAACAA
ATCGTGGACTACTATGGAAAAAGCTCCCGATGGATCGTTTAAGAACAAACTTCACGGGATCGGATCGAAGCTTTTGTCTCGAATTAAGCCGTCTGAGATATTCTTGAAGT
CGATAACTAAAGAGGTTAAGAGTGTCGAAATAGCGTATCCATCGAGTTTGAATCCGCGGCTCGTTCGCAGGAGGTTACGGCATATTGCCCTCAGGGGAACTGCCATCCAC
AAGAAATACTTCTATGGTTCAGTTTGGTTGCTTCCATTGACAAGTGTATTTATTGTCTTACCTCTGCCAAACATTTCTTTCTTTTGGGTTTTGTTTCGCGCATATTCTAA
TTGGCGAGCTCTGAAGGGGAGTGAAATACTCCTTCAGTTGGTCTCTGATAGATCTTATTCGTGCAACTCATCCACTGATGATAGTAAAACAGAGAACAAAGTCCAGCAGT
ACTCAGGTTCAGTCCTGGAGATGCAGCCATCAAAGGAACTCGACAAATTCGTAAGCCGGATGGAGGGAGAATCCGGTGATGTAACGGCAATTAAAGAGATCTGCAAGATG
TTTGATTTAAACATGAATGATGTTTTGAAGTACAAGGATACAATGTGA
mRNA sequence
ATCCATAAGTATGTACAAAATGGTAGCCAAACAAGTAGTAGCATGGCAGCAACCGTCTTCTTAAAGTTCCTTTCATTTCGTTCAGTTCTCAGTTCTCTCAATATCTACCA
CATTCGCTCCAGAGAAATCAGGATCTTCCTAAACTTGCTTATGGATCGTTGTTTATATGCGAAAATAGGGATTTCGTGTGCGAATCGACCATTTTCTCCATTGCAATTTC
GCGACGATGAGAGCCAAATTGGTAGCGTTTTCGTTCGGAGCAACAAGAAATTGGTGTTTCACCCGATCCGTCGACCCCGCCGCTTCTTCTTCCGCTCAGACTCCATCCAA
TTTGAAAGATCTCTGGACCAAAATTTCTTCGTCTTCCTCAAAATCCAGCAGTTCCAGTTGCAGCAATGCCGAAATTGTTGCCGATTTCGTCTCTTTCAAGGTCCTCCATG
AACTTTTCTGTTTCGTCTCGTTCTTCTGGTGTTCGTTTTGACTGTTCTGTGTATCAGATGAACAAATCGTGGACTACTATGGAAAAAGCTCCCGATGGATCGTTTAAGAA
CAAACTTCACGGGATCGGATCGAAGCTTTTGTCTCGAATTAAGCCGTCTGAGATATTCTTGAAGTCGATAACTAAAGAGGTTAAGAGTGTCGAAATAGCGTATCCATCGA
GTTTGAATCCGCGGCTCGTTCGCAGGAGGTTACGGCATATTGCCCTCAGGGGAACTGCCATCCACAAGAAATACTTCTATGGTTCAGTTTGGTTGCTTCCATTGACAAGT
GTATTTATTGTCTTACCTCTGCCAAACATTTCTTTCTTTTGGGTTTTGTTTCGCGCATATTCTAATTGGCGAGCTCTGAAGGGGAGTGAAATACTCCTTCAGTTGGTCTC
TGATAGATCTTATTCGTGCAACTCATCCACTGATGATAGTAAAACAGAGAACAAAGTCCAGCAGTACTCAGGTTCAGTCCTGGAGATGCAGCCATCAAAGGAACTCGACA
AATTCGTAAGCCGGATGGAGGGAGAATCCGGTGATGTAACGGCAATTAAAGAGATCTGCAAGATGTTTGATTTAAACATGAATGATGTTTTGAAGTACAAGGATACAATG
TGATCGACAAGATCCCACTTTCAGATGGGGAAAAACAGACAAAGGAGATGGCCCAAGAAAACCATGGAATTAAAAGATGAAACAAACAACCTGATTCCCTTGTTTACATT
GTTCAAAATCGACCGTTATAAAAGGGGGGAATATATACATACATAGCCTGATTCTCCTGTTGGGTGGCTTTTTGTGTTGGCCAGGCACAGCAATCGGATCTGTCTCTACA
AACTCAAAGACTGCACAAAGTTCTTGGCACCAGATCCCATACTAGTTAGTGGTTACTAGTTTTTTTGGGTATTATTCATTTGGTTTAAAATATTAAGCTATTTTATTTAT
TATTTGCTACATGTTCTTAGTAATTTAGTAATTTTCTATTTTTCCTAAAAAAAGAAAAAGAAAAGGTAATTTTCTATTTGATAACCATTTTTGTTTTCTTTTTGGCTCGA
GTATTTGAGTTTTCCTAGTAACCCCAAGTTCCTAAATTCTGAATGTTGTAAGTACATTATTAAAACCTAATGGGGGTGTTTGTTTGACATATTTAGG
Protein sequence
MPKLLPISSLSRSSMNFSVSSRSSGVRFDCSVYQMNKSWTTMEKAPDGSFKNKLHGIGSKLLSRIKPSEIFLKSITKEVKSVEIAYPSSLNPRLVRRRLRHIALRGTAIH
KKYFYGSVWLLPLTSVFIVLPLPNISFFWVLFRAYSNWRALKGSEILLQLVSDRSYSCNSSTDDSKTENKVQQYSGSVLEMQPSKELDKFVSRMEGESGDVTAIKEICKM
FDLNMNDVLKYKDTM
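
The CDS and protein sequences above can be related by codon translation. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration. It translates the first 30 bases and the final 48 bases of the CDS listed above, which should correspond to the start and end of the listed protein sequence.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
cds_start = "ATGCCGAAATTGTTGCCGATTTCGTCTCTT"
# Final 48 bases of the CDS sequence listed above (in frame, ends with TGA).
cds_end = "TTTGATTTAAACATGAATGATGTTTTGAAGTACAAGGATACAATGTGA"

print(translate(cds_start))  # MPKLLPISSL
print(translate(cds_end))    # FDLNMNDVLKYKDTM
```

Both fragments are in frame because the second starts at base 661 of the 708-base CDS, a position that falls on a codon boundary.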