; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013878 (gene) of Chayote v1 genome

Gene IDSed0013878
OrganismSechium edule (Chayote v1)
Descriptionurease accessory protein G
Genome locationLG01:16670175..16679181
RNA-Seq ExpressionSed0013878
SyntenySed0013878
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573361.1 hypothetical protein SDJN03_27248, partial [Cucurbita argyrosperma subsp. sororia]1.5e-4848.92Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES ALF GG   KLP  +PS + P     S LCFN R P   P+L +       S G + NPN+RDK LARC      G+T SN  +D+LK+L+SL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS+   I++ KNEA K+ +E KY EA  H ++L +   A   YEAR+AHLQILIRLDEY KALEFLEEK NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN
        KA+  W TYL+ + NGN N  +           FL NA +LL PLLSL+S     + +  +IIP K    + ++E+VN
Subjt:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN

KAG7012525.1 hypothetical protein SDJN02_25277, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-4253.21Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES AL  GG   KLP  +PS + P     S LCFN R P   P+L +       S G + NPN+RDK LARC      G+T SN  +D+LK+L+SL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS+   I++ KNEA K+ +E KY EA  H ++L +   A   YEAR+AHLQILIRLDEY KALEFLEEK NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLDIDNGNGN
        KA+  W TYL+   GNGN
Subjt:  KAKTLWKTYLDIDNGNGN

XP_022955326.1 uncharacterized protein LOC111457322 isoform X1 [Cucurbita moschata]2.4e-4648.56Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES AL  GG   KLP  +PS + P     S LCFN R P   P+L +       S G + NPN+ DK LARC      G T SN  +D+LK+LLSL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS+   I++ KNEA K+ +EGKY EA  H ++L +   A   YEAR+AHLQILIRLDEY KALEFLEE  NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN
        KA+  W TYL+ + +GN N  +           FL NA +LL PLLSL+S     + +  NIIP K    + ++E+VN
Subjt:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN

XP_022955329.1 uncharacterized protein LOC111457322 isoform X2 [Cucurbita moschata]2.4e-4648.56Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES AL  GG   KLP  +PS + P     S LCFN R P   P+L +       S G + NPN+ DK LARC      G T SN  +D+LK+LLSL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS+   I++ KNEA K+ +EGKY EA  H ++L +   A   YEAR+AHLQILIRLDEY KALEFLEE  NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN
        KA+  W TYL+ + +GN N  +           FL NA +LL PLLSL+S     + +  NIIP K    + ++E+VN
Subjt:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN

XP_023542161.1 uncharacterized protein LOC111802127 [Cucurbita pepo subsp. pepo]4.6e-5052.63Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQL----IST--SHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES AL  GG   KLP +EPS +IP     S L FN RK F IP+L    +ST  S G +WNPN+RDK LARC      G+T SN   D+LKSLLSL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQL----IST--SHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNS---TISARKNEAFKMAMEGKYNEASHHMEHLFNGD-AHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS    I++ KNEA K+ +EGKY EA  H ++L +   A   YEAR+ HLQILIRLDEY KALEFLEEK NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNS---TISARKNEAFKMAMEGKYNEASHHMEHLFNGD-AHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVK
        KA+  W TYL+ + NGN N  +           FL NA +LL PLLSL+S     + +  NII  K
Subjt:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVK

TrEMBL top hitse value%identityAlignment
A0A0A0LUX5 Uncharacterized protein6.7e-3944.24Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS-----TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNER-QDALKSLLSLTQS
        MESI L    S+PKLP   PS  IP + S+S L FN R      QL +      S G   + N  ++ L R CGN H G+  S+ R QD LKSLLSL + 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS-----TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNER-QDALKSLLSLTQS

Query:  QEKN---STISARKNEAFKMAMEGKYNEASHHMEHLFNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDK
         + N   STI+  K+EA K+ M+GKYNEA  HME L  GD   AYEAR+AHLQILI LD+Y+KAL FLE++G+F +SK ++ RL LY+AVV+TML KDD 
Subjt:  QEKN---STISARKNEAFKMAMEGKYNEASHHMEHLFNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDK

Query:  AKTLWKTYLD-IDNGNG-----------NPSIFLKNAIALLMPLLSLQSALQAKNDVNL-NIIPVKVKHGVGVEEIVN
        A+  W  Y+D + N NG           +  I + +A  LL PLLS +   + + +  L +II  K    + ++++VN
Subjt:  AKTLWKTYLD-IDNGNG-----------NPSIFLKNAIALLMPLLSLQSALQAKNDVNL-NIIPVKVKHGVGVEEIVN

A0A1S3BA48 uncharacterized protein LOC1034876732.5e-4145.71Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLISTSH-----GFSWNPNVRDKFLARCCGNEHTGKTRSNE-RQDALKSLLSLTQS
        MESI LF   S+PKLP   PS  IP ++S+  L FN R      QL + S      G   + N  +K +AR CGN   G+  S+   Q+ LKSLLSL + 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLISTSH-----GFSWNPNVRDKFLARCCGNEHTGKTRSNE-RQDALKSLLSLTQS

Query:  QEKN---STISARKNEAFKMAMEGKYNEASHHMEHLFNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDK
         E N   STI+  K+EA K+ M+GKY+EA  HME L  GD   +YEARVAHLQILI LD+Y+KAL FLEE+GNF  SK ++ RL LY+AVV+TML KDD 
Subjt:  QEKN---STISARKNEAFKMAMEGKYNEASHHMEHLFNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDK

Query:  AKTLWKTYLDI---DNGNG-----------NPSIFLKNAIALLMPLLSLQS-ALQAKNDVNLNIIPVKVKHGVGVEEIVN
        A+  W  YL+    DN NG           +  I + NA  LL PLLSL++ A   +N    +II  K    + ++E+VN
Subjt:  AKTLWKTYLDI---DNGNG-----------NPSIFLKNAIALLMPLLSLQS-ALQAKNDVNLNIIPVKVKHGVGVEEIVN

A0A6J1GT93 uncharacterized protein LOC111457322 isoform X11.1e-4648.56Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES AL  GG   KLP  +PS + P     S LCFN R P   P+L +       S G + NPN+ DK LARC      G T SN  +D+LK+LLSL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS+   I++ KNEA K+ +EGKY EA  H ++L +   A   YEAR+AHLQILIRLDEY KALEFLEE  NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN
        KA+  W TYL+ + +GN N  +           FL NA +LL PLLSL+S     + +  NIIP K    + ++E+VN
Subjt:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN

A0A6J1GVX9 uncharacterized protein LOC111457322 isoform X21.1e-4648.56Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS
        MES AL  GG   KLP  +PS + P     S LCFN R P   P+L +       S G + NPN+ DK LARC      G T SN  +D+LK+LLSL Q 
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLIS------TSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQS

Query:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD
        +E NS+   I++ KNEA K+ +EGKY EA  H ++L +   A   YEAR+AHLQILIRLDEY KALEFLEE  NF QS  ++ARLSLY+AVVHTMLG  D
Subjt:  QEKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFN-GDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDD

Query:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN
        KA+  W TYL+ + +GN N  +           FL NA +LL PLLSL+S     + +  NIIP K    + ++E+VN
Subjt:  KAKTLWKTYLD-IDNGNGNPSI-----------FLKNAIALLMPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVN

A0A6J1K442 uncharacterized protein LOC1114904758.5e-4254.07Show/hide
Query:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKP---FLIPQLIST--SHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQSQ
        MES AL  GG   KLP  +PS + P     S LCFN R P    L   LIST  S   +WNPN RD  LARC      G+T SN  +D+LK+LLSL Q +
Subjt:  MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKP---FLIPQLIST--SHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQSQ

Query:  EKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFNGDAHE-AYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDK
        E NS+   I++ KNEA K+ +E KY E   H + L +    E  YEAR+AHLQILIRLDEY KALEFLEEK NF QSK ++A LSLY+AVVHTML K+ K
Subjt:  EKNST---ISARKNEAFKMAMEGKYNEASHHMEHLFNGDAHE-AYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDK

Query:  AKTLWKTYL
        A+  W TYL
Subjt:  AKTLWKTYL

SwissProt top hitse value%identityAlignment
E0ZS47 Urease accessory protein G8.0e-0535.71Show/hide
Query:  LDIDNGN-----GNPSIFLKNAIAL----LMPLLSLQSALQAKNDVNLN----IIPVKVKHGVGVEEIVNLIIQAWEVATGKKR
        +D+  G+     G P I   + + +    L P +    A+  ++ + +      +  +VKHGVGVEEIVN I+QAWE+ATG KR
Subjt:  LDIDNGN-----GNPSIFLKNAIAL----LMPLLSLQSALQAKNDVNLN----IIPVKVKHGVGVEEIVNLIIQAWEVATGKKR

Q6AUF3 Urease accessory protein G8.0e-0535.71Show/hide
Query:  LDIDNGN-----GNPSIFLKNAIAL----LMPLLSLQSALQAKNDVNLN----IIPVKVKHGVGVEEIVNLIIQAWEVATGKKR
        +D+  G+     G P I   + + +    L P +    A+  ++ + +      +  +VKHGVGVEEIVN I+QAWE+ATG KR
Subjt:  LDIDNGN-----GNPSIFLKNAIAL----LMPLLSLQSALQAKNDVNLN----IIPVKVKHGVGVEEIVNLIIQAWEVATGKKR

Arabidopsis top hitse value%identityAlignment
AT2G34470.1 urease accessory protein G1.4e-0470.37Show/hide
Query:  KVKHGVGVEEIVNLIIQAWEVATGKKR
        +VKHG+GVEEIVN ++ +WE ATGKKR
Subjt:  KVKHGVGVEEIVNLIIQAWEVATGKKR

AT2G34470.2 urease accessory protein G1.4e-0470.37Show/hide
Query:  KVKHGVGVEEIVNLIIQAWEVATGKKR
        +VKHG+GVEEIVN ++ +WE ATGKKR
Subjt:  KVKHGVGVEEIVNLIIQAWEVATGKKR

AT2G34540.2 unknown protein3.3e-0631.37Show/hide
Query:  ISARKNEAFKMAMEGKYNEASHHMEHL---FNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDKAKTLWK
        I + K EA +   EGK  EA   +      +  +    +  ++A ++ILI L+ Y++A E+     N   ++  D R+ LY+A+++TML KD +AK  WK
Subjt:  ISARKNEAFKMAMEGKYNEASHHMEHL---FNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDKAKTLWK

Query:  TY
         +
Subjt:  TY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCCATTGCTTTGTTTGGTGGTGGATCACTACCTAAACTCCCATTTTATGAACCTTCCTCTGCAATCCCTAAATTGGCTTCAACTTCAAGGCTCTGTTTCAACTT
CAGAAAACCATTCTTAATTCCTCAACTCATATCTACTAGTCATGGTTTCTCTTGGAATCCTAATGTACGTGACAAGTTCTTAGCTCGCTGCTGTGGAAATGAACACACTG
GCAAGACCCGCTCAAACGAGAGACAAGACGCATTGAAGTCACTGCTGAGTTTAACACAATCCCAAGAAAAGAATTCAACGATATCTGCGAGAAAGAATGAAGCATTCAAG
ATGGCGATGGAAGGGAAGTACAATGAAGCATCACATCATATGGAACATTTATTCAATGGGGATGCCCACGAGGCGTATGAGGCTCGGGTTGCACATCTCCAAATTCTTAT
ACGTCTCGATGAATACAAGAAAGCTCTAGAATTTCTGGAGGAGAAGGGTAACTTTCGTCAATCTAAACCATATGATGCAAGACTTTCTCTTTATGAGGCCGTGGTTCATA
CTATGTTAGGTAAGGATGATAAAGCTAAAACATTGTGGAAAACGTACCTTGACATTGACAATGGTAATGGAAATCCAAGTATTTTCCTGAAGAATGCTATAGCCCTATTG
ATGCCATTGTTGAGCTTACAAAGTGCATTACAAGCTAAAAATGACGTGAACCTCAACATTATTCCCGTAAAGGTGAAACACGGAGTTGGTGTCGAGGAAATCGTGAACCT
TATTATACAGGCATGGGAAGTAGCAACTGGGAAGAAACGTCATTGA
mRNA sequenceShow/hide mRNA sequence
CTTTTCTTTTCCTTGTTTTGTTAAAAGGAATGGGAAACACAACATCCCCTTCTTCTTTAACCTCTGCCTTCTGTTTGGACCATGGAATTTTAATATAGTAATGTACATGT
GATTGTGATAAGAATAATTAAATACTTGGCCTCCAAGAAAGCTTGCCATTTTTGGATACTCTTGTTTACACAGTACATTATTATATATTATAAATATCGTGTGTACTCTT
GCCACTACCAAGAACAACCTTGTGTTTGGTTTAAGTTTAATATAGCATCTCCAATGGAGTCCATTGCTTTGTTTGGTGGTGGATCACTACCTAAACTCCCATTTTATGAA
CCTTCCTCTGCAATCCCTAAATTGGCTTCAACTTCAAGGCTCTGTTTCAACTTCAGAAAACCATTCTTAATTCCTCAACTCATATCTACTAGTCATGGTTTCTCTTGGAA
TCCTAATGTACGTGACAAGTTCTTAGCTCGCTGCTGTGGAAATGAACACACTGGCAAGACCCGCTCAAACGAGAGACAAGACGCATTGAAGTCACTGCTGAGTTTAACAC
AATCCCAAGAAAAGAATTCAACGATATCTGCGAGAAAGAATGAAGCATTCAAGATGGCGATGGAAGGGAAGTACAATGAAGCATCACATCATATGGAACATTTATTCAAT
GGGGATGCCCACGAGGCGTATGAGGCTCGGGTTGCACATCTCCAAATTCTTATACGTCTCGATGAATACAAGAAAGCTCTAGAATTTCTGGAGGAGAAGGGTAACTTTCG
TCAATCTAAACCATATGATGCAAGACTTTCTCTTTATGAGGCCGTGGTTCATACTATGTTAGGTAAGGATGATAAAGCTAAAACATTGTGGAAAACGTACCTTGACATTG
ACAATGGTAATGGAAATCCAAGTATTTTCCTGAAGAATGCTATAGCCCTATTGATGCCATTGTTGAGCTTACAAAGTGCATTACAAGCTAAAAATGACGTGAACCTCAAC
ATTATTCCCGTAAAGGTGAAACACGGAGTTGGTGTCGAGGAAATCGTGAACCTTATTATACAGGCATGGGAAGTAGCAACTGGGAAGAAACGTCATTGAAATCTCTGCTG
CAATATGGGTTGTTGTAATACTTAGTTTGGTTGCTTTCATTTAAGAACTTAATGCTGTCACTCTTGTTTTAAGCCTCTACCGTACGGCAATGTACTGTTTAAGAGGCTGG
TACCTCATTGAGAATGTAGTAAGGCTCTGTTTTATGCCTCCCCTATAATATTTAGAACTATAATTAGATCATGAGAGGCTTGTACCTTCAATATAGTTATGGATGTGCTT
TTGTTGATTA
Protein sequenceShow/hide protein sequence
MESIALFGGGSLPKLPFYEPSSAIPKLASTSRLCFNFRKPFLIPQLISTSHGFSWNPNVRDKFLARCCGNEHTGKTRSNERQDALKSLLSLTQSQEKNSTISARKNEAFK
MAMEGKYNEASHHMEHLFNGDAHEAYEARVAHLQILIRLDEYKKALEFLEEKGNFRQSKPYDARLSLYEAVVHTMLGKDDKAKTLWKTYLDIDNGNGNPSIFLKNAIALL
MPLLSLQSALQAKNDVNLNIIPVKVKHGVGVEEIVNLIIQAWEVATGKKRH