; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012412 (gene) of Chayote v1 genome

Gene IDSed0012412
OrganismSechium edule (Chayote v1)
DescriptionAT hook motif-containing protein, putative
Genome locationLG13:15664739..15671264
RNA-Seq ExpressionSed0012412
SyntenySed0012412
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150988.1 uncharacterized protein LOC111019016 isoform X1 [Momordica charantia]3.4e-8074.54Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN
        A+PISPGSGVNG+QSHP I IQ  A+GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN + F TGN+T  +
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN

Query:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES
        H     EVPSH+SSG KLGF +  SHSN DASKDK+ISSIFAQ++P GSSRG V+PVVLQPAK T G SVA ES TIQT +++SSKG E LVGTFT+NES
Subjt:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES

Query:  ALTNVAVGIESFPFPP
        A TNV +GIESFPF P
Subjt:  ALTNVAVGIESFPFPP

XP_022150997.1 uncharacterized protein LOC111019016 isoform X2 [Momordica charantia]3.4e-8074.54Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN
        A+PISPGSGVNG+QSHP I IQ  A+GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN + F TGN+T  +
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN

Query:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES
        H     EVPSH+SSG KLGF +  SHSN DASKDK+ISSIFAQ++P GSSRG V+PVVLQPAK T G SVA ES TIQT +++SSKG E LVGTFT+NES
Subjt:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES

Query:  ALTNVAVGIESFPFPP
        A TNV +GIESFPF P
Subjt:  ALTNVAVGIESFPFPP

XP_023520441.1 uncharacterized protein LOC111783826 isoform X2 [Cucurbita pepo subsp. pepo]1.0e-7671.89Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN
        AMPISPGSGVNG+QS PAI+IQ +++GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN++ FATGN++ GN
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN

Query:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE
        +P S N EVPSHESSG  LGFKY   HSN DA K+KS+SSI AQ++P GSSRG VVPVV  PAKLT G    +E+ T+QT DI+SSKG E L+G+FT+NE
Subjt:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE

Query:  SALTNVAVGIESFPFPP
        SA  +V VGIESF F P
Subjt:  SALTNVAVGIESFPFPP

XP_038905587.1 uncharacterized protein LOC120091567 isoform X1 [Benincasa hispida]2.5e-8376.5Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN
        A+PISPGSG NGSQSHP I+IQ V +GML QVVSGVIEA FEAGYLLCV  GNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN++  ATGN+  G+
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN

Query:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE
        +P SKN E+PSHESSG KLGFKY   HSNRDA KD SISSI AQ++P GSSRG VVPVVLQPAKLT G SV TE++TIQT DI+SSKG E LVGTFT+NE
Subjt:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE

Query:  SALTNVAVGIESFPFPP
        SA T+V VGIESFPF P
Subjt:  SALTNVAVGIESFPFPP

XP_038905591.1 uncharacterized protein LOC120091567 isoform X2 [Benincasa hispida]2.5e-8376.5Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN
        A+PISPGSG NGSQSHP I+IQ V +GML QVVSGVIEA FEAGYLLCV  GNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN++  ATGN+  G+
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN

Query:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE
        +P SKN E+PSHESSG KLGFKY   HSNRDA KD SISSI AQ++P GSSRG VVPVVLQPAKLT G SV TE++TIQT DI+SSKG E LVGTFT+NE
Subjt:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE

Query:  SALTNVAVGIESFPFPP
        SA T+V VGIESFPF P
Subjt:  SALTNVAVGIESFPFPP

TrEMBL top hitse value%identityAlignment
A0A6J1DAY0 uncharacterized protein LOC111019016 isoform X11.7e-8074.54Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN
        A+PISPGSGVNG+QSHP I IQ  A+GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN + F TGN+T  +
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN

Query:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES
        H     EVPSH+SSG KLGF +  SHSN DASKDK+ISSIFAQ++P GSSRG V+PVVLQPAK T G SVA ES TIQT +++SSKG E LVGTFT+NES
Subjt:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES

Query:  ALTNVAVGIESFPFPP
        A TNV +GIESFPF P
Subjt:  ALTNVAVGIESFPFPP

A0A6J1DCA8 uncharacterized protein LOC111019016 isoform X21.7e-8074.54Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN
        A+PISPGSGVNG+QSHP I IQ  A+GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN + F TGN+T  +
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN-ILFATGNETNGN

Query:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES
        H     EVPSH+SSG KLGF +  SHSN DASKDK+ISSIFAQ++P GSSRG V+PVVLQPAK T G SVA ES TIQT +++SSKG E LVGTFT+NES
Subjt:  HPPSKNEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANES

Query:  ALTNVAVGIESFPFPP
        A TNV +GIESFPF P
Subjt:  ALTNVAVGIESFPFPP

A0A6J1EVJ4 uncharacterized protein LOC111438469 isoform X15.5e-7670.51Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN
        A+P+SPGSGVNG+QS PAI+IQ +++GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN++ FATGN++ GN
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN

Query:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE
        +P S N EVPSHESSG  LGF+Y   HSN DA K+KS+SSI AQ++P GSSRG VVPVV  PAKLT G    +E+ T+QT DI+SSKG E L+G+FT+NE
Subjt:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE

Query:  SALTNVAVGIESFPFPP
        SA  +V VGIESF F P
Subjt:  SALTNVAVGIESFPFPP

A0A6J1JAQ9 uncharacterized protein LOC111483279 isoform X11.5e-7671.43Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN
        A+PISPGSGVNG+QS PAI+IQ +++GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN++ FATGN++ GN
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN

Query:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE
        +P S N EVPSHESSG  LGFKY   HSN DA K+KS+SSI AQ++P GSSRG VVPVV  PAKLT G    +E+ T+QT DI+SSKG E L+G+FT+NE
Subjt:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE

Query:  SALTNVAVGIESFPFPP
        SA  +V VGIESF F P
Subjt:  SALTNVAVGIESFPFPP

A0A6J1JCN8 uncharacterized protein LOC111483279 isoform X21.5e-7671.43Show/hide
Query:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN
        A+PISPGSGVNG+QS PAI+IQ +++GML QVVSGVIEA FEAGYLLCV VGNSG+TLRGVVFKPGHYVPVSAEND+APD QMIRRN++ FATGN++ GN
Subjt:  AMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNIL-FATGNETNGN

Query:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE
        +P S N EVPSHESSG  LGFKY   HSN DA K+KS+SSI AQ++P GSSRG VVPVV  PAKLT G    +E+ T+QT DI+SSKG E L+G+FT+NE
Subjt:  HPPSKN-EVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANE

Query:  SALTNVAVGIESFPFPP
        SA  +V VGIESF F P
Subjt:  SALTNVAVGIESFPFPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G21895.1 DNA binding3.4e-0933.62Show/hide
Query:  MLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSK---------NEVPSHESSGTKLG
        ++ +VV+GVIE +F+AGYLL V V +S   LRG+VF  G   P++ END+AP  +M  R  +    N+T+ + P  +          ++   ESS   + 
Subjt:  MLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSK---------NEVPSHESSGTKLG

Query:  FKYITSHSNRDASKDK
            T+   ++A+KD+
Subjt:  FKYITSHSNRDASKDK

AT5G52890.1 AT hook motif-containing protein5.8e-0945.31Show/hide
Query:  VANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN
        V   ++ +VVSGV+E +FEAGY L V V ++   L+GVVF P    P++   DL P A+M  RN
Subjt:  VANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN

AT5G52890.2 AT hook motif-containing protein5.8e-0945.31Show/hide
Query:  VANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN
        V   ++ +VVSGV+E +FEAGY L V V ++   L+GVVF P    P++   DL P A+M  RN
Subjt:  VANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRN

AT5G54930.1 AT hook motif-containing protein1.8e-1836.41Show/hide
Query:  SPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSK
        S G   + S+S    + +     M+ Q +SGVIEA FEAG+LL V VGNS   LRGVVFKPGH  PVS +ND+APD  MIRRN                 
Subjt:  SPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSK

Query:  NEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNE
        ++V  H+ S  K G K           + +++  +  Q +        +VPVVLQPA L  G     E + I    +Q+  G++
Subjt:  NEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNE

AT5G54930.2 AT hook motif-containing protein1.8e-1836.41Show/hide
Query:  SPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSK
        S G   + S+S    + +     M+ Q +SGVIEA FEAG+LL V VGNS   LRGVVFKPGH  PVS +ND+APD  MIRRN                 
Subjt:  SPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSK

Query:  NEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNE
        ++V  H+ S  K G K           + +++  +  Q +        +VPVVLQPA L  G     E + I    +Q+  G++
Subjt:  NEVPSHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCTATGCCTATTTCTCCTGGTTCTGGAGTAAATGGAAGCCAATCACATCCAGCAATTAAAATTCAAAAGGTAGCTAATGGGATGCTGAGGCAGGTTGTATCGGG
TGTCATTGAGGCAGCATTCGAAGCTGGATATCTTCTGTGTGTTAATGTTGGCAACTCTGGACTCACTTTGAGGGGTGTTGTTTTTAAGCCTGGGCATTATGTTCCTGTTT
CAGCAGAGAATGACCTGGCCCCAGATGCTCAAATGATCAGAAGAAATATTCTTTTTGCTACAGGAAATGAAACTAATGGAAATCACCCCCCATCTAAAAATGAAGTCCCA
TCTCATGAATCATCAGGTACCAAACTGGGGTTTAAATATATAACTTCACATTCTAACCGGGATGCATCGAAAGACAAATCTATATCATCTATATTTGCCCAAGTATCTCC
TTTGGGAAGCTCAAGAGGTACTGTGGTCCCTGTTGTATTACAGCCTGCTAAATTAACCTATGGGCGCTCAGTTGCCACTGAATCACTTACTATTCAAACTCCTGATATTC
AATCCTCAAAAGGCAATGAGTTTCTTGTAGGTACTTTTACAGCAAATGAATCAGCTCTCACCAATGTGGCAGTTGGAATTGAAAGCTTTCCTTTCCCACCCTAA
mRNA sequenceShow/hide mRNA sequence
TTAAATTTGAACCCTCTAAGTTTGGAAAACTTTTGTTTCCACCCTCCTCTCTGTCGTGGCTGTCTCAATGCAAAATCCGGAGCTCTCTTCTTTGCCCTATCCGTGCTTGC
TTCCTCTCATCCACGTTGTCGCTACTACTTTCCATACATCACTCATTTTTCTCTCAATTTCACACTCCCTACCCAGGTATCCCCGATTTTCCTCTATTTGATTTCGCTAC
ATTTATTGCATTTTCCAAGAATGTGCATGTTTTTGCTGCCTAATTTATACGGGGTTTATGGAATTTTCCCCCAAGATGGCCTTCGAGATGGTGTTAGTATCTGTAATTTG
GGGATTTTCTTTAGTTTTCACTAGAATGGAGTTTATTCATTCTACTGCTAGGAGTTTGTTCAAACATGATTTTGGTTCTGCTGCATTTAATAGGTCATTCCAATGATGAA
ATCTTTCTTCTTCTTTTTTTAAAGATTTTGAGCTATGTTTAAGGATGGCATTTGCGAGTTTGTTACAACTTAAAATAGTTCTTACGGGGTTTCCCATTGGCGAACAAGAG
TTTCATACATCTTTGTTACAATCCTATTTTTTCCATTAGTTTTATAAGAAGAATTAGGCTTTTGTATAAGCTCGTTTATGACTATTTTCCAGGCCAAATGTGTTCTTTTT
TCGACGGCTAAACAATTAAGGAAAAACTTAGCCCAACCTTAGTAAACGTATGTATGAACAGTCTGTTTTGGGAATTGCTTGGCACAAAAATCATTTTGATGGACAATTCT
TTCACTATTCTTTCATTTTAATAGATATTTAATACAATCTTTCCTTGCATCTCCGAGAAATGAACTAAAGAATTCATTCAGCAGGTCCATTTTTTCCTTTATAATTCTCC
ATTTCATTCTTGTAGTCCCCAAGAGCGGTCTCGATGGTATAGGCTTGGGGTCTCCAAGGTATATTCCTTTTGAGCAGTATTTTTAAAGGCGTAAGGCGCAAAGACTATTT
GGAGCCTGAGGCGCAAGGCGCAAAAAAGGCGCGAGCTTTATCTTAAAGGCGCACCATGTATAAAAAAAAAATTAAAAATATTTATTTATCACAAAACATTAAAAAAAATA
TATGTATCACAAAAATATTTATTAAGTTATAAACCACTAAGTTGCAATATTCATGCATATATAAGATATCAAAACAAGAGGTAGCTTTATAATATTAGTAGCATTGTATC
CAATCAAAATATTAAATAAGTCCTAAAGCTCGAACTCCTCTCCACTGAATTCCAAATCCTCCACATCCACTTCATTGTTGGACTTGTATCCCTCGGCCTCTATATCTTCT
TCTTCTTCACTATCAATCTCCATAATTGACCCACGTCGTATAAAAGGTTGTGGTTGTGTGGAGGATGAAGATTGAGATGTAGTCTTGTTTCTAGAGGTATTAGCCCTAGA
ATAATATGTTGGTTCATTCGCTCCAGCAGCTCGAGAGACATCGCCCCAAGTTAATGTATCATCTTCAAACACAAGATCACGATCCTCATCAGAATCATCTTCCATTACTC
CAAGCAAAGATTCATTGCAATCATCAATGTCATGTAAAGTGATGGGGTCGATGGAATCTCGAATGGTATATCGACGCTTCAATGCTCTATTGTATTTGATGTAGACTAAG
TCATTTAATCGACTTTGATCAAGTCTATTTCTCCTCTTGCTATGAAGCTGCAAAGCCAATTATAAATAAGTAAAGTTTAACTCCATTACTTTACTTTAGTATACTTCAAG
TTTAGGAGATTACATTACTAACCTATTCAAAAACACTCCACTTACGCTCACAACCTGATGCACTACAAGTAAGGCCTAAAACCTTTATCGCAAACTTTTTAAAAGATGGA
GTTGAATGTCCAAAGGTACTCCACCACGCCACTAAAGATCAAAATTCAAAACTACATATGTTAATAACTATTAACTTCCACACTTAAAGTTAAATTGAAGTATTTCGTGC
TCATTTGTCGTACCTGGAGATCTTTTATCTCGATTTCTAATAGCCAATGGTTGTCCAAATGATTCTTCAGCTTCCTTGTATTTAGGGATCTCTGCAAGTATCTTGTCTTG
TTCTCCCAAAGAAGGAATCATTCGGGTTATACATGCTAGTAACCCTTCGTTAATTTCTTTATTATCGGCAATGGTAGGATCTTTGTAAAATAACTCAGGGTTTAGAAAAT
GACCCGCTGCATGCAATAGACAATGTAATTGAAGCTCCCACCTTCGATCAATAATAGCAAACACGTCCTCGTACTTCTCCTTCTGACCATGAAACGCTCTCTCTATGGAC
TCTTTTGCTCGATCCATAGCCTCATAAATGTATCCCATTGGAGGCTTCTTCTCATCGTCAACCAACTGAAGCACTCGCACTAAAGGGTTTGAAACTTTAAGACCAAAGAG
TATTGTAGCCCAAAAATTAGTCATCATAATGGTTTTAAGAACTCGTTTGCTTTGTTGCTCTTTGATCCATTTGCTATTTCTCCATTCTTCAGATGTAAACATCTTCCTCA
AGTTAGCTTGTTAAGTGTGTAGACTAGACAATGTTATGCTAGCTGTAGCAAATCGAGTCTTCGCCGGTCTAATCAACTCCTTTGAATTAGTGAACTTCCTCATCATATTC
AATAATCCTGGTCTTACATAAATGAAATTACTGATATCCATGGATCGTTTCAAAGTCTTACGCAACTTTGGGATCTTGAATATGTCTTCCAACATTAGATCTAAACAGTG
TGCTGCACATGGCGACCAAAATAAATGTGGTCTTTTCGCTTCTAACAATCTCCCTGCCATTACGTTTGTAGAGGCACTATCAGTAACAGCTTGAAGTACATTAGGCTTTC
CAATACACTCAATAAAGTTATCCATTAATTCAAACATCTTTTTTCCATCTTTAACAACAGATGAGGCATCGATATACTCGATGAACACTGTCCTTTTTGGACTATTAACC
AAAACGTTCAGCAATGTTCTATTTCTCCTATCCGTTCATCCATTGGTCATTATTGTACATCCAACCTTAGCCCATTGTTCTTTGTGAGACTTGAGCATCTCATCAGTACT
TTTAACCTCTTTCTTCAGGCAAGGGGCTCTCAATTCGTGATATGTAGGTGGCTTTAATCCTGGACTGAATTGTCCTATTGCTTCTATCATTGGAGCAAAACTATTGTAAT
TGCAAGCATTAAGAGGAATCCCGACATCATAAAACCATCTCGCAATGAATTGGATAACATTCGATCTCATTTCCTTTTTGAATGTTGCATTCAAAGTTGTTTGCTTTTGA
GTTATTGCTCCACTCCCTATAGGTTTTGGAGCGAAGAATGCGTCCATGGGACCCTTAGCTTTGGGTTTCTTGAAATTCGATTACGAACTTGAACTTGAGTTACTTGATGT
AGGTATTTTCCCCAAAAGAGTAACAGTTTCTTCATCTTCGTCACATTCATCATACAAACCATAGAGTTCGGTTTCTACATCGTTTGCAACAGCCTCTGAAACCAAATTTC
ATTGTTGTTTGATATCCTTTTTATTCATCATAAACACTCTCAATTCTTCCTTCACATGTTCAGGACATTTTCTACAATTCGTTGCATTTCGAAAGCCTCCAACGAGGTGT
TGCTTCATTCTATAAACACCTTCTTTAGTTACTTTCATACAAAACTCGCTCACCAATGAATTTTTATCTTCAGGTTTTTCTAACTGAGCATATTTCCATGCTGGATCTTT
TCTTGAGTCTTCTTCACTTTTTTAACTTGCCATCTTTGAATAAAGGTATACTTAAGAAAAAACATAAAATAACCTATAGTAAAAAAACATCACACATTCACACTAAATAC
ACATTATATAAATCAAAGATTCCAAATAGTTAAACAGATAGCCAAGGTGCAAGTCAGTAATCAAAATGTGAAATTTATCTTATAAGTCTATAACACAGTGAAACACATAT
CCAATTCCAAGTCAGTAATTAAACAAAATCATAAATCAAACATACTATTTATGAACATCAACTTTTAATCATTCAAAACAGAAAAGAAATGCAAACATTACAAACCAAAA
AAAAATCAAACAGAGAAGAGAAGAGAAGATGAATAGAACACTTGGTTTTTTAAAACAAATAGAACACTTGGTTTGATATTTAAAAAGAAAAAAAGAGGGGGATTTCGAAG
AGGAAGAAGAAGACTCATCAGGCACTACCGGTGAGTCTTCGAAAACAGCCGTTGAAGAAGATGTCGAGGTCGCGAAGCTGTCGGAGAATTTTGAGAAGAAGACGTCGATG
TTGCGCAGCTGTCGGAAAATTTCGAGAAGAAGCTGCCGGAGAATTTCGAGAAGAAGTTGACAAAGAAGACGTTTCTACGAATAGGGTTTCGAAGATATGCGACGTCGAGG
GTTTTACGATTGAAAACTTATCGATGAAGTTTCATTTTTGGTTTATTAAAAAATATATATAAATGATGAAACGACGTCGTTTTGATGCCGGTTTTAAAAATAAAAAAATA
ATGATAAAGAAAAAATATTAGGCGCGCGCCTCGTGCTCCCTAGCGCCTTAGAAAAAGGCTCGGGAGGCACGCGCTTTCCCTTGCGCGCTGCGACTTGGCCAAGCAAGGCG
CAGGCTGCGCGCCTTGCGCCTTATGGAGCTTTTTAAAACACTGCTTTTGAGGTCTGTGAACTTAATATTAAAATCCCTTGGAGGTCTCGGGTTAATTTGAATATATAATA
GTGTCTTGGTTTCCTATAAAAGAAACAAATACATTAAATATTAAACTTCTTTCTTAAGCTTGAAATATATGTTCCTTTTGCTGTTGTGGGAGGGAGTTAATTAGTGCTAC
TGAACAAGCAATAATTTGGTAGGAATAAAGAGGATGAGTCAGGAAGACCAAGGAATCCGCGCTGATAATTTAGCTGATGTTCACTGGAAGCGAAAACGTGGTCGCCCTAG
ATAAGTTCCAAATTTAAATTATGATGAGAGTATTCTTATTACAAAGAACAAACATATGGTGGCTATGCCTATTTCTCCTGGTTCTGGAGTAAATGGAAGCCAATCACATC
CAGCAATTAAAATTCAAAAGGTAGCTAATGGGATGCTGAGGCAGGTTGTATCGGGTGTCATTGAGGCAGCATTCGAAGCTGGATATCTTCTGTGTGTTAATGTTGGCAAC
TCTGGACTCACTTTGAGGGGTGTTGTTTTTAAGCCTGGGCATTATGTTCCTGTTTCAGCAGAGAATGACCTGGCCCCAGATGCTCAAATGATCAGAAGAAATATTCTTTT
TGCTACAGGAAATGAAACTAATGGAAATCACCCCCCATCTAAAAATGAAGTCCCATCTCATGAATCATCAGGTACCAAACTGGGGTTTAAATATATAACTTCACATTCTA
ACCGGGATGCATCGAAAGACAAATCTATATCATCTATATTTGCCCAAGTATCTCCTTTGGGAAGCTCAAGAGGTACTGTGGTCCCTGTTGTATTACAGCCTGCTAAATTA
ACCTATGGGCGCTCAGTTGCCACTGAATCACTTACTATTCAAACTCCTGATATTCAATCCTCAAAAGGCAATGAGTTTCTTGTAGGTACTTTTACAGCAAATGAATCAGC
TCTCACCAATGTGGCAGTTGGAATTGAAAGCTTTCCTTTCCCACCCTAATCTAGTCAGCATGTCTTACAAGATGATGTTGTTTCTGTAGAAAATAGTTCTCACAGCCATT
CCTTAGTTTTGGAAGGTAAATCGATGACATTGCCTAGCACGCCTTTCGATAGTCTTGCGACTGAAGTGATCAAGAGAATTCAAGCCCCTCTGTTCGCTGAGATGCCGACT
GAGAATGATAAATCAACCGGCAAGACATCAGCGAAAGAATGCGAAGATACCTCGAAGGTTGAGGCTAACATAGTGGATGGACTTGTGTTAGAAGAATCCCTAAAAGCAGT
GCAGCACCCCCATGAAAGTTCAGTGACTGTTCCCAAAGCTCTGGATGGTGAGTCTAAAACTGGGAAAATGACTGAGTAATGTTACAGGAAAACGTGATGCAAACTACATA
GCCATGGGCTGAAAAGCAGAACCCAGATTTGATGTTTAAGTCCATTGAACCTAATGAATAAAAAACAGAGATTGGGGATGAAGAAGCTGCCAACCAAAAGCAAATCTGAG
TAGCATCGTATCGTCACGATTAATATTTTCGTTGTAGGTAAATCATATGGTGTAGCTGATTTCTGGATGCTACCACGCCTAATCCTAACTTAAGTGTAGTTGGATGATTC
TCTAGAAAGTTATTGTTTTTTTCTAGTTGGATATTATAGATATGGACAATTTCACACTTGTAAAGGTTTCATTGGGTGGAGTGAATTTATCAAAAAAAAGTTTTGACCAA
A
Protein sequenceShow/hide protein sequence
MVAMPISPGSGVNGSQSHPAIKIQKVANGMLRQVVSGVIEAAFEAGYLLCVNVGNSGLTLRGVVFKPGHYVPVSAENDLAPDAQMIRRNILFATGNETNGNHPPSKNEVP
SHESSGTKLGFKYITSHSNRDASKDKSISSIFAQVSPLGSSRGTVVPVVLQPAKLTYGRSVATESLTIQTPDIQSSKGNEFLVGTFTANESALTNVAVGIESFPFPP