; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009835 (gene) of Snake gourd v1 genome

Gene IDTan0009835
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionorgan-specific protein S2-like isoform X2
Genome locationLG02:8319298..8321938
RNA-Seq ExpressionTan0009835
SyntenyTan0009835
Gene Ontology termsNA
InterPro domainsIPR024489 - Organ specific protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646137.1 hypothetical protein Csa_016549 [Cucumis sativus]4.3e-8363.3Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPN-GAEKKSFAKSIEPRSSATFYLN
        MK  P+FGI L LLLL  N IESR+EPGG W+N++ED+ L   ++E++DC ++  L+NE      ++P  S+  YPN G++ K F K IEPR SATFY N
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPN-GAEKKSFAKSIEPRSSATFYLN

Query:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYPDDIEK-KLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYPDDIEK-KLF
        D  K + F KDIEPRPSLTFYP++  KN LFT+DIEPRPS TFYP+D  K K F+ DIEPRPS TFYP++  KN LFTKDIEPRPSATFYP+D  K + F
Subjt:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYPDDIEK-KLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYPDDIEK-KLF

Query:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPN-DVKTKLFSKDIEPRPSTTFYP-NDLKTK
         KDIEPRPS TFYP DD K KLF KDIEPRPSATFYPN D   K F+KDIEPRPS TFYP ND K K
Subjt:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPN-DVKTKLFSKDIEPRPSTTFYP-NDLKTK

XP_022135727.1 uncharacterized protein LOC111007618 [Momordica charantia]1.0e-9264.52Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCL-QHIKLENEKPLVV-VEPGSSVKLYPNGAEKKS-FAKSIEPRSSATFYL
        MK SP F I LFLLLLLV+ IESRYEPGGHWRN MED+PL EATKE++DCL QH +LENEK  V  +EP  S+  YPN A+K S F   IEP++SATFY 
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCL-QHIKLENEKPLVV-VEPGSSVKLYPNGAEKKS-FAKSIEPRSSATFYL

Query:  NDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDI
        +D++ K+F +DI+PRPS+T YP++  K    EDIEPRPS TFYPDD++ KLF  DI+PRPS TFYP++    L  K IEPRPS TFYPDD++ KLF KDI
Subjt:  NDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDI

Query:  EPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHG-KADIKIAQA
        +PRPS+TFYP+D K +L AK IEPRPS TFYP+DV+  L +KDIEPRP+ T YPN LKTKES  +AHHG +ADI+IAQA
Subjt:  EPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHG-KADIKIAQA

XP_031745283.1 uncharacterized protein LOC105436132 isoform X1 [Cucumis sativus]2.5e-8362.98Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPN-GAEKKSFAKSIEPRSSATFYLN
        MK  P+FGI L LLLL  N IESR+EPGG W+N++ED+ L   ++E++DC ++  L+NE      ++P  S+  YPN G++ K F K IEPR SATFY N
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPN-GAEKKSFAKSIEPRSSATFYLN

Query:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYPDDIEK-KLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYP-DDIEKKLF
        D  K + F KDIEPRPSLTFYP++  KN LFT+DIEPRPS TFYP+D  K K F+ DIEPRPS TFYP++  K+ LFTKDIEPRPS TFYP DD + KLF
Subjt:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYPDDIEK-KLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYP-DDIEKKLF

Query:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPN-DVKTKLFSKDIEPRPSTTFYPND
         KDIEPRPS TFYP D+ K + F KDIEPRPS TFYPN D K KLF+KDIEPRPS TFYPND
Subjt:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPN-DVKTKLFSKDIEPRPSTTFYPND

XP_031745285.1 proteoglycan 4 isoform X2 [Cucumis sativus]3.0e-8464.5Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPN-GAEKKSFAKSIEPRSSATFYLN
        MK  P+FGI L LLLL  N IESR+EPGG W+N++ED+ L   ++E++DC ++  L+NE      ++P  S+  YPN G++ K F K IEPR SATFY N
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPN-GAEKKSFAKSIEPRSSATFYLN

Query:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYPDDIEK-KLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYPDDIEK-KLF
        D  K + F KDIEPRPSLTFYP++  KN LFT+DIEPRPS TFYP+D  K K F+ DIEPRPS TFYP++  KN LFTKDIEPRPSATFYP+D  K + F
Subjt:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYPDDIEK-KLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYPDDIEK-KLF

Query:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPND-VKTKLFSKDIEPRPSTTFYPND
         KDIEPRPSLTFYP DD K KLF KDIEPRPSATFYPND  K K F KDIEPRPS TFYPND
Subjt:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPND-VKTKLFSKDIEPRPSTTFYPND

XP_038887162.1 uncharacterized protein LOC120077350 [Benincasa hispida]6.2e-9063.57Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVV-VEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLND
        MK  P+FGI L L LLLV+SIESRYEPGG WRN++EDE     TKE++DCL+  K+ NE+  +  +EP  SV  YP+ A+ K F K IEPR S TFY  D
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVV-VEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLND

Query:  IEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDIEP
        +  KLF KDIEPRPS TFYP++   N F ++IEPRPS TFYP+D +   F  DIEPRPSTTFYP +     FTKDIEPRPS TFYP D++  LF KDIEP
Subjt:  IEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDIEP

Query:  RPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTK
        RPS TFYPDD KIKLFAK+IEPRP  TFYPND K K F+KDIEPRPSTTFYP ++K K
Subjt:  RPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTK

TrEMBL top hitse value%identityAlignment
A0A0A0K3R4 Uncharacterized protein1.3e-8565.27Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPNGAEK-KSFAKSIEPRSSATFYLN
        MK  P+FGI L LLLL  N IESRYEPGG W+N++ED+ L   ++E++DC ++  L+NE       +P  S+  YPN   K + F K IEPR SATFY N
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLV-VVEPGSSVKLYPNGAEK-KSFAKSIEPRSSATFYLN

Query:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYP-DDIEKKLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYP-DDIEKKLF
        D  K + F KDIEPRPS TFYP++  KN LFT+DIEPRPS TFYP DD + KLF  DIEPRPS TFYP++  KN LFTKDIEPRPSATFYP DD + KLF
Subjt:  DIEK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYP-DDIEKKLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYP-DDIEKKLF

Query:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPN-DVKTKLFSKDIEPRPSTTFYPND
         KDIEPRPS TFYP DD K KLF KDIEPRPSATFYPN D K KLF+KDIEPRPS TFYPND
Subjt:  AKDIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPN-DVKTKLFSKDIEPRPSTTFYPND

A0A0A0K958 Uncharacterized protein6.5e-7761.15Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLNDI
        MK  P+FGI LFLLLL  N IESRYEPGG W+N++ED+ L   ++E++DC ++  L+NE                      +F   I+PR S TFY ND 
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLNDI

Query:  EK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYP-DDIEKKLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYP-DDIEKKLFAK
         K K F KDIEPRPSLTFYP++  KN LFT+DIEPRPS TFYP DD + KLF  DIEPRPS TFYP++  KN LFTKDIEPRPS TFYP DD + KLF K
Subjt:  EK-KLFAKDIEPRPSLTFYPDNANKN-LFTEDIEPRPSTTFYP-DDIEKKLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYP-DDIEKKLFAK

Query:  DIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPND-VKTKLFSKDIEPRPSTTFYPND
        DIEPRPSLTFYP D+ K K F KDIEPRPS TFYPND  K K+F KDIEPRPS TFYP++
Subjt:  DIEPRPSLTFYP-DDFKIKLFAKDIEPRPSATFYPND-VKTKLFSKDIEPRPSTTFYPND

A0A6J1C3J9 uncharacterized protein LOC1110076185.0e-9364.52Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCL-QHIKLENEKPLVV-VEPGSSVKLYPNGAEKKS-FAKSIEPRSSATFYL
        MK SP F I LFLLLLLV+ IESRYEPGGHWRN MED+PL EATKE++DCL QH +LENEK  V  +EP  S+  YPN A+K S F   IEP++SATFY 
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCL-QHIKLENEKPLVV-VEPGSSVKLYPNGAEKKS-FAKSIEPRSSATFYL

Query:  NDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDI
        +D++ K+F +DI+PRPS+T YP++  K    EDIEPRPS TFYPDD++ KLF  DI+PRPS TFYP++    L  K IEPRPS TFYPDD++ KLF KDI
Subjt:  NDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDI

Query:  EPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHG-KADIKIAQA
        +PRPS+TFYP+D K +L AK IEPRPS TFYP+DV+  L +KDIEPRP+ T YPN LKTKES  +AHHG +ADI+IAQA
Subjt:  EPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHG-KADIKIAQA

A0A6J1EL39 organ-specific protein S2-like isoform X21.0e-6952.86Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVV-VEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLND
        MK    FGI L LLL+ VN+IESR+EPG    N+          + ++D ++  + ENEK  V  +EP  S   YP  AEKKSF K IEPR SATFY N+
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVV-VEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLND

Query:  -IEKKLFAKDIEPRPSLTFYP-DNANKNLFTEDIEPRPSTTFYP-DDIEKKLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYPDDIEKKLFAK
         +   LF KDIEPRPS TFYP +N    LF +DIEPRPS TFYP ++++  LF  DIEPRP  TFYP+   K  LF +DI+PRPS +FYP+D + KLF K
Subjt:  -IEKKLFAKDIEPRPSLTFYP-DNANKNLFTEDIEPRPSTTFYP-DDIEKKLFVNDIEPRPSTTFYPDNADKN-LFTKDIEPRPSATFYPDDIEKKLFAK

Query:  DIEPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHGKADIKIAQA
        DIEPR S++FY D+ K KLFAKDIEPRP+ + YP++ K +L++ DI+P+PSTT YP++L +K SS + HH   DI+I +A
Subjt:  DIEPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHGKADIKIAQA

A0A6J1GKY7 uncharacterized protein LOC111454914 isoform X12.3e-7454.34Show/hide
Query:  IILFLLLLLVNSIESRYEPGG-HWRNMMEDEPLLE--------------ATKEEKDCLQHIKLENEKPLVV-VEPGSSVKLYPNGAEKKSFAKSIEPRSS
        I  FLL+LL N+IESRYEPG  HWRN ++D+ L E              + KE +DC   +KLE+ K  V  +EP        +  + K F+  I+PR S
Subjt:  IILFLLLLLVNSIESRYEPGG-HWRNMMEDEPLLE--------------ATKEEKDCLQHIKLENEKPLVV-VEPGSSVKLYPNGAEKKSFAKSIEPRSS

Query:  ATFYLNDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKL
        A+FY +D +KK  A+DIEPRP+L+FYPD     LF++DIEPRPS +FYPDD +KK    DIEPRP+ +FYPD     LF+KDIEPRPSA+FYPDD +KK 
Subjt:  ATFYLNDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKL

Query:  FAKDIEPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTK
         A+DIEPRP+L+FYPD  K KLF+KDIEPRPSA+FYP+D K K  ++DIEPRP+ +FYP+ +KTK
Subjt:  FAKDIEPRPSLTFYPDDFKIKLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTK

SwissProt top hitse value%identityAlignment
P17771 Organ-specific protein P48.9e-0728.47Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFY----
        M    +F ++   L L+V ++ESR + G +W+ +M+D+ + E   E +  L    ++N K               +  E K      EP  +A+ Y    
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFY----

Query:  LNDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFY
        ++  E K    + E RP+ + Y DN     FT+D EPRPS T Y
Subjt:  LNDIEKKLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFY

P17772 Organ-specific protein S23.8e-1330.1Show/hide
Query:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFY----
        M    +F ++   LLL+V ++ESR + G +W+ +M+D+ + E   E +  L    ++N K               +  E        EPR  A+ Y    
Subjt:  MKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFY----

Query:  LNDIEKKLFAKDIEPRPSLTFYPDN---ANKNL-FTEDIEPRPSTTFYPDD----IEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFY
        ++  E      + EPRP+ + Y DN   AN+N   + + EPRP+ + Y D+     E K  + + E RP+ + Y DN     FT D EPRPS T Y
Subjt:  LNDIEKKLFAKDIEPRPSLTFYPDN---ANKNL-FTEDIEPRPSTTFYPDD----IEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAAAGAAAAAGCCGAAGATATGAAGTACAGTCCAAGCTTTGGCATCATTCTCTTTTTGCTTCTTTTGCTGGTCAACAGCATCGAATCAAGATATGAACCTGGAGG
ACACTGGAGAAATATGATGGAAGATGAACCATTACTAGAAGCAACTAAAGAGGAGAAAGATTGTCTCCAACATATCAAGCTTGAAAACGAAAAGCCTTTGGTCGTGGTTG
AACCAGGATCAAGTGTTAAGCTTTATCCAAATGGTGCTGAAAAGAAATCTTTTGCCAAAAGTATTGAACCACGATCAAGTGCCACATTTTATTTGAATGACATTGAAAAA
AAGCTTTTTGCCAAAGATATAGAGCCTCGACCAAGTCTCACATTTTATCCAGATAACGCCAATAAGAATCTTTTCACCGAAGATATAGAACCACGCCCAAGTACCACATT
TTATCCAGATGACATCGAAAAAAAGCTTTTTGTCAATGATATCGAGCCTCGACCAAGTACCACATTTTATCCGGATAACGCCGACAAAAATCTTTTCACCAAAGATATAG
AACCACGCCCAAGTGCCACATTTTATCCAGATGACATCGAAAAAAAACTTTTTGCCAAAGATATAGAGCCTCGACCAAGTCTCACATTTTATCCAGATGATTTCAAAATA
AAACTTTTTGCCAAAGATATAGAGCCACGTCCAAGTGCCACATTTTATCCAAATGATGTGAAAACAAAACTTTTTTCCAAAGATATAGAACCACGACCAAGTACCACATT
TTATCCCAATGATCTTAAAACAAAAGAGTCCTCTACCGAAGCTCACCATGGCAAAGCTGACATAAAGATTGCACAAGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACAAAGAAAAAGCCGAAGATATGAAGTACAGTCCAAGCTTTGGCATCATTCTCTTTTTGCTTCTTTTGCTGGTCAACAGCATCGAATCAAGATATGAACCTGGAGG
ACACTGGAGAAATATGATGGAAGATGAACCATTACTAGAAGCAACTAAAGAGGAGAAAGATTGTCTCCAACATATCAAGCTTGAAAACGAAAAGCCTTTGGTCGTGGTTG
AACCAGGATCAAGTGTTAAGCTTTATCCAAATGGTGCTGAAAAGAAATCTTTTGCCAAAAGTATTGAACCACGATCAAGTGCCACATTTTATTTGAATGACATTGAAAAA
AAGCTTTTTGCCAAAGATATAGAGCCTCGACCAAGTCTCACATTTTATCCAGATAACGCCAATAAGAATCTTTTCACCGAAGATATAGAACCACGCCCAAGTACCACATT
TTATCCAGATGACATCGAAAAAAAGCTTTTTGTCAATGATATCGAGCCTCGACCAAGTACCACATTTTATCCGGATAACGCCGACAAAAATCTTTTCACCAAAGATATAG
AACCACGCCCAAGTGCCACATTTTATCCAGATGACATCGAAAAAAAACTTTTTGCCAAAGATATAGAGCCTCGACCAAGTCTCACATTTTATCCAGATGATTTCAAAATA
AAACTTTTTGCCAAAGATATAGAGCCACGTCCAAGTGCCACATTTTATCCAAATGATGTGAAAACAAAACTTTTTTCCAAAGATATAGAACCACGACCAAGTACCACATT
TTATCCCAATGATCTTAAAACAAAAGAGTCCTCTACCGAAGCTCACCATGGCAAAGCTGACATAAAGATTGCACAAGCTTAA
Protein sequenceShow/hide protein sequence
MHKEKAEDMKYSPSFGIILFLLLLLVNSIESRYEPGGHWRNMMEDEPLLEATKEEKDCLQHIKLENEKPLVVVEPGSSVKLYPNGAEKKSFAKSIEPRSSATFYLNDIEK
KLFAKDIEPRPSLTFYPDNANKNLFTEDIEPRPSTTFYPDDIEKKLFVNDIEPRPSTTFYPDNADKNLFTKDIEPRPSATFYPDDIEKKLFAKDIEPRPSLTFYPDDFKI
KLFAKDIEPRPSATFYPNDVKTKLFSKDIEPRPSTTFYPNDLKTKESSTEAHHGKADIKIAQA