CuGenDBv2

Tan0018747 (gene) of Snake gourd v1 genome

Gene ID: Tan0018747
Organism: Trichosanthes anguina (Snake gourd v1)
Description: organ-specific protein S2-like isoform X2
Genome location: LG02:8462312..8463520
RNA-Seq Expression: Tan0018747
Synteny: Tan0018747
Gene Ontology terms: NA
InterPro domains: IPR024489 - Organ specific protein
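
The genome location field packs a sequence name and a coordinate range into one string. A minimal sketch of how it could be split and measured is shown here; the names `GenomeLocation` and `parse_location` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    seqid: str  # linkage group or chromosome name, e.g. "LG02"
    start: int  # start coordinate (assumed 1-based, inclusive)
    end: int    # end coordinate (assumed 1-based, inclusive)

    @property
    def span(self) -> int:
        """Number of bases covered, under the inclusive-coordinate assumption."""
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(seqid, start, end)


loc = parse_location("LG02:8462312..8463520")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under that assumption the gene spans 1209 bases on LG02, which is longer than the 834-nucleotide CDS listed under Sequences.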


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6572028.1 hypothetical protein SDJN03_28756, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.0e-86; % identity: 54.43)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK LVFI FFLL L ANT+ESR+EPG HH R   I ++ LH  I++ IH+DPNSLLSEK+  +DC ETLK E GKLF K++EPR  A+F  +  KT LF+
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDD----------------------IKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFY
         DIEPR S +FYPD  KT LF+ D+E + S SFYPDD                      +K KLF+KDIEPRP+ SFYPD  KTKLF+KDIEPRPS +FY
Subjt:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDD----------------------IKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFY

Query:  PDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK--------------VEPQPSATFY------------
        PDDTK +  A+DI+PRP+ SFYPD  KTKLF+KDIE RPS SFYPD TK K   +DIEPRP+L               ++P+PSA+FY            
Subjt:  PDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK--------------VEPQPSATFY------------

Query:  --RDNLKAKELSADAHHGEADIQVAQA
           DNLKAK+   DAHH + DIQVA A
Subjt:  --RDNLKAKELSADAHHGEADIQVAQA

XP_022952150.1 uncharacterized protein LOC111454914 isoform X1 [Cucurbita moschata] (e value: 1.5e-83; % identity: 60.07)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK LVFI FFLL L ANTIESR+EPG HH R   I D+SL   I + I + PNSLLSEK+M +DC  TLK E GKLF K++EPR  A+F  D  KT LF+
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY
         DI+PR S +FYPD+TK    A+D+E + ++SFYPD +K KLF+KDIEPRPSASFYPDD K K  A+DIEPRP+++FYPD  KTKLF+KDI+PRPS SFY
Subjt:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY

Query:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK--------------VEPQPSATFYRDNLKAKELSAD
        PDDTK K  A+DIE RP+ SFYPDV KTKLF KDIEPRPS                +EP+P+ +FY D +K K  S D
Subjt:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK--------------VEPQPSATFYRDNLKAKELSAD

XP_023554805.1 protein PELPK1-like isoform X4 [Cucurbita pepo subsp. pepo] (e value: 8.0e-82; % identity: 50.14)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK LVFI FFLL L ANT++SR+EPG HH R   I ++ LH  I++ IH+DPNSLLSEK+  +DC ETLK E GKLF K++EPR  A+F  D  KT LF+
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDD----------------------IKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFY
         DIEPR S +FYPD  KT  F+ D+E + S SFYPDD                      +K KLF+KDIEPRPSASFYPDD K +  A+DIEPRP+++FY
Subjt:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDD----------------------IKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFY

Query:  PDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK------------------------------------
        PD  KTKLF+KDI+PRPS SFYPDDT+ K   +DIE RP+ SFYPDV KTKLF KDIEPRPS                                      
Subjt:  PDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK------------------------------------

Query:  VEPQPSATFY--------------RDNLKAKELSADAHHGEADIQVAQA
        ++P+PSA+FY               DNL+AK+   DAHH +A  QV  A
Subjt:  VEPQPSATFY--------------RDNLKAKELSADAHHGEADIQVAQA

XP_038888400.1 uncharacterized protein LOC120078247 [Benincasa hispida] (e value: 3.2e-83; % identity: 61.73)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK L FITFFLL LFANTIES +E G  HRR L I D+S H   + +IH+DPNSLLSE KM DDC +TLK   GKLF ++I+PR SA+FY +D KT  F 
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY
        KD+EPR S T                      FYPDD+K KLF KD+EPRPSA+FYPDD KTKLF KD+EPRPS TFYPDD K KLF KD++PRPS SFY
Subjt:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY

Query:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLKVEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA
        PDD K KLF KD+ELRPS SFYPD  KTKLF KD+EPR        PS TFY +NLKAKE   DAH  EA+ QVAQA
Subjt:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLKVEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA

XP_038888700.1 uncharacterized protein LOC120078501 [Benincasa hispida] (e value: 2.6e-85; % identity: 49.34)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK L FITFFLL LFAN IESRHEPG HH R L I DE    AIKD+IH+DP SLLSE K  DDC +TLK   GKLF ++IEP+ SAS Y  DT+T LF 
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSV------------------------------------------TFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPD
        +D EPRSS+                                          TFYPD+ KT LF KDLE + ++SFYPDD+K KLF KD+EPRP+ SFYPD
Subjt:  KDIEPRSSV------------------------------------------TFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPD

Query:  DFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK----------------
        D KTKLF KD+EPRP+++FYPDD KTKLF KD++PRP+ SFYPDD KTK F KD+E RP+ SFYPD  KT+LF KD+EP+P+L                 
Subjt:  DFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK----------------

Query:  ------------------------------------------VEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA
                                                  +E QP+ +FY  NLKAKE S DAH GEADIQ+AQA
Subjt:  ------------------------------------------VEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA
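
Each alignment block above carries a middle line in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way a percent identity could be derived from such a block. It is an assumption that this BLAST-style convention applies here; the page does not say how its % identity column is computed, and the function name `midline_identity` is hypothetical.

```python
def midline_identity(query: str, midline: str) -> tuple[float, float]:
    """Return (% identity, % positives) for one aligned segment.

    `query` is the aligned query string (gaps written as '-'), and
    `midline` is the line printed beneath it. The alignment length is
    taken from `query`, since trailing spaces of the midline are often lost.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment")
    if len(midline) > length:
        raise ValueError("midline is longer than the aligned query")
    identical = sum(ch.isalpha() for ch in midline)   # letters = exact matches
    positives = identical + midline.count("+")        # '+' = conservative change
    return 100.0 * identical / length, 100.0 * positives / length


# Toy example (not taken from the hits above): 5 of 6 positions identical,
# and the remaining position is a conservative substitution.
identity, positives = midline_identity("MKRLVF", "MK+LVF")
print(round(identity, 2), round(positives, 2))
```

For a multi-row alignment, the query rows and midline rows would be concatenated in order before calling the function.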

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K3R4 Uncharacterized protein (e value: 1.2e-64; % identity: 54.67)
Query:  ITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHT-DPNSLLSEKKMVDDCIE----TLKP---EYGKLFSKDIEPRSSASFY-LDDTKTN
        IT  LL LF N IESR+EPGG  + +  I D+SL    ++        SL +E    +D       T  P      + F+KDIEPR SA+FY  D++K  
Subjt:  ITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHT-DPNSLLSEKKMVDDCIE----TLKP---EYGKLFSKDIEPRSSASFY-LDDTKTN

Query:  LFAKDIEPRSSVTFYP-DETKTNLFAKDLEQQSSVSFYP-DDIKIKLFAKDIEPRPSASFYP-DDFKTKLFAKDIEPRPSVTFYP-DDTKTKLFAKDIKP
         F KDIEPR S TFYP D+TK  LF KD+E + S +FYP DD K KLF KDIEPRPSA+FYP DD K KLF KDIEPRPS TFYP DDTK KLF KDI+P
Subjt:  LFAKDIEPRSSVTFYP-DETKTNLFAKDLEQQSSVSFYP-DDIKIKLFAKDIEPRPSASFYP-DDFKTKLFAKDIEPRPSVTFYP-DDTKTKLFAKDIKP

Query:  RPSGSFYP-DDTKTKLFAKDIELRPSGSFYP-DVTKTKLFVKDIEPRPSL---------------KVEPQPSATFY-RDNLKAKELSAD
        RPS +FYP DDTK KLF KDIE RPS +FYP D TK KLF KDIEPRPS                 +EP+PSATFY  D+ K K  + D
Subjt:  RPSGSFYP-DDTKTKLFAKDIELRPSGSFYP-DVTKTKLFVKDIEPRPSL---------------KVEPQPSATFY-RDNLKAKELSAD

A0A0A0K5Y0 Uncharacterized protein (e value: 3.3e-73; % identity: 46.84)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLF------------------SKDIE
        MK L+FI  FLL LFA TIESRHEPG HH R L                         K  +DDC ETLK E GKLF                  SKD+E
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLF------------------SKDIE

Query:  PRSSASFYLDDTKTNLF----------------------------------------------------------------AKDIEPRSSVTFYPDETKT
         R S SF  DDT+T LF                                                                AKDIEPR +V+FYPD+TKT
Subjt:  PRSSASFYLDDTKTNLF----------------------------------------------------------------AKDIEPRSSVTFYPDETKT

Query:  NLFAKDLEQQSSVSFYPDD-IKIKLFAKDIEPRPSASFYPDD-FKTKLFAKDIEPRPSVTFYP-DDTKTKLFAKDIKPRPSGSFYPDD-TKTKLFAKDIE
         LFA+DLE + +VSFYPDD  K KLFA+D+EPRP+ SFYPDD  KTKLFA+D+EPRP+V+FYP DDTKTKLF +D++PRP+ SFYPDD TKTKLFA+D+E
Subjt:  NLFAKDLEQQSSVSFYPDD-IKIKLFAKDIEPRPSASFYPDD-FKTKLFAKDIEPRPSVTFYP-DDTKTKLFAKDIKPRPSGSFYPDD-TKTKLFAKDIE

Query:  LRPSGSFYP-DVTKTKLFVKDIEPRPSLK---------------VEPQPSATFYRDNLKAKE-LSADAHHGEADIQVAQA
         RP+  FYP D  KTKL V++IEPRP++                +EP+P+ +FY DNLKAKE LSA +HHGEA +QVAQA
Subjt:  LRPSGSFYP-DVTKTKLFVKDIEPRPSLK---------------VEPQPSATFYRDNLKAKE-LSADAHHGEADIQVAQA

A0A5A7SZJ0 Proteoglycan 4-like isoform X1 (e value: 4.0e-63; % identity: 60.18)
Query:  KLFSKDIEPRSSASFYLDD-TKTNLFAKDIEPRSSVTFYP-DETKTNLFAKDLEQQSSVSFYPD-DIKIKLFAKDIEPRPSASFY-PDDFKTKLFAKDIE
        KLF +D+EPR + SFY DD TKT LF +D+EPR +V+FYP DETKT LFAKD+E + +VSFYPD + K +LFAKD+EPRP+ SFY  DD KTKLFAKD+E
Subjt:  KLFSKDIEPRSSASFYLDD-TKTNLFAKDIEPRSSVTFYP-DETKTNLFAKDLEQQSSVSFYPD-DIKIKLFAKDIEPRPSASFY-PDDFKTKLFAKDIE

Query:  PRPSVTFYP-DDTKTKLFAKDIKPRPSGSFYPDD-TKTKLFAKDIELRPSGSFYP-DVTKTKLFVKDIEPRPSLK---------------VEPQPSATFY
        PRP+++FYP DDTKTK F +D++PRP+ SFYPDD T TKLFAK +E RP+ SFYP D TKTK  V++IE +P++                +EP+P+ +FY
Subjt:  PRPSVTFYP-DDTKTKLFAKDIKPRPSGSFYPDD-TKTKLFAKDIELRPSGSFYP-DVTKTKLFVKDIEPRPSLK---------------VEPQPSATFY

Query:  RDNLKAKE-LSADAHHGEADIQVAQA
         +NLKAKE LSAD+H GEA +QVAQA
Subjt:  RDNLKAKE-LSADAHHGEADIQVAQA

A0A6J1GJM6 organ-specific protein S2-like isoform X2 (e value: 3.3e-81; % identity: 59.21)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK LVFI FFLL L ANTIESRHEPG HH R   I ++ LH  I++ IH+DPNSLLSEK+  +DC ETLK E GKLF K++EPR  A+F  D  KT LF+
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY
        KDIEPR S +FYPD  KT LF+ D+E + S SFYPDD K +  A++IEPRP+ SFYPD  KTKLF+KDIEPRPS +FYPDDTK K  A+DI+PRP+ SFY
Subjt:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY

Query:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLKVEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA
        PD  + +L +KDI+ RPS SFYPD  KT   VKD +                 DNLKAK+   DAHH +ADIQVA A
Subjt:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLKVEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA

A0A6J1GKY7 uncharacterized protein LOC111454914 isoform X1 (e value: 7.0e-84; % identity: 60.07)
Query:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA
        MK LVFI FFLL L ANTIESR+EPG HH R   I D+SL   I + I + PNSLLSEK+M +DC  TLK E GKLF K++EPR  A+F  D  KT LF+
Subjt:  MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFA

Query:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY
         DI+PR S +FYPD+TK    A+D+E + ++SFYPD +K KLF+KDIEPRPSASFYPDD K K  A+DIEPRP+++FYPD  KTKLF+KDI+PRPS SFY
Subjt:  KDIEPRSSVTFYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFY

Query:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK--------------VEPQPSATFYRDNLKAKELSAD
        PDDTK K  A+DIE RP+ SFYPDV KTKLF KDIEPRPS                +EP+P+ +FY D +K K  S D
Subjt:  PDDTKTKLFAKDIELRPSGSFYPDVTKTKLFVKDIEPRPSLK--------------VEPQPSATFYRDNLKAKELSAD

SwissProt top hits (columns: e value, % identity, alignment)
P17772 Organ-specific protein S2 (e value: 3.7e-05; % identity: 28.8)
Query:  ITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDD---TKTNLFA-KD
        +  FLL + AN +ESR + G + +  L + D+ +   I+ L+  D +++ + K          K   G +   + EPR  AS Y D+    K N+ A  +
Subjt:  ITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDD---TKTNLFA-KD

Query:  IEPRSSVTFYPDE----TKTNLFAKDLEQQSSVSFYPDD----IKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFY
         EPR + + Y D      +    + + E + ++S Y D+     + K    + E RP+AS Y D+     F  D EPRPS+T Y
Subjt:  IEPRSSVTFYPDE----TKTNLFAKDLEQQSSVSFYPDD----IKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFY

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAGAGGCTTGTCTTCATCACTTTCTTCTTACTTACGTTGTTTGCAAACACCATTGAATCAAGGCATGAGCCGGGAGGTCATCACCGGAGAATTTTGACGATCGGCGA
CGAGTCGTTGCATGGTGCGATTAAAGACCTTATCCATACTGATCCAAACTCGCTTCTTTCGGAGAAGAAAATGGTTGACGATTGCATCGAAACTCTGAAACCTGAATATG
GAAAGCTTTTTTCCAAAGATATAGAACCACGATCGAGTGCCTCATTTTATCTAGATGACACCAAAACAAACCTTTTCGCCAAAGATATAGAACCACGCTCGAGTGTTACA
TTTTATCCGGATGAGACCAAAACAAACCTTTTTGCTAAAGATTTAGAACAACAGTCGAGTGTCTCATTTTATCCCGATGACATAAAGATAAAACTTTTCGCTAAAGATAT
AGAACCACGACCGAGTGCCTCATTTTATCCCGATGATTTCAAAACAAAACTTTTCGCCAAAGATATAGAACCACGACCGAGCGTCACATTTTATCCTGATGACACCAAAA
CAAAACTTTTCGCCAAAGATATAAAACCACGACCAAGTGGCTCATTTTATCCTGATGACACAAAAACAAAGCTTTTCGCGAAAGATATAGAACTACGACCGAGTGGCTCA
TTTTATCCCGATGTCACCAAAACAAAACTTTTCGTCAAAGATATAGAACCACGACCGAGCTTAAAGGTAGAACCACAACCAAGTGCCACATTTTATCGAGACAATCTCAA
AGCGAAAGAGTTGTCAGCTGATGCTCATCATGGCGAAGCTGACATACAGGTGGCACAAGCTTAA
mRNA sequence
ATGAAGAGGCTTGTCTTCATCACTTTCTTCTTACTTACGTTGTTTGCAAACACCATTGAATCAAGGCATGAGCCGGGAGGTCATCACCGGAGAATTTTGACGATCGGCGA
CGAGTCGTTGCATGGTGCGATTAAAGACCTTATCCATACTGATCCAAACTCGCTTCTTTCGGAGAAGAAAATGGTTGACGATTGCATCGAAACTCTGAAACCTGAATATG
GAAAGCTTTTTTCCAAAGATATAGAACCACGATCGAGTGCCTCATTTTATCTAGATGACACCAAAACAAACCTTTTCGCCAAAGATATAGAACCACGCTCGAGTGTTACA
TTTTATCCGGATGAGACCAAAACAAACCTTTTTGCTAAAGATTTAGAACAACAGTCGAGTGTCTCATTTTATCCCGATGACATAAAGATAAAACTTTTCGCTAAAGATAT
AGAACCACGACCGAGTGCCTCATTTTATCCCGATGATTTCAAAACAAAACTTTTCGCCAAAGATATAGAACCACGACCGAGCGTCACATTTTATCCTGATGACACCAAAA
CAAAACTTTTCGCCAAAGATATAAAACCACGACCAAGTGGCTCATTTTATCCTGATGACACAAAAACAAAGCTTTTCGCGAAAGATATAGAACTACGACCGAGTGGCTCA
TTTTATCCCGATGTCACCAAAACAAAACTTTTCGTCAAAGATATAGAACCACGACCGAGCTTAAAGGTAGAACCACAACCAAGTGCCACATTTTATCGAGACAATCTCAA
AGCGAAAGAGTTGTCAGCTGATGCTCATCATGGCGAAGCTGACATACAGGTGGCACAAGCTTAA
Protein sequence
MKRLVFITFFLLTLFANTIESRHEPGGHHRRILTIGDESLHGAIKDLIHTDPNSLLSEKKMVDDCIETLKPEYGKLFSKDIEPRSSASFYLDDTKTNLFAKDIEPRSSVT
FYPDETKTNLFAKDLEQQSSVSFYPDDIKIKLFAKDIEPRPSASFYPDDFKTKLFAKDIEPRPSVTFYPDDTKTKLFAKDIKPRPSGSFYPDDTKTKLFAKDIELRPSGS
FYPDVTKTKLFVKDIEPRPSLKVEPQPSATFYRDNLKAKELSADAHHGEADIQVAQA
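
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the sequence T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unexpected codon {codon!r} at position {i + 1}")
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGAAGAGGCTTGTCTTCATCACT"))  # prints MKRLVFIT
```

Pasting the full 834-nucleotide CDS into `translate` should reproduce the 277-residue protein shown above, since the final codon TAA is a stop.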