; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg040040 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg040040
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold10:24395312..24408058
RNA-Seq ExpressionSpg040040
SyntenySpg040040
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]2.9e-3032.6Show/hide
Query:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    LVREFYAN+         V+ ++V ++ RAIN ++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF
             T     LKR A  W  F+  R +P+TH  TV+++RVLL+++IL  +S+++ +I   EI  C   +K G L+FP+ IT L   A VP  + + I+ 
Subjt:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF

Query:  DKGIIDTPNLARLQRMQEV--------RQGGLVHGINTILEKLALSASRQDLDWLSLIRSRAKLRQVLRIELK
        + G I T +++R+ + + V        R G    G  T       + S +    L L+  R  L+++ + E++
Subjt:  DKGIIDTPNLARLQRMQEV--------RQGGLVHGINTILEKLALSASRQDLDWLSLIRSRAKLRQVLRIELK

PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]7.2e-2934.38Show/hide
Query:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNF
        RFV+  A  +Y + L+ +  + ERGF   GE    H   T +    W+ F + PES    LVREFYAN    +    +VRG EV +    IN LYN+   
Subjt:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNF

Query:  PHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC
           A+       +     +  R +   GAQW+++K +  +F+S  L + A  W+ FI  RMLPT H   V+ +R LL++ I+   + DVGKII+D I   
Subjt:  PHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC

Query:  WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGG
               L+FP+ IT LC  AGV  DE + ++F +  ID   + R+        GG
Subjt:  WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGG

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.2e-3134.34Show/hide
Query:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRG

Query:  IEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAI
        ++V WS  AIN ++ L + P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  TVS++R+LL+ ++
Subjt:  IEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAI

Query:  LRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL
        L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  LRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.7e-3336.14Show/hide
Query:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNL
        +F    A  +Y   +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AIN ++ L
Subjt:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNL

Query:  QNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEI
         + P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS++R+LL+ ++L   SI+VG++I  EI
Subjt:  QNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEI

Query:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.5e-2933.63Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQ
        +F +  A  +Y E        ++++F+++     E P F+   I  H W+ FC+ PE     LVREFY N+   +     +RG++V  S  AIN +++L 
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQ

Query:  NFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIF
        + P   ++E     +  +L   +  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS+E V L++++L   SI+VG++I  EI 
Subjt:  NFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIF

Query:  GCWKKKVGKLFFPNTITMLCRGAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  GCWKKKVGKLFFPNTITMLCRGAGVP

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein3.5e-2934.38Show/hide
Query:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNF
        RFV+  A  +Y + L+ +  + ERGF   GE    H   T +    W+ F + PES    LVREFYAN    +    +VRG EV +    IN LYN+   
Subjt:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNF

Query:  PHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC
           A+       +     +  R +   GAQW+++K +  +F+S  L + A  W+ FI  RMLPT H   V+ +R LL++ I+   + DVGKII+D I   
Subjt:  PHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC

Query:  WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGG
               L+FP+ IT LC  AGV  DE + ++F +  ID   + R+        GG
Subjt:  WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGG

A0A2P5AGA5 Uncharacterized protein (Fragment)5.7e-3234.34Show/hide
Query:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRG

Query:  IEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAI
        ++V WS  AIN ++ L + P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  TVS++R+LL+ ++
Subjt:  IEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAI

Query:  LRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL
        L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  LRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.8e-3336.14Show/hide
Query:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNL
        +F    A  +Y   +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AIN ++ L
Subjt:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNL

Query:  QNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEI
         + P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS++R+LL+ ++L   SI+VG++I  EI
Subjt:  QNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEI

Query:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL

A0A2P5DAQ2 Uncharacterized protein1.2e-2933.63Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQ
        +F +  A  +Y E        ++++F+++     E P F+   I  H W+ FC+ PE     LVREFY N+   +     +RG++V  S  AIN +++L 
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQ

Query:  NFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIF
        + P   ++E     +  +L   +  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS+E V L++++L   SI+VG++I  EI 
Subjt:  NFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIF

Query:  GCWKKKVGKLFFPNTITMLCRGAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  GCWKKKVGKLFFPNTITMLCRGAGVP

W9QTD9 Uncharacterized protein1.4e-3032.6Show/hide
Query:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    LVREFYAN+         V+ ++V ++ RAIN ++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF
             T     LKR A  W  F+  R +P+TH  TV+++RVLL+++IL  +S+++ +I   EI  C   +K G L+FP+ IT L   A VP  + + I+ 
Subjt:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF

Query:  DKGIIDTPNLARLQRMQEV--------RQGGLVHGINTILEKLALSASRQDLDWLSLIRSRAKLRQVLRIELK
        + G I T +++R+ + + V        R G    G  T       + S +    L L+  R  L+++ + E++
Subjt:  DKGIIDTPNLARLQRMQEV--------RQGGLVHGINTILEKLALSASRQDLDWLSLIRSRAKLRQVLRIELK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAAGAATTTATGGCTCGTACAGACGCCGTAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTAGGTCAGCTAGCTAATGAGCTGAAGGCAAG
GCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAAAAGGGAAGGTAAGGAGCAGGTACAGGCAGTGACTCTTAGGAGTGGTAAGCCACTAGAAGAAAGAAGAGAGC
CTAGTAAAACCCAGGATATAGATAATAATTACGATAGAAATAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCGATAAAGATGCTGGAGCA
TCTGACGGATTGTTGCGGCAAGGATTCTATTTTGCTCAAGCCTTATTTATTGGCAACTCACGGTACGTTTCTCTTACTTCATCCTTTTTGCTTCAATCTTTTGCTTTCTT
TTTCGTTTTCATTCTCTGTAAATCCCTTGAGTCTTCAATGGCAAAAACAAGAGCGCGAAAAGAAAGGGAGAATGAGGAAGAAGAGGTGTCTGTTACCCCTGAAGTGCAGA
AAGTTAAGACGAAGAAGAAAAGGACCCCGGAGGAGAAGGAAGCCAAGAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGATTGTTGCTGCGGCA
GTTGAAGAAGGATACCCGCAAGAACCTGATGTACAGAACCCAGAGGAGGGTGAGCAGAGAGTTGTGGATATGGAAGAAGAGGAGCGAACAGAAGAAGTTCAAGAGGAGCG
GCCGGAAGTGCAAGAAGAAGTTCAAGAACAGCAGGTCGAGGATGTTCAAATGCAACAGGAAGAAGAGGTTCAGGTACCGGATAATGAGCCAATACAGGAGGCTCAAGTAG
AGGTGATCATGCCGGAGGTGCCAAAGCGTCGCCGCGTTAAGAGAAAAGCAGGCCGCGCTAGGGTTGTCCGGAATGATACTCCATCGCCTCTGACCACTGATTCTGAAAGA
GAAAATGCAGGAAAAGAGGAACGAGAAAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGGCAGAGGAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAA
GGGCAAAAATGTTCCTGGAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACTGGATGATCGCTTCGTCAATAATTTTGCCAGAGCAAAATACGCTGAGCTTC
TGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGGTGAGCTTCCGCATTTTCTGAGGACTGGTATTGCGAATCATGGGTGGGAACGATTCTGTTCAAAACCCGAATCT
GTAAACGCGCAGTTAGTGCGCGAATTCTATGCAAATATCGACCGAGGAGAAGGTTTCCTAGCAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGGGCTATCAACGT
ACTGTATAACCTTCAGAATTTCCCCCATCCGGCATATAATGAGATGGCTGTCGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTAGGTATTGAAGGGGCAC
AATGGCAGCTATCCAAGACTCAGAAGAGGACATTCCAATCGGCTTATTTGAAAAGGGAAGCAAATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCAACGACTCAT
GACTCGACAGTCTCTAGGGAACGAGTGCTTCTGGTTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATTTGGTTGTTGGAAGAA
GAAAGTGGGGAAACTATTCTTTCCGAACACAATCACAATGCTTTGTAGAGGAGCAGGGGTTCCGGAAGATGAAGGGGATGTGATTCTGTTTGACAAGGGAATCATTGACA
CGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTACACGGCATCAACACGATTTTAGAAAAACTCGCACTTTCGGCCAGCAGGCAGGAC
TTGGACTGGTTAAGCTTAATTAGATCACGAGCTAAGCTGAGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGC
AGAGCTTGGTTTTGCAGAGTGCTCAGGGAGCGAATTTTATGCTGGAGCAAACCCGGAGGCAAAACTGCCACGTCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGT
TGAATTATTTTCGGGATAAAGGGGCAAGGAGAGCCTTACACGTGTCCAGAAAAACAGAGGAAAAGCCGGAATTCCCCAGAAATGCGACCGCATTTCTGGGTCGCATTTCT
GGGAAGGCAAAAATGAAATGCGACAGCATTTCTGGAAAAACAGAGGCCCTCACACACGCACCACGGTTGTCAAAGACAGAGAAGAGGACCTTCCAAGCGGCCTATCTCAA
GAGTGAAGCCAACACTTGGTTAGGATTCGTCAAACTGCGTCTGCTGCCAACAACCCACGACTCTACTGTCTCCCATGATCACATTCTCTTGGTATTTGCAATCCTGAGAG
CTGGGGTTCCCGCTAGTGCAGAAGATGTTATCCTGATGGATAAGGGAATAATAGACACGCCGAACCTGGCAAGGCTTCAAAGGACTCAGGAAGCACGCCAAGGCGGTTTG
GTGTGTGGCATCCACCAAATACAAGAACAACTGCAAATGCATTCCAGCCAAATGGAGTTTGCTGAGAGGCAATTGCAAACATACTGGAATTATGTTAAGCGGAGGGATGC
CACACTAAGGAGGGCTTTGCAATCCAACTTTTCCAAGCCATATCAAGCCTTCCCTGTTTTTCCCAATGATCTATTGAACCCCTGGATCCCACCACCGCCGATAGAAAGAG
AAAGAGATGAGGATGAAGACGTTGGTCAGGAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAAAGAATTTATGGCTCGTACAGACGCCGTAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTAGGTCAGCTAGCTAATGAGCTGAAGGCAAG
GCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAAAAGGGAAGGTAAGGAGCAGGTACAGGCAGTGACTCTTAGGAGTGGTAAGCCACTAGAAGAAAGAAGAGAGC
CTAGTAAAACCCAGGATATAGATAATAATTACGATAGAAATAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCGATAAAGATGCTGGAGCA
TCTGACGGATTGTTGCGGCAAGGATTCTATTTTGCTCAAGCCTTATTTATTGGCAACTCACGGTACGTTTCTCTTACTTCATCCTTTTTGCTTCAATCTTTTGCTTTCTT
TTTCGTTTTCATTCTCTGTAAATCCCTTGAGTCTTCAATGGCAAAAACAAGAGCGCGAAAAGAAAGGGAGAATGAGGAAGAAGAGGTGTCTGTTACCCCTGAAGTGCAGA
AAGTTAAGACGAAGAAGAAAAGGACCCCGGAGGAGAAGGAAGCCAAGAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGATTGTTGCTGCGGCA
GTTGAAGAAGGATACCCGCAAGAACCTGATGTACAGAACCCAGAGGAGGGTGAGCAGAGAGTTGTGGATATGGAAGAAGAGGAGCGAACAGAAGAAGTTCAAGAGGAGCG
GCCGGAAGTGCAAGAAGAAGTTCAAGAACAGCAGGTCGAGGATGTTCAAATGCAACAGGAAGAAGAGGTTCAGGTACCGGATAATGAGCCAATACAGGAGGCTCAAGTAG
AGGTGATCATGCCGGAGGTGCCAAAGCGTCGCCGCGTTAAGAGAAAAGCAGGCCGCGCTAGGGTTGTCCGGAATGATACTCCATCGCCTCTGACCACTGATTCTGAAAGA
GAAAATGCAGGAAAAGAGGAACGAGAAAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGGCAGAGGAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAA
GGGCAAAAATGTTCCTGGAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACTGGATGATCGCTTCGTCAATAATTTTGCCAGAGCAAAATACGCTGAGCTTC
TGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGGTGAGCTTCCGCATTTTCTGAGGACTGGTATTGCGAATCATGGGTGGGAACGATTCTGTTCAAAACCCGAATCT
GTAAACGCGCAGTTAGTGCGCGAATTCTATGCAAATATCGACCGAGGAGAAGGTTTCCTAGCAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGGGCTATCAACGT
ACTGTATAACCTTCAGAATTTCCCCCATCCGGCATATAATGAGATGGCTGTCGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTAGGTATTGAAGGGGCAC
AATGGCAGCTATCCAAGACTCAGAAGAGGACATTCCAATCGGCTTATTTGAAAAGGGAAGCAAATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCAACGACTCAT
GACTCGACAGTCTCTAGGGAACGAGTGCTTCTGGTTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATTTGGTTGTTGGAAGAA
GAAAGTGGGGAAACTATTCTTTCCGAACACAATCACAATGCTTTGTAGAGGAGCAGGGGTTCCGGAAGATGAAGGGGATGTGATTCTGTTTGACAAGGGAATCATTGACA
CGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTACACGGCATCAACACGATTTTAGAAAAACTCGCACTTTCGGCCAGCAGGCAGGAC
TTGGACTGGTTAAGCTTAATTAGATCACGAGCTAAGCTGAGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGC
AGAGCTTGGTTTTGCAGAGTGCTCAGGGAGCGAATTTTATGCTGGAGCAAACCCGGAGGCAAAACTGCCACGTCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGT
TGAATTATTTTCGGGATAAAGGGGCAAGGAGAGCCTTACACGTGTCCAGAAAAACAGAGGAAAAGCCGGAATTCCCCAGAAATGCGACCGCATTTCTGGGTCGCATTTCT
GGGAAGGCAAAAATGAAATGCGACAGCATTTCTGGAAAAACAGAGGCCCTCACACACGCACCACGGTTGTCAAAGACAGAGAAGAGGACCTTCCAAGCGGCCTATCTCAA
GAGTGAAGCCAACACTTGGTTAGGATTCGTCAAACTGCGTCTGCTGCCAACAACCCACGACTCTACTGTCTCCCATGATCACATTCTCTTGGTATTTGCAATCCTGAGAG
CTGGGGTTCCCGCTAGTGCAGAAGATGTTATCCTGATGGATAAGGGAATAATAGACACGCCGAACCTGGCAAGGCTTCAAAGGACTCAGGAAGCACGCCAAGGCGGTTTG
GTGTGTGGCATCCACCAAATACAAGAACAACTGCAAATGCATTCCAGCCAAATGGAGTTTGCTGAGAGGCAATTGCAAACATACTGGAATTATGTTAAGCGGAGGGATGC
CACACTAAGGAGGGCTTTGCAATCCAACTTTTCCAAGCCATATCAAGCCTTCCCTGTTTTTCCCAATGATCTATTGAACCCCTGGATCCCACCACCGCCGATAGAAAGAG
AAAGAGATGAGGATGAAGACGTTGGTCAGGAGGATTAG
Protein sequenceShow/hide protein sequence
MMKEFMARTDAVIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPKREGKEQVQAVTLRSGKPLEERREPSKTQDIDNNYDRNNVVVEKELESGQGAGGSDKDAGA
SDGLLRQGFYFAQALFIGNSRYVSLTSSFLLQSFAFFFVFILCKSLESSMAKTRARKERENEEEEVSVTPEVQKVKTKKKRTPEEKEAKRRRRQQRAEEQEKATEIVAAA
VEEGYPQEPDVQNPEEGEQRVVDMEEEERTEEVQEERPEVQEEVQEQQVEDVQMQQEEEVQVPDNEPIQEAQVEVIMPEVPKRRRVKRKAGRARVVRNDTPSPLTTDSER
ENAGKEEREKKEAEERAREEAEEKAEEERLLKRRAEKGKNVPGASEEHDEIEEQQLLDDRFVNNFARAKYAELLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPES
VNAQLVREFYANIDRGEGFLAVVRGIEVDWSPRAINVLYNLQNFPHPAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTH
DSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGGLVHGINTILEKLALSASRQD
LDWLSLIRSRAKLRQVLRIELKVVIICPCRKNYFAAAELGFAECSGSEFYAGANPEAKLPRHSSLANFMNRLLLNYFRDKGARRALHVSRKTEEKPEFPRNATAFLGRIS
GKAKMKCDSISGKTEALTHAPRLSKTEKRTFQAAYLKSEANTWLGFVKLRLLPTTHDSTVSHDHILLVFAILRAGVPASAEDVILMDKGIIDTPNLARLQRTQEARQGGL
VCGIHQIQEQLQMHSSQMEFAERQLQTYWNYVKRRDATLRRALQSNFSKPYQAFPVFPNDLLNPWIPPPPIERERDEDEDVGQED