; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005752 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005752
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationscaffold11:32781..36312
RNA-Seq ExpressionSpg005752
SyntenySpg005752
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW13148.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.1e-2733.85Show/hide
Query:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLE--VRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQ
        GG+LI+WD   L+  E + G +++SVKFS      +WI  VY PN    RK  W EL    G+ E  +R   DH+P++ +   F WGP+PFRF N WL  
Subjt:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLE--VRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQ

Query:  SDCVQIIAS-LSQDTSYGWAGFAIACKLRNLKSVLKD--RYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSARGEFLELYMSEERNLIQR
        ++  +      S     GW G     +L+ +K+ LK+  +++  ELK K+  KS+L ++   D +  +  L+    S R+S +GE  EL + EE +  Q+
Subjt:  SDCVQIIAS-LSQDTSYGWAGFAIACKLRNLKSVLKD--RYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSARGEFLELYMSEERNLIQR

Query:  CKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYT
         K++W+K GD N+ F+ +V   ++      +    R G        I  EIL YF  LYT
Subjt:  CKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYT

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida]3.9e-3645.08Show/hide
Query:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQ-DTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ
        + R++ DHFPLLFEA +F+WGPSPFRF NSWL   +C +II + S       WAGFA+  +LR +K  +K   A+HE  +K +E+SLL EI   D  +D 
Subjt:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQ-DTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK
            S E+ +R S + + L LY +EER+LIQ+ KL WL  GDENT+FF + L +K+ + +  + +F+  G      R+IE  ILD+F++LYTK
Subjt:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]3.9e-3645.08Show/hide
Query:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQ-DTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ
        + R++ DHFPLLFEA +F+WGPSPFRF NSWL   +C +II + S       WAGFA+  +LR +K  +K   A+HE  +K +E+SLL EI   D  +D 
Subjt:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQ-DTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK
            S E+ +R S + + L LY +EER+LIQ+ KL WL  GDENT+FF + L +K+ + +  + +F+  G      R+IE  ILD+F++LYTK
Subjt:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]3.9e-3645.08Show/hide
Query:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQ-DTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ
        + R++ DHFPLLFEA +F+WGPSPFRF NSWL   +C +II + S       WAGFA+  +LR +K  +K   A+HE  +K +E+SLL EI   D  +D 
Subjt:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQ-DTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK
            S E+ +R S + + L LY +EER+LIQ+ KL WL  GDENT+FF + L +K+ + +  + +F+  G      R+IE  ILD+F++LYTK
Subjt:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida]1.1e-2739.9Show/hide
Query:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSD-CVQIIASLSQDTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ
        + R++ DHFPL  EA +F+WGPS FRF NSWLN  + C  I  SL +  ++ WA   ++  LR  KS LK  + +   + K KE+SLL E+ R D+L+  
Subjt:  EVRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQSD-CVQIIASLSQDTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK
                    S + + L LY  EE++LIQ+CKL+WLK GDENT+FF + L ++K + + +  + +       F RDIE  IL +++ LY+K
Subjt:  CPLSSLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTK

TrEMBL top hitse value%identityAlignment
A0A438BQB2 Transposon TX1 uncharacterized 149 kDa protein5.5e-2833.85Show/hide
Query:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLE--VRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQ
        GG+LI+WD   L+  E + G +++SVKFS      +WI  VY PN    RK  W EL    G+ E  +R   DH+P++ +   F WGP+PFRF N WL  
Subjt:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLE--VRSLLDHFPLLFEARSFQWGPSPFRFYNSWLNQ

Query:  SDCVQIIAS-LSQDTSYGWAGFAIACKLRNLKSVLKD--RYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSARGEFLELYMSEERNLIQR
        ++  +      S     GW G     +L+ +K+ LK+  +++  ELK K+  KS+L ++   D +  +  L+    S R+S +GE  EL + EE +  Q+
Subjt:  SDCVQIIAS-LSQDTSYGWAGFAIACKLRNLKSVLKD--RYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSARGEFLELYMSEERNLIQR

Query:  CKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYT
         K++W+K GD N+ F+ +V   ++      +    R G        I  EIL YF  LYT
Subjt:  CKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYT

A0A438I181 Transposon TX1 uncharacterized 149 kDa protein2.1e-2429.27Show/hide
Query:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELR----------------------------------SGVLEVRS
        GG+LI+WD  +L+  E + G +++S+KF+    + +W+  VY PN+   RK  W EL                                    GVL  R 
Subjt:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELR----------------------------------SGVLEVRS

Query:  LLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIAS-LSQDTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLS
          DH+P++ E   F+WGP+PFRF N WL  S   +      S+    GW G     KL+ +K+ LK+       +  +K+K +LA +   D+L  +  LS
Subjt:  LLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIAS-LSQDTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLS

Query:  SLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLY
              R+ ++GE  EL + EE +  Q+ +++W+K GD N+ FF +V   ++      + + +  G        I+ EIL YF  LY
Subjt:  SLEQSLRSSARGEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLY

A0A438JK46 LINE-1 retrotransposable element ORF2 protein1.6e-2430.07Show/hide
Query:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLEVRSL--------LDHFPLLFEARSFQWGPSPFRFY
        GG++I+WD S+ +  E + G ++++VKF+   E   W+ +VY P +   RK  W EL+   G+   R           DH P+  E    +WGP+PFRF 
Subjt:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLEVRSL--------LDHFPLLFEARSFQWGPSPFRFY

Query:  NSWLNQSDCVQIIASLSQD-TSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLSS---LEQSLRSSARGEFLELYMSE
        N WL   +  +      Q+ T  GW G     KL+ +KS LK+         K ++K +L +++R+D +  +  L+S   LE++LR   R E  ++ + E
Subjt:  NSWLNQSDCVQIIASLSQD-TSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLSS---LEQSLRSSARGEFLELYMSE

Query:  ERNLIQRCKLQWLKAGDENTNFFSQV-LGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTKIPEELMKLGITDGVREVSLQ-HSAIWV
        E    Q+ +++W+K GD N+ FF +V  G +  + I  + + S  G +     DI  EI+++F +LY+K   E  ++   +G+  V +   S +W+
Subjt:  ERNLIQRCKLQWLKAGDENTNFFSQV-LGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTKIPEELMKLGITDGVREVSLQ-HSAIWV

A0A5A7V878 DUF4283 domain-containing protein2.1e-2447.37Show/hide
Query:  SKVLLNPFMADKALVKLDDSYGGWISVKNLPLPLWNRSTFEVIGQHFGGLVSISSQTLNLLECSEARIEVRKNLCGFIPAEIVVTDKKHGNFSLRFGDIS
        S+VL   F   K+++ +   YGGWIS+KNLPL  W+   ++ IG  FGG  SIS +T+NL+ CSEA+I+V +NLCGF+PA + + D    N  L FGDI 
Subjt:  SKVLLNPFMADKALVKLDDSYGGWISVKNLPLPLWNRSTFEVIGQHFGGLVSISSQTLNLLECSEARIEVRKNLCGFIPAEIVVTDKKHGNFSLRFGDIS

Query:  SLDAPISIPGNLSLSDFVNEIDLKRVHQVMEDE
         L+AP  I   L +S   N IDL R++QV+ DE
Subjt:  SLDAPISIPGNLSLSDFVNEIDLKRVHQVMEDE

A5BH71 Reverse transcriptase domain-containing protein1.5e-2530.04Show/hide
Query:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLEVRSL--------LDHFPLLFEARSFQWGPSPFRFY
        GG+LI+WD  +L+  E + G + +S+KF+    +++W++ VY PN+   RK  W EL    G+  +R L         DH+ ++ E   F+WGP+PF F 
Subjt:  GGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRS--GVLEVRSL--------LDHFPLLFEARSFQWGPSPFRFY

Query:  NSWLNQSDCVQIIASLSQD-TSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSARGEFLELYMSEERN
        N WL      +   S  ++    GW G     KL+ +K+ LK+       +  +++K +L+++   D+L  +  LS      R+  +GE  EL + EE +
Subjt:  NSWLNQSDCVQIIASLSQD-TSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSARGEFLELYMSEERN

Query:  LIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLY
          Q+ +++W+K GD N+NFF +V   ++      + + +  G      + I+ EIL YF  LY
Subjt:  LIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLY

SwissProt top hitse value%identityAlignment
Q9FGN7 Sister chromatid cohesion protein SCC41.4e-1264.29Show/hide
Query:  IPEELMKLGITDGVREVSLQHSAIWVAGIYLMLLMQLLENKVAIELTRSEFVEAQE
        I +EL+KLGITD VRE  L+H+AIW++ ++LML MQ LEN+VA+ELTRS++VEA+E
Subjt:  IPEELMKLGITDGVREVSLQHSAIWVAGIYLMLLMQLLENKVAIELTRSEFVEAQE

Arabidopsis top hitse value%identityAlignment
AT5G51340.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.9e-1464.29Show/hide
Query:  IPEELMKLGITDGVREVSLQHSAIWVAGIYLMLLMQLLENKVAIELTRSEFVEAQE
        I +EL+KLGITD VRE  L+H+AIW++ ++LML MQ LEN+VA+ELTRS++VEA+E
Subjt:  IPEELMKLGITDGVREVSLQHSAIWVAGIYLMLLMQLLENKVAIELTRSEFVEAQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATATGTGGAAATTAAGTATGCTAAAAATGTTTCAAAGAAGTTTGAGATTGAAAAGTCATCCAAGCCGGCTGTTAATATTAGTAAGAATTTGGATAGAAGATATGC
AGAAGTGGTGTGGTCAAAATCAGGGGGTCCCCATCATGGTTCCCATTCTAAGAAGCTGTCAGAATTTTCTTCATTTTGGGTCAGAAAAGAAAAAGAAGTGGTGGACTTAA
AGTTAGATGAATTTTGTGTGGTTTCCAGAATGTTTGCACATAACTCTTGGAGGGAAGTAAAACAAGATTTAGAAGATTATTTTCAGTCTAAGGTTTTACTCAACCCTTTT
ATGGCTGACAAAGCTTTGGTGAAATTGGACGATAGTTATGGAGGTTGGATTTCTGTGAAGAACTTACCTCTGCCTCTTTGGAATCGTTCAACTTTTGAAGTGATTGGTCA
ACATTTTGGTGGATTGGTGAGTATTTCTTCTCAAACGCTTAACCTTTTGGAGTGTTCAGAAGCCCGAATTGAAGTAAGAAAGAATCTTTGTGGATTCATCCCTGCTGAGA
TTGTTGTTACAGATAAAAAGCACGGGAATTTTTCTCTTCGTTTTGGTGATATCTCCTCTTTAGATGCCCCTATTTCTATCCCTGGGAATTTATCTTTGAGTGATTTTGTA
AATGAAATTGATTTAAAAAGAGTTCATCAGGTTATGGAAGACGAAAGGTTTTCTCTTCATCAAGATGATTTTGATTCTTGTGTTGACCAAGAGTTGAATCATTCAACATT
GACTGATCAGGTTCATAATCCTATTTATTCAAATTCTAATCCGGTGCGTCTCCACCAAGCAATGGATGTCGAAGACAATATTGGTGTGAATGATCAGAATAATGAAGAGG
CATTTATGGGGAAGGTTGACAAGGATTTAATAAGACCTACATTAATTCCACATTCATCAGTGCCTTTAATTCCAGAGTCTTCATTTGAAAAGTCAGCATTAAATATGGAG
CCTTTATCAGCATTAAATTCGGAGCCTTCTGAAGTTATTCTCATTTTGTCTCCGATTTTAAATAATCATTCTCCCAACCAATTGTTGCCTGAAGTTCCCAAGATGACCAC
TGAAATTCCTTCTTTATTCGATGCCATTGAAGAGGCCTGTTCGATTTTGCTAGCTCCTTCTCTTAAGCAAACTTCTAATTTTGAGATAGCATTAAATGATCATCCGCAGC
ATTTATCATTTAATAATTGCAGTCTTAGTAAAGTTCCAATTGGGGACTCTAGAAATGTATTCACCAAGGGTATTCTTTCTCAAGATCAAAATGCTTCTACCAAGTGTCCA
ATTTCTATCCCTTGTATGGATGAATCTGATGTGGCTTCTGATGTCAGCTTAAGTAGTGCAGAATTTGAATCACTTACTACACCATTTCATGAATCACAAGCCTTGGGAAT
CATTATGAATGAGTTGTTGTATGGTGGTTTGCTGATTATGTGGGATGACAGTAGGTTGAAAGCTCTGGAATTTATTAAAGGAGGGTATACTCTATCAGTCAAATTCTCTT
TTATGAATGAAAAAGTGGTTTGGATATATAATGTGTATTGTCCAAATGATTATAGAGAAAGGAAGTATCTATGGAATGAACTTCGGTCTGGTGTATTGGAGGTTCGATCA
CTCTTAGACCATTTTCCTTTGCTTTTTGAAGCTAGAAGTTTTCAGTGGGGTCCTTCTCCTTTTCGGTTTTACAATTCTTGGTTGAATCAGTCTGATTGTGTGCAAATTAT
TGCTTCACTATCACAAGATACTTCTTATGGGTGGGCAGGTTTTGCTATTGCTTGTAAGCTGAGAAACTTGAAATCTGTCTTAAAAGATCGGTATGCAGATCATGAGTTGA
AAAGAAAGAGAAAGGAGAAAAGTTTGCTTGCGGAAATTAATAGGCTTGATACATTATCAGATCAGTGTCCCTTATCTTCTCTTGAGCAAAGCCTTCGTTCTTCGGCAAGG
GGGGAATTTTTAGAATTATATATGTCTGAGGAAAGAAATTTGATCCAAAGATGCAAGTTGCAATGGTTAAAAGCTGGGGATGAAAATACAAATTTTTTTTCACAGGTTCT
TGGCAGCAAAAAAGAGGAAATTATTGATTACCAATTTGTGTTTAGTAGATGGGGATTCTCTTATGTCTTTCAGAGAGATATTGAAGCTGAAATTCTTGATTATTTTGCGT
CACTTTATACGAAGATACCGGAGGAGTTGATGAAACTTGGGATAACGGATGGTGTAAGAGAAGTCAGTTTGCAACACTCTGCCATTTGGGTGGCTGGCATTTATTTAATG
CTCCTTATGCAACTTCTTGAAAACAAAGTAGCCATTGAGCTGACACGTTCTGAATTTGTTGAGGCGCAAGAGTTATTTGGTTCGAAAGATATCAGAGAGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATATGTGGAAATTAAGTATGCTAAAAATGTTTCAAAGAAGTTTGAGATTGAAAAGTCATCCAAGCCGGCTGTTAATATTAGTAAGAATTTGGATAGAAGATATGC
AGAAGTGGTGTGGTCAAAATCAGGGGGTCCCCATCATGGTTCCCATTCTAAGAAGCTGTCAGAATTTTCTTCATTTTGGGTCAGAAAAGAAAAAGAAGTGGTGGACTTAA
AGTTAGATGAATTTTGTGTGGTTTCCAGAATGTTTGCACATAACTCTTGGAGGGAAGTAAAACAAGATTTAGAAGATTATTTTCAGTCTAAGGTTTTACTCAACCCTTTT
ATGGCTGACAAAGCTTTGGTGAAATTGGACGATAGTTATGGAGGTTGGATTTCTGTGAAGAACTTACCTCTGCCTCTTTGGAATCGTTCAACTTTTGAAGTGATTGGTCA
ACATTTTGGTGGATTGGTGAGTATTTCTTCTCAAACGCTTAACCTTTTGGAGTGTTCAGAAGCCCGAATTGAAGTAAGAAAGAATCTTTGTGGATTCATCCCTGCTGAGA
TTGTTGTTACAGATAAAAAGCACGGGAATTTTTCTCTTCGTTTTGGTGATATCTCCTCTTTAGATGCCCCTATTTCTATCCCTGGGAATTTATCTTTGAGTGATTTTGTA
AATGAAATTGATTTAAAAAGAGTTCATCAGGTTATGGAAGACGAAAGGTTTTCTCTTCATCAAGATGATTTTGATTCTTGTGTTGACCAAGAGTTGAATCATTCAACATT
GACTGATCAGGTTCATAATCCTATTTATTCAAATTCTAATCCGGTGCGTCTCCACCAAGCAATGGATGTCGAAGACAATATTGGTGTGAATGATCAGAATAATGAAGAGG
CATTTATGGGGAAGGTTGACAAGGATTTAATAAGACCTACATTAATTCCACATTCATCAGTGCCTTTAATTCCAGAGTCTTCATTTGAAAAGTCAGCATTAAATATGGAG
CCTTTATCAGCATTAAATTCGGAGCCTTCTGAAGTTATTCTCATTTTGTCTCCGATTTTAAATAATCATTCTCCCAACCAATTGTTGCCTGAAGTTCCCAAGATGACCAC
TGAAATTCCTTCTTTATTCGATGCCATTGAAGAGGCCTGTTCGATTTTGCTAGCTCCTTCTCTTAAGCAAACTTCTAATTTTGAGATAGCATTAAATGATCATCCGCAGC
ATTTATCATTTAATAATTGCAGTCTTAGTAAAGTTCCAATTGGGGACTCTAGAAATGTATTCACCAAGGGTATTCTTTCTCAAGATCAAAATGCTTCTACCAAGTGTCCA
ATTTCTATCCCTTGTATGGATGAATCTGATGTGGCTTCTGATGTCAGCTTAAGTAGTGCAGAATTTGAATCACTTACTACACCATTTCATGAATCACAAGCCTTGGGAAT
CATTATGAATGAGTTGTTGTATGGTGGTTTGCTGATTATGTGGGATGACAGTAGGTTGAAAGCTCTGGAATTTATTAAAGGAGGGTATACTCTATCAGTCAAATTCTCTT
TTATGAATGAAAAAGTGGTTTGGATATATAATGTGTATTGTCCAAATGATTATAGAGAAAGGAAGTATCTATGGAATGAACTTCGGTCTGGTGTATTGGAGGTTCGATCA
CTCTTAGACCATTTTCCTTTGCTTTTTGAAGCTAGAAGTTTTCAGTGGGGTCCTTCTCCTTTTCGGTTTTACAATTCTTGGTTGAATCAGTCTGATTGTGTGCAAATTAT
TGCTTCACTATCACAAGATACTTCTTATGGGTGGGCAGGTTTTGCTATTGCTTGTAAGCTGAGAAACTTGAAATCTGTCTTAAAAGATCGGTATGCAGATCATGAGTTGA
AAAGAAAGAGAAAGGAGAAAAGTTTGCTTGCGGAAATTAATAGGCTTGATACATTATCAGATCAGTGTCCCTTATCTTCTCTTGAGCAAAGCCTTCGTTCTTCGGCAAGG
GGGGAATTTTTAGAATTATATATGTCTGAGGAAAGAAATTTGATCCAAAGATGCAAGTTGCAATGGTTAAAAGCTGGGGATGAAAATACAAATTTTTTTTCACAGGTTCT
TGGCAGCAAAAAAGAGGAAATTATTGATTACCAATTTGTGTTTAGTAGATGGGGATTCTCTTATGTCTTTCAGAGAGATATTGAAGCTGAAATTCTTGATTATTTTGCGT
CACTTTATACGAAGATACCGGAGGAGTTGATGAAACTTGGGATAACGGATGGTGTAAGAGAAGTCAGTTTGCAACACTCTGCCATTTGGGTGGCTGGCATTTATTTAATG
CTCCTTATGCAACTTCTTGAAAACAAAGTAGCCATTGAGCTGACACGTTCTGAATTTGTTGAGGCGCAAGAGTTATTTGGTTCGAAAGATATCAGAGAGTTTTAG
Protein sequenceShow/hide protein sequence
MEYVEIKYAKNVSKKFEIEKSSKPAVNISKNLDRRYAEVVWSKSGGPHHGSHSKKLSEFSSFWVRKEKEVVDLKLDEFCVVSRMFAHNSWREVKQDLEDYFQSKVLLNPF
MADKALVKLDDSYGGWISVKNLPLPLWNRSTFEVIGQHFGGLVSISSQTLNLLECSEARIEVRKNLCGFIPAEIVVTDKKHGNFSLRFGDISSLDAPISIPGNLSLSDFV
NEIDLKRVHQVMEDERFSLHQDDFDSCVDQELNHSTLTDQVHNPIYSNSNPVRLHQAMDVEDNIGVNDQNNEEAFMGKVDKDLIRPTLIPHSSVPLIPESSFEKSALNME
PLSALNSEPSEVILILSPILNNHSPNQLLPEVPKMTTEIPSLFDAIEEACSILLAPSLKQTSNFEIALNDHPQHLSFNNCSLSKVPIGDSRNVFTKGILSQDQNASTKCP
ISIPCMDESDVASDVSLSSAEFESLTTPFHESQALGIIMNELLYGGLLIMWDDSRLKALEFIKGGYTLSVKFSFMNEKVVWIYNVYCPNDYRERKYLWNELRSGVLEVRS
LLDHFPLLFEARSFQWGPSPFRFYNSWLNQSDCVQIIASLSQDTSYGWAGFAIACKLRNLKSVLKDRYADHELKRKRKEKSLLAEINRLDTLSDQCPLSSLEQSLRSSAR
GEFLELYMSEERNLIQRCKLQWLKAGDENTNFFSQVLGSKKEEIIDYQFVFSRWGFSYVFQRDIEAEILDYFASLYTKIPEELMKLGITDGVREVSLQHSAIWVAGIYLM
LLMQLLENKVAIELTRSEFVEAQELFGSKDIREF