CuGenDBv2

Spg027934 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg027934
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Fanconi-associated nuclease
Genome location: scaffold2:38744584..38751406
RNA-Seq Expression: Spg027934
Synteny: Spg027934
Gene Ontology terms: NA
InterPro domains: NA
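The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field could be split into its parts as follows. The names `Locus` and `parse_location` are hypothetical helpers and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; start and end are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(seqid, start, end)


locus = parse_location("scaffold2:38744584..38751406")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus spans 6823 positions on scaffold2.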


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] (e-value 2.3e-18, identity 25.37%)
Query:  LSYDRFVNNSARAKYVELLKRDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALY
        +++ +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V+EFYANI K +   + VRG ++ ++  AIN  +
Subjt:  LSYDRFVNNSARAKYVELLKRDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALY

Query:  NLQNL--PHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFH--------------------------------------------IDIGKII
        +LQ +   HA + E A    + +    + ++  E   W   +T + + +                                            ID+G+II
Subjt:  NLQNL--PHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFH--------------------------------------------IDIGKII

Query:  VNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL--
        V ++  C  KK   L FPN IT LC++  V EN  D IL     I       L   +  +    V+         N  +  LAL  A  Q  A+  AL  
Subjt:  VNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL--

Query:  ---TFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL
            F+ YV+ RD  ++   QE         P FP+++L
Subjt:  ---TFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL

KAF4378427.1 hypothetical protein F8388_021621 [Cannabis sativa] (e-value 3.3e-17, identity 29.96%)
Query:  SVAEASEEPDEIE--------EPQLSYDRFVNNSARAKYVELLK-RDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANID
        ++ E+ EE  +I+        +  L   +F++ +A AKY + ++ ++F  ERG        G +P +L   I    WE  C  P S   QVV+EFYAN  
Subjt:  SVAEASEEPDEIE--------EPQLSYDRFVNNSARAKYVELLK-RDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANID

Query:  KVD-GFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLS------------KTEKRTFH-IDIGKIIVNEIFGC
          +    +IVR V+V +S   IN  + L+N+    +++     +++ + D V E+   GA W +             + E +  H +++GK++   IF C
Subjt:  KVD-GFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLS------------KTEKRTFH-IDIGKIIVNEIFGC

Query:  WKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI
          +  GKLFFP  IT LC+ AGVP    D  +  KG+
Subjt:  WKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e-value 2.5e-17, identity 30.31%)
Query:  RFVNNSARAKYV-ELLKRDFLFERGF-------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H+W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYV-ELLKRDFLFERGF-------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-------------------------------IDIGKIIVNEI
         + P   ++E     +   L   +  V + GA W              L+   K  FH                               I++G++I +EI
Subjt:  QNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-------------------------------IDIGKIIVNEI

Query:  FGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQE
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID    AR+  TQE
Subjt:  FGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e-value 5.4e-28, identity 28.53%)
Query:  RFVNNSARAKYV-ELLKRDFLFERGF-------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H+W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYV-ELLKRDFLFERGF-------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-------------------------------IDIGKIIVNEI
         + P   ++E     + + L   +  V   GA W              L+   K  +H                               I++G++I +EI
Subjt:  QNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-------------------------------IDIGKIIVNEI

Query:  FGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID    AR+ +   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  FGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL
           +   +Q   FW Y + RD +LKKALQ NF++P P  PAFP+++L
Subjt:  ---EFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e-value 6.4e-21, identity 30.43%)
Query:  VVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-----
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L   +  V   GA W              L+   K  +H     
Subjt:  VVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-----

Query:  --------------------------IDIGKIIVNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDT-----------------PNF
                                  I++G++I +EI  C  +K G LFFP+ IT LC+ A    NE    L + G ID                  P+ 
Subjt:  --------------------------IDIGKIIVNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDT-----------------PNF

Query:  ARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL
        +R      +R  G V      LEQ     S+QE   +Q   FW Y + RD +LKKALQ NF++P P  PAFP+++L
Subjt:  ARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL

TrEMBL top hits (e-value, % identity, alignment)
A0A2P5BCG4 Uncharacterized protein (Fragment) (e-value 2.6e-28, identity 28.53%)
Query:  RFVNNSARAKYV-ELLKRDFLFERGF-------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H+W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYV-ELLKRDFLFERGF-------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-------------------------------IDIGKIIVNEI
         + P   ++E     + + L   +  V   GA W              L+   K  +H                               I++G++I +EI
Subjt:  QNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-------------------------------IDIGKIIVNEI

Query:  FGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID    AR+ +   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  FGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL
           +   +Q   FW Y + RD +LKKALQ NF++P P  PAFP+++L
Subjt:  ---EFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL

A0A2P5DXM3 Uncharacterized protein (e-value 3.1e-21, identity 30.43%)
Query:  VVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-----
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L   +  V   GA W              L+   K  +H     
Subjt:  VVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQ-------------LSKTEKRTFH-----

Query:  --------------------------IDIGKIIVNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDT-----------------PNF
                                  I++G++I +EI  C  +K G LFFP+ IT LC+ A    NE    L + G ID                  P+ 
Subjt:  --------------------------IDIGKIIVNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDT-----------------PNF

Query:  ARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL
        +R      +R  G V      LEQ     S+QE   +Q   FW Y + RD +LKKALQ NF++P P  PAFP+++L
Subjt:  ARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL

A0A6A2YMQ9 Uncharacterized protein (e-value 3.9e-16, identity 27.73%)
Query:  SYDRFVNNSARAKYVELLKRDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYN
        S+ +F ++ A+A++    K+   FE GF       G     +   +    W+ F   P SVNA VV+EFYANI K +   + VRG ++ ++PSAI   ++
Subjt:  SYDRFVNNSARAKYVELLKRDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYN

Query:  LQNL--PHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTE--------------------------------------------KRTFHIDIGKIIV
        LQ++   HA + E A + + +++   + ++  E   W   +T                                             K +  ID+G+IIV
Subjt:  LQNL--PHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTE--------------------------------------------KRTFHIDIGKIIV

Query:  NEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVIL
         ++  C  KK   L FPN IT LC++  V EN  D IL
Subjt:  NEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVIL

A0A6A3BU96 Uncharacterized protein (e-value 1.1e-18, identity 25.37%)
Query:  LSYDRFVNNSARAKYVELLKRDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALY
        +++ +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V+EFYANI K +   + VRG ++ ++  AIN  +
Subjt:  LSYDRFVNNSARAKYVELLKRDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALY

Query:  NLQNL--PHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFH--------------------------------------------IDIGKII
        +LQ +   HA + E A    + +    + ++  E   W   +T + + +                                            ID+G+II
Subjt:  NLQNL--PHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFH--------------------------------------------IDIGKII

Query:  VNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL--
        V ++  C  KK   L FPN IT LC++  V EN  D IL     I       L   +  +    V+         N  +  LAL  A  Q  A+  AL  
Subjt:  VNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL--

Query:  ---TFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL
            F+ YV+ RD  ++   QE         P FP+++L
Subjt:  ---TFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLL

A0A7J6G6G2 Fanconi-associated nuclease (e-value 1.6e-17, identity 29.96%)
Query:  SVAEASEEPDEIE--------EPQLSYDRFVNNSARAKYVELLK-RDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANID
        ++ E+ EE  +I+        +  L   +F++ +A AKY + ++ ++F  ERG        G +P +L   I    WE  C  P S   QVV+EFYAN  
Subjt:  SVAEASEEPDEIE--------EPQLSYDRFVNNSARAKYVELLK-RDFLFERGF------SGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANID

Query:  KVD-GFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLS------------KTEKRTFH-IDIGKIIVNEIFGC
          +    +IVR V+V +S   IN  + L+N+    +++     +++ + D V E+   GA W +             + E +  H +++GK++   IF C
Subjt:  KVD-GFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVAPSNEQLSDAVREVGIEGARWQLS------------KTEKRTFH-IDIGKIIVNEIFGC

Query:  WKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI
          +  GKLFFP  IT LC+ AGVP    D  +  KG+
Subjt:  WKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCGAAAACGAGAGCAAGAAAAGAGAGGGAGAATGAGGAGGAAGAGGTGTCTGTTACCCCTGAGGTACAGAAAGTAAAAACGAAGAAGAAGAAGACACCGGAGGAAAA
AGAAGCCAAAAGAATGAGAAGGCAACAACGGGCTGAGGAGCAAGAAGCTGTCCAGAAGGCGACAGAAGATGTTACTACTACAATGGAAGAAGGAAATCCGAAAGAACCTG
AAGTGCAGAACCCAGAGGAGGGCGAACCGATAGTGGCAGATACAGAAAGAGTTCAAGAAAAGATCACTGAGGAGATTCAAGAAAAACAGGCTGAGGAAGTGCAGGAACAT
CATGCAGAGGTTGCACCTGAAGAAGGTAATGAGCAAGAACAGGAGGCTCGAGTGGAGGTGATCATGCCGGAGGTACCCAAACATCGCCGCATTAAGAGGAAAGCGGGCCG
CATCAAAGTAGTCCGAACTGATACCCCCTCGCCTCCAACTACTGACTCTGAAAGAGAGAATGCAGAGAAAGAAGAGCGTGAGAAGAAGGAGGCCGAGGAAAAAGTAAGAG
AAGAAGTAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGACGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGGAAGAACCTGATGAGATAGAAGAGCCACAG
TTGTCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGTTGAGCTGCTGAAAAGAGATTTCCTGTTTGAGAGAGGATTTAGCGGTGATCTTCCGCATTTTCT
AAGGACCGGCATTGCAGACCACGACTGGGAGTTGTTTTGTGCAAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGCGAATTTTATGCAAATATTGACAAGGTAGATGGTT
TCCAAGTGATTGTTCGGGGAGTCGAAGTCGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATCTCCCCCACGCAGCGTATAATGAGATGGCTGTGGCG
CCATCTAATGAGCAGCTGAGCGATGCTGTGCGGGAAGTGGGTATTGAAGGGGCACGGTGGCAGCTGTCAAAAACAGAGAAAAGGACGTTTCACATTGATATAGGGAAGAT
TATTGTTAATGAGATATTTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGAG
ATGTTATTTTATTTGATAAGGGAATCATTGACACGCCTAACTTTGCACGACTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTTGTCTACGGCATCAACACGATTTTA
GAACAACTGGCACTGTCGGCCAGTAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACATTCTGGAACTATGTTAGGAGTCGTGATGCCAGTCTGAAGAAGGCGCTGCAGGA
GAATTTTTCAAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTAAACCCCTGGATTCCGCCCCCACCGATGGAAAGAGGAGAAGGGGATGATGAAAATGACT
TGGACTCGTTAAGCTTAATTAGATCAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAATGCTGAAGTAAAGGTTGAAGAATATGTTGCTGGGCGACTTGAGGGA
GCAAACTCTGTGCTGGAGCAAAGCTGGGAGCAAAAACTGCCACAAGAGCAAGAAAAGGCTACTGCAGAAGAAACTGCACGCAAGGCCGACGCAGATAAAAAAGCGGCCAA
AGAAAAAGAGGGCTCCAGCAAGCAGGCGCAAGATCCGGTCGCCGCAGACATTGCCGCAACCAACTTGGCCCTCATTGCGGCAGCTCTGGACATTGAAGCTTCTGATTCCA
AGGATGAGCTCCCCCTCACTCATCGCCGCAACATCCAGTCCGCAGGAGTAGTCATCCGAGAACCCTCTGAAAAATCTGCAGCAGCCCCAAGCAAGAAAAGGAAAGCGCCC
ACCGCATCCAAAGGCAAAGGAAAAGAAAAAGTGGGCGCAGGAAAAGAAAAAGTGGGCGAGGAATGGGGGAAGTTCCAAGAAGAGAGAGTCCACGACTTCTATCGAACGCA
GTGGCAAGAGGAAGTCAACAAAGTTTTAGTGAGAGGGCACACAGTCGCATTCTCCTCCGCACACATCAACAATGTATACGAGCTACGAGACATCCCAAACGCAGAAGGGA
ATGAACTCCTCGACTACCTTGACGATGACTTCCTGAGTGCAGTACTTCTTGCGGTAGCCCGACCTGGAGCCAAATGGGACCACCATGGCACGGCGAGGACCCTGAAGTCG
CAGCTGCTCGACTTTGAGGCCACCTGCTGGATGCATTGGGTGAAGAATCGCATCATGCCCACAACGCATGATGCCACATTGAGCCTTCAACGGGTAATTCTTGTTTATTG
CATCATGAGCAATGTTTGCCTAGGCTTTGTCACTCGTTTATGCCGCGCATCCGGCCTCGTGCCCGCAGCAGATGAAGAAATCCGACCCATGCGGAAGGATTTTGACGAAC
AATGGTGGACGCGCATAACCAGCACTCGCGATAGGCGTTTGCAAGGAAGCACGCAGCAACCAGCCCCCGCATCCCAACCCACCGAGCCTACGCCTGCGCCTGAAGCACAA
AAGAAAAAGAGGCGCAAGCAAACTGCGGGCAGAAACCCACCACCAGCCGCAAGGACCCCTCACTCTACTCCACCAATGCAAGAAAGCACTGTCTGCGCCCAATCTCAGGG
ATCCTTGCCACAACTGCAAGAAAGCTTGCCGCCACCACCTCCACCTCCACTTTCGCAACCAACAGTCCAACCAACACCACAACACGGCGAAGACTTGCGGCAGGAGGGGG
CGCATTTTGAAGACTCATTGCTGAGCCCCACTCGCAATAACGAAGTTCCTACATCGAGCCCCACACCCCAAGCCACAGCTTCTGCGAGCACAACTCAGGCTGGGATTGAT
CTACTGATTGATGTGGATTCAACCCTTATCTTTTCAGAGCTGCGGCGGATAATTCTAGAAAGTGTAAAGCCCCTACACCAGCAACAAGAAAGGATGTGCCAACAACAAGA
CGAACTGTCGCAGCGGGTAAATGCTTTAGTCCTTTTTCTAATGCGATGGATTGAACGTTTTGGCCGTTTGCCAGTCACGCCCAACTTGTTTGCACGGCATGCTGGGCAAC
AAAGACCCCCTGGCGCAGACCCATCCCAAGTACGCGGATCCAAGCCTCCTCCTGCGCCAAGGGAGACAAGACAACCTCCACCACCACCACCGCAGGAGTATAATACACTT
AACCTTAAGGAAAGTCCCTTCGCATTTTCGACTTGCCGCAACTCAGCCTTGATCTCCGTAGTCTATGCGCTGTAG
mRNA sequence
ATGGCGAAAACGAGAGCAAGAAAAGAGAGGGAGAATGAGGAGGAAGAGGTGTCTGTTACCCCTGAGGTACAGAAAGTAAAAACGAAGAAGAAGAAGACACCGGAGGAAAA
AGAAGCCAAAAGAATGAGAAGGCAACAACGGGCTGAGGAGCAAGAAGCTGTCCAGAAGGCGACAGAAGATGTTACTACTACAATGGAAGAAGGAAATCCGAAAGAACCTG
AAGTGCAGAACCCAGAGGAGGGCGAACCGATAGTGGCAGATACAGAAAGAGTTCAAGAAAAGATCACTGAGGAGATTCAAGAAAAACAGGCTGAGGAAGTGCAGGAACAT
CATGCAGAGGTTGCACCTGAAGAAGGTAATGAGCAAGAACAGGAGGCTCGAGTGGAGGTGATCATGCCGGAGGTACCCAAACATCGCCGCATTAAGAGGAAAGCGGGCCG
CATCAAAGTAGTCCGAACTGATACCCCCTCGCCTCCAACTACTGACTCTGAAAGAGAGAATGCAGAGAAAGAAGAGCGTGAGAAGAAGGAGGCCGAGGAAAAAGTAAGAG
AAGAAGTAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGACGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGGAAGAACCTGATGAGATAGAAGAGCCACAG
TTGTCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGTTGAGCTGCTGAAAAGAGATTTCCTGTTTGAGAGAGGATTTAGCGGTGATCTTCCGCATTTTCT
AAGGACCGGCATTGCAGACCACGACTGGGAGTTGTTTTGTGCAAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGCGAATTTTATGCAAATATTGACAAGGTAGATGGTT
TCCAAGTGATTGTTCGGGGAGTCGAAGTCGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATCTCCCCCACGCAGCGTATAATGAGATGGCTGTGGCG
CCATCTAATGAGCAGCTGAGCGATGCTGTGCGGGAAGTGGGTATTGAAGGGGCACGGTGGCAGCTGTCAAAAACAGAGAAAAGGACGTTTCACATTGATATAGGGAAGAT
TATTGTTAATGAGATATTTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGAG
ATGTTATTTTATTTGATAAGGGAATCATTGACACGCCTAACTTTGCACGACTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTTGTCTACGGCATCAACACGATTTTA
GAACAACTGGCACTGTCGGCCAGTAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACATTCTGGAACTATGTTAGGAGTCGTGATGCCAGTCTGAAGAAGGCGCTGCAGGA
GAATTTTTCAAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTAAACCCCTGGATTCCGCCCCCACCGATGGAAAGAGGAGAAGGGGATGATGAAAATGACT
TGGACTCGTTAAGCTTAATTAGATCAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAATGCTGAAGTAAAGGTTGAAGAATATGTTGCTGGGCGACTTGAGGGA
GCAAACTCTGTGCTGGAGCAAAGCTGGGAGCAAAAACTGCCACAAGAGCAAGAAAAGGCTACTGCAGAAGAAACTGCACGCAAGGCCGACGCAGATAAAAAAGCGGCCAA
AGAAAAAGAGGGCTCCAGCAAGCAGGCGCAAGATCCGGTCGCCGCAGACATTGCCGCAACCAACTTGGCCCTCATTGCGGCAGCTCTGGACATTGAAGCTTCTGATTCCA
AGGATGAGCTCCCCCTCACTCATCGCCGCAACATCCAGTCCGCAGGAGTAGTCATCCGAGAACCCTCTGAAAAATCTGCAGCAGCCCCAAGCAAGAAAAGGAAAGCGCCC
ACCGCATCCAAAGGCAAAGGAAAAGAAAAAGTGGGCGCAGGAAAAGAAAAAGTGGGCGAGGAATGGGGGAAGTTCCAAGAAGAGAGAGTCCACGACTTCTATCGAACGCA
GTGGCAAGAGGAAGTCAACAAAGTTTTAGTGAGAGGGCACACAGTCGCATTCTCCTCCGCACACATCAACAATGTATACGAGCTACGAGACATCCCAAACGCAGAAGGGA
ATGAACTCCTCGACTACCTTGACGATGACTTCCTGAGTGCAGTACTTCTTGCGGTAGCCCGACCTGGAGCCAAATGGGACCACCATGGCACGGCGAGGACCCTGAAGTCG
CAGCTGCTCGACTTTGAGGCCACCTGCTGGATGCATTGGGTGAAGAATCGCATCATGCCCACAACGCATGATGCCACATTGAGCCTTCAACGGGTAATTCTTGTTTATTG
CATCATGAGCAATGTTTGCCTAGGCTTTGTCACTCGTTTATGCCGCGCATCCGGCCTCGTGCCCGCAGCAGATGAAGAAATCCGACCCATGCGGAAGGATTTTGACGAAC
AATGGTGGACGCGCATAACCAGCACTCGCGATAGGCGTTTGCAAGGAAGCACGCAGCAACCAGCCCCCGCATCCCAACCCACCGAGCCTACGCCTGCGCCTGAAGCACAA
AAGAAAAAGAGGCGCAAGCAAACTGCGGGCAGAAACCCACCACCAGCCGCAAGGACCCCTCACTCTACTCCACCAATGCAAGAAAGCACTGTCTGCGCCCAATCTCAGGG
ATCCTTGCCACAACTGCAAGAAAGCTTGCCGCCACCACCTCCACCTCCACTTTCGCAACCAACAGTCCAACCAACACCACAACACGGCGAAGACTTGCGGCAGGAGGGGG
CGCATTTTGAAGACTCATTGCTGAGCCCCACTCGCAATAACGAAGTTCCTACATCGAGCCCCACACCCCAAGCCACAGCTTCTGCGAGCACAACTCAGGCTGGGATTGAT
CTACTGATTGATGTGGATTCAACCCTTATCTTTTCAGAGCTGCGGCGGATAATTCTAGAAAGTGTAAAGCCCCTACACCAGCAACAAGAAAGGATGTGCCAACAACAAGA
CGAACTGTCGCAGCGGGTAAATGCTTTAGTCCTTTTTCTAATGCGATGGATTGAACGTTTTGGCCGTTTGCCAGTCACGCCCAACTTGTTTGCACGGCATGCTGGGCAAC
AAAGACCCCCTGGCGCAGACCCATCCCAAGTACGCGGATCCAAGCCTCCTCCTGCGCCAAGGGAGACAAGACAACCTCCACCACCACCACCGCAGGAGTATAATACACTT
AACCTTAAGGAAAGTCCCTTCGCATTTTCGACTTGCCGCAACTCAGCCTTGATCTCCGTAGTCTATGCGCTGTAG
Protein sequence
MAKTRARKERENEEEEVSVTPEVQKVKTKKKKTPEEKEAKRMRRQQRAEEQEAVQKATEDVTTTMEEGNPKEPEVQNPEEGEPIVADTERVQEKITEEIQEKQAEEVQEH
HAEVAPEEGNEQEQEARVEVIMPEVPKHRRIKRKAGRIKVVRTDTPSPPTTDSERENAEKEEREKKEAEEKVREEVEKKAEEERLLKRRAEKGKSVAEASEEPDEIEEPQ
LSYDRFVNNSARAKYVELLKRDFLFERGFSGDLPHFLRTGIADHDWELFCAKPESVNAQVVREFYANIDKVDGFQVIVRGVEVDWSPSAINALYNLQNLPHAAYNEMAVA
PSNEQLSDAVREVGIEGARWQLSKTEKRTFHIDIGKIIVNEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNFARLQRTQEARQGGLVYGINTIL
EQLALSASRQEFAERQALTFWNYVRSRDASLKKALQENFSKPYPALPAFPEDLLNPWIPPPPMERGEGDDENDLDSLSLIRSRIILLQQSLVLQNAEVKVEEYVAGRLEG
ANSVLEQSWEQKLPQEQEKATAEETARKADADKKAAKEKEGSSKQAQDPVAADIAATNLALIAAALDIEASDSKDELPLTHRRNIQSAGVVIREPSEKSAAAPSKKRKAP
TASKGKGKEKVGAGKEKVGEEWGKFQEERVHDFYRTQWQEEVNKVLVRGHTVAFSSAHINNVYELRDIPNAEGNELLDYLDDDFLSAVLLAVARPGAKWDHHGTARTLKS
QLLDFEATCWMHWVKNRIMPTTHDATLSLQRVILVYCIMSNVCLGFVTRLCRASGLVPAADEEIRPMRKDFDEQWWTRITSTRDRRLQGSTQQPAPASQPTEPTPAPEAQ
KKKRRKQTAGRNPPPAARTPHSTPPMQESTVCAQSQGSLPQLQESLPPPPPPPLSQPTVQPTPQHGEDLRQEGAHFEDSLLSPTRNNEVPTSSPTPQATASASTTQAGID
LLIDVDSTLIFSELRRIILESVKPLHQQQERMCQQQDELSQRVNALVLFLMRWIERFGRLPVTPNLFARHAGQQRPPGADPSQVRGSKPPPAPRETRQPPPPPPQEYNTL
NLKESPFAFSTCRNSALISVVYAL
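
The CDS listed in this record begins with ATG and ends with the stop codon TAG, and the protein sequence is its translation. As a rough sanity check, and not a full translation, one could confirm that a CDS and a protein have consistent lengths. The function name `cds_matches_protein` is hypothetical, the check assumes the CDS includes exactly one terminal stop codon that is absent from the protein string, and the toy input is a shortened illustration (the record's first three codons plus a stop), not the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_matches_protein(cds: str, protein: str) -> bool:
    """Length-level consistency check between a CDS and its protein.

    ASSUMPTIONS: the CDS starts with ATG, ends with one stop codon,
    and the protein string does not include a stop symbol.
    """
    # Drop line breaks and other whitespace, normalise case.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    if len(cds) % 3 != 0 or not cds.startswith("ATG"):
        return False
    if cds[-3:] not in STOP_CODONS:
        return False
    # Every codon except the final stop should map to one residue.
    return len(cds) // 3 - 1 == len(protein)


# Toy input: ATG GCG AAA + TAG stop, paired with the residues M, A, K.
print(cds_matches_protein("ATGGCGAAATAG", "MAK"))
```

This only compares lengths and boundary codons. Verifying the residues themselves would need a codon table, for example from an established library.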