; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020257 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020257
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFanconi-associated nuclease
Genome locationscaffold1:26331797..26338091
RNA-Seq ExpressionSpg020257
SyntenySpg020257
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]2.0e-1926.5Show/hide
Query:  FVNNSARAKYAELLKRDFLFERGF------SSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL---
        FV+ +A+  Y  +  R   FE GF      +++L   +   +  H W+ F   P  VNA +V+EFY+NI +     V+VRG+ + ++P+AIN  + L   
Subjt:  FVNNSARAKYAELLKRDFLFERGF------SSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL---

Query:  --------QNFPHAAYNEMV------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCW
                Q   H  Y  ++                        + P  +  +  ++  ++PT+H++ VS +R+LL  +IL   +ID+GKIIV     C 
Subjt:  --------QNFPHAAYNEMV------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCW

Query:  KKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL-------QRKQEARQGGLA-----YVINSILEQLALSASRQEYGE--RQALTFWN
        K++   L FPN IT LC +  V  +  D IL     ++   +  L        +K EA    +A        ++ LEQ A+  + Q  G+   + + ++ 
Subjt:  KKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL-------QRKQEARQGGLA-----YVINSILEQLALSASRQEYGE--RQALTFWN

Query:  YVRNIDANLKKALQENFSKPYPAL----PAFPEDLFNPWIPPPPVERDEEN
        Y +  DA L  AL E+  +   A     P  P     P  PP     D E+
Subjt:  YVRNIDANLKKALQENFSKPYPAL----PAFPEDLFNPWIPPPPVERDEEN

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.0e-1927.2Show/hide
Query:  SYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHG-------------WELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPS
        ++ +F N+ A+A++     R+  FE G       F+ T   D G             W  F   P SVNA +V+EFYANI K     + VRG ++ ++  
Subjt:  SYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHG-------------WELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPS

Query:  AINALYHLQNF--PHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVG
        AIN  +HLQ     HA + E                                   + P  +  +  ++  ++PT+H++ VS  R+LL  +++ S  IDVG
Subjt:  AINALYHLQNF--PHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVG

Query:  KIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL------QRKQEARQGGLAYV-INSILEQLAL-SASRQEYGERQA
        +IIV ++  C  KK   L FPN IT LC +  V  +  D IL     I    L  L      + K    +  +     N+ +  LAL  A  Q   +  A
Subjt:  KIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL------QRKQEARQGGLAYV-INSILEQLAL-SASRQEYGERQA

Query:  L-----TFWNYVRNIDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPP
        L      F+ YV++ D  ++   QE         P FP+++   FN    P P
Subjt:  L-----TFWNYVRNIDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.2e-2030.15Show/hide
Query:  LKRRAEKGKSVVEASEKPDEIEESQFSYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQ
        +KR A K    V+      E E ++  Y+  + N  R   AE   + F+ +   +     F+   I  H W+ FCA PE     +VREFYAN+   V   
Subjt:  LKRRAEKGKSVVEASEKPDEIEESQFSYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQ

Query:  VIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLL
        V VRGV+V WS  AINA++ L +    H+ + E +                                 + P+ +     ++  +LPTTH   VS++R+LL
Subjt:  VIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLL

Query:  AFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK
          ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P    +  L + G ID   +A++ ++
Subjt:  AFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.1e-2929.4Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGFSSD-------LPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL
        +F   +A  +Y   +  R    E+GF  D       LP F+   I  H W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGFSSD-------LPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL

Query:  -----------QNF----------------------PHAAYN--EMVVAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEIS
                   QN                          AY      + P+ +     ++  +LPTTH   VS++R+LL  ++L   SI+VG++I +EI 
Subjt:  -----------QNF----------------------PHAAYN--EMVVAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL---------QRKQEAR---------QGGLAYVINSILEQLALSASRQ-----
         C  +K G LFFP+ IT LC  A  P    +  L + G ID   +A++         Q+   +R          G +   + ++ ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL---------QRKQEAR---------QGGLAYVINSILEQLALSASRQ-----

Query:  --EYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE
          ++  +Q   FW Y +  D  LKKALQ NF++P P  PAFP+++          E D++   E
Subjt:  --EYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.6e-2431.38Show/hide
Query:  VVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMVVAPSNEQLSDAV---------------------------------REMV
        +VREFYAN+       + VRGV+V WS  AINA++ L +    H+ + E +  P    + + V                                 +  +
Subjt:  VVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMVVAPSNEQLSDAV---------------------------------REMV

Query:  LPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK------QEARQG
        LPTTH  IVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A    + E   L + G ID   +A++ ++      Q+    
Subjt:  LPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK------QEARQG

Query:  GLAYVINS-----ILEQLAL---SASRQEYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE
          A   +S     +L+QL       S+QE+  +Q   FW Y +  D  LKKALQ NF++P P  PAFP+++          E D++   E
Subjt:  GLAYVINS-----ILEQLAL---SASRQEYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.5e-2030.15Show/hide
Query:  LKRRAEKGKSVVEASEKPDEIEESQFSYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQ
        +KR A K    V+      E E ++  Y+  + N  R   AE   + F+ +   +     F+   I  H W+ FCA PE     +VREFYAN+   V   
Subjt:  LKRRAEKGKSVVEASEKPDEIEESQFSYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQ

Query:  VIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLL
        V VRGV+V WS  AINA++ L +    H+ + E +                                 + P+ +     ++  +LPTTH   VS++R+LL
Subjt:  VIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLL

Query:  AFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK
          ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P    +  L + G ID   +A++ ++
Subjt:  AFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK

A0A2P5BCG4 Uncharacterized protein (Fragment)1.0e-2929.4Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGFSSD-------LPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL
        +F   +A  +Y   +  R    E+GF  D       LP F+   I  H W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGFSSD-------LPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL

Query:  -----------QNF----------------------PHAAYN--EMVVAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEIS
                   QN                          AY      + P+ +     ++  +LPTTH   VS++R+LL  ++L   SI+VG++I +EI 
Subjt:  -----------QNF----------------------PHAAYN--EMVVAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL---------QRKQEAR---------QGGLAYVINSILEQLALSASRQ-----
         C  +K G LFFP+ IT LC  A  P    +  L + G ID   +A++         Q+   +R          G +   + ++ ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL---------QRKQEAR---------QGGLAYVINSILEQLALSASRQ-----

Query:  --EYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE
          ++  +Q   FW Y +  D  LKKALQ NF++P P  PAFP+++          E D++   E
Subjt:  --EYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE

A0A2P5DXM3 Uncharacterized protein7.6e-2531.38Show/hide
Query:  VVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMVVAPSNEQLSDAV---------------------------------REMV
        +VREFYAN+       + VRGV+V WS  AINA++ L +    H+ + E +  P    + + V                                 +  +
Subjt:  VVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHLQN--FPHAAYNEMVVAPSNEQLSDAV---------------------------------REMV

Query:  LPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK------QEARQG
        LPTTH  IVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A    + E   L + G ID   +A++ ++      Q+    
Subjt:  LPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRK------QEARQG

Query:  GLAYVINS-----ILEQLAL---SASRQEYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE
          A   +S     +L+QL       S+QE+  +Q   FW Y +  D  LKKALQ NF++P P  PAFP+++          E D++   E
Subjt:  GLAYVINS-----ILEQLAL---SASRQEYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEE

A0A6A3ASF6 Uncharacterized protein2.8e-1927.54Show/hide
Query:  RFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGW--------ELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL
        +F N+ A+ ++  +  R   FE G       F+ T   D G+        + F   P SVNA +V+EFYANI K     + VRG ++ ++ +AIN  +HL
Subjt:  RFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGW--------ELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHL

Query:  QNF--PHA---------------------------------AYNEMVVAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEIS
        Q     HA                                 + N   + P  +  +  ++  ++PT+H++IVS  R+LL  +++ S  IDVG+IIV ++ 
Subjt:  QNF--PHA---------------------------------AYNEMVVAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRKQEAR-------QGGLAYVINSILEQLALSAS-RQEYGERQAL-----TF
         C  KK   L FPN IT LC +  V  +  D IL     I    L  L   +  +       Q       N+    LAL     Q   +  AL      F
Subjt:  GCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRKQEAR-------QGGLAYVINSILEQLALSAS-RQEYGERQAL-----TF

Query:  WNYVRNIDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPP
        + YV++ DA ++   QE         P FP+++   FN    P P
Subjt:  WNYVRNIDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPP

A0A6A3BU96 Uncharacterized protein9.6e-2027.2Show/hide
Query:  SYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHG-------------WELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPS
        ++ +F N+ A+A++     R+  FE G       F+ T   D G             W  F   P SVNA +V+EFYANI K     + VRG ++ ++  
Subjt:  SYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHG-------------WELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPS

Query:  AINALYHLQNF--PHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVG
        AIN  +HLQ     HA + E                                   + P  +  +  ++  ++PT+H++ VS  R+LL  +++ S  IDVG
Subjt:  AINALYHLQNF--PHAAYNEMV---------------------------------VAPSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVG

Query:  KIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL------QRKQEARQGGLAYV-INSILEQLAL-SASRQEYGERQA
        +IIV ++  C  KK   L FPN IT LC +  V  +  D IL     I    L  L      + K    +  +     N+ +  LAL  A  Q   +  A
Subjt:  KIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQL------QRKQEARQGGLAYV-INSILEQLAL-SASRQEYGERQA

Query:  L-----TFWNYVRNIDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPP
        L      F+ YV++ D  ++   QE         P FP+++   FN    P P
Subjt:  L-----TFWNYVRNIDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACAAGAGCAAGAAAAGAGAGAGACAGCGAGGAAGAAGAGGTACCTGTTACCCCCGAGGTACAGAAGGTGAGAACGAAGAAGAAGAAGACGCTAGAGGAGAA
AGAAGCTAAAAGAAGAAGAATGCAACAACGGGCTGAGGAGCAAGAAGCTGCTCAGAAGGCGACAGAAGATGCTGCTACTACAGTGGAAGAAGGAAATCCGAAGGAACCTG
AAGTGCAGAACCCAGAGGAGGTAGAACCGGTAGAAGCGAATACAGAAAGAGTTCAAGAAAAGAACACTGAAGAGATTCAAGAAAAACAGGCTGAGGAGGTGCAGGAACAT
CATGCAGAGGTTGCACCTGAAGAAGGTAACGAGCAAGAACAGGAGGCTCGAGTGGAGGTGATCATGCTGGAGGTACCCAAACGTCACCGCATTAAGAGAAAAGCGGGTCG
CGTCAAGGTAGTCCGAACTAATACCCCCTCGCCTCCAACCTCTTATTCTGAAAGAGAAAATGCAGAAAGAGAGGAACAACATAAGGAAGAAGCCGAGAAAAAAGCAAGAG
AAGAAGTAGAGAAAGACGTTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGAAAAAGTGTTGTTGAAGCATCGGAAAAACCTGATGAGATAGAAGAGTCACAA
TTTTCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGCTGCTGAAAAGAGATTTCTTGTTTGAGAGAGGATTTAGTAGTGATCTTCCACATTTTCT
GACGACCGGTATTGCAGACCACGGCTGGGAGTTGTTTTGTGCAAAGCCTGAATCTGTGAACGCACAGGTGGTGCGCGAATTTTACGCAAATATTGACAAAGAAGTTGGTT
TCCAAGTAATTGTTCGAGGAGTTGAGGTTGACTGGAGTCCTAGTGCTATTAACGCACTGTATCACCTTCAGAATTTCCCCCACGCAGCATATAATGAGATGGTTGTGGCG
CCATCTAATGAGCAGCTGAGTGATGCTGTGCGGGAGATGGTGCTTCCAACGACTCATGACTCGATAGTCTCTAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTC
TCTCAGCATTGATGTAGGGAAGATTATTGTTAATGAGATATCTGGTTGTTGGAAGAAGAAAGTAGGGAAGCTGTTTTTCCCAAACACAATTACGATGCTATGTAGCAGGG
CAGGAGTGCCCACAGATCCAGAGGATGTGATTCTGTTTGACAAGGGAATCATCGACACGCCTAACTTGGCACAGCTTCAGCGTAAGCAAGAGGCACGTCAGGGTGGGCTT
GCCTATGTCATCAACTCGATTTTAGAACAACTGGCACTGTCGGCCAGTAGGCAAGAGTATGGCGAGAGGCAAGCTTTGACCTTTTGGAACTATGTTAGAAATATTGATGC
CAATCTTAAGAAGGCGCTACAAGAGAATTTTTCCAAACCATATCCAGCCCTTCCTGCATTCCCTGAGGATTTATTTAACCCTTGGATACCACCCCCACCTGTTGAAAGAG
ATGAGGAGAATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTAGAGGTAGTGTTGACTTATGTGATCCAC
TTTAAGCTTAGGTTTAGTCCCACGCTTAGTGGTACAATTCAGAATTATTTTGCTGAAGCAGAGCTTGGTTTTGCAGAATGCTCAGAGGAAAAGCTGGAATTTCCCCAGAA
ATGCGAATGCGATCGCATTTCTGGGAAGGCTAAAATCAAATGCGACCGCATTTCTGGAAAAACTGAGGCAGTTTCGAGTCGTCCACAGGTCGTTTTGAACGAGACCTCTT
CGCACCTAACTGACTGTTTTGACCCTGAAACCGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACAAGAGCAAGAAAAGAGAGAGACAGCGAGGAAGAAGAGGTACCTGTTACCCCCGAGGTACAGAAGGTGAGAACGAAGAAGAAGAAGACGCTAGAGGAGAA
AGAAGCTAAAAGAAGAAGAATGCAACAACGGGCTGAGGAGCAAGAAGCTGCTCAGAAGGCGACAGAAGATGCTGCTACTACAGTGGAAGAAGGAAATCCGAAGGAACCTG
AAGTGCAGAACCCAGAGGAGGTAGAACCGGTAGAAGCGAATACAGAAAGAGTTCAAGAAAAGAACACTGAAGAGATTCAAGAAAAACAGGCTGAGGAGGTGCAGGAACAT
CATGCAGAGGTTGCACCTGAAGAAGGTAACGAGCAAGAACAGGAGGCTCGAGTGGAGGTGATCATGCTGGAGGTACCCAAACGTCACCGCATTAAGAGAAAAGCGGGTCG
CGTCAAGGTAGTCCGAACTAATACCCCCTCGCCTCCAACCTCTTATTCTGAAAGAGAAAATGCAGAAAGAGAGGAACAACATAAGGAAGAAGCCGAGAAAAAAGCAAGAG
AAGAAGTAGAGAAAGACGTTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGAAAAAGTGTTGTTGAAGCATCGGAAAAACCTGATGAGATAGAAGAGTCACAA
TTTTCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGCTGCTGAAAAGAGATTTCTTGTTTGAGAGAGGATTTAGTAGTGATCTTCCACATTTTCT
GACGACCGGTATTGCAGACCACGGCTGGGAGTTGTTTTGTGCAAAGCCTGAATCTGTGAACGCACAGGTGGTGCGCGAATTTTACGCAAATATTGACAAAGAAGTTGGTT
TCCAAGTAATTGTTCGAGGAGTTGAGGTTGACTGGAGTCCTAGTGCTATTAACGCACTGTATCACCTTCAGAATTTCCCCCACGCAGCATATAATGAGATGGTTGTGGCG
CCATCTAATGAGCAGCTGAGTGATGCTGTGCGGGAGATGGTGCTTCCAACGACTCATGACTCGATAGTCTCTAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTC
TCTCAGCATTGATGTAGGGAAGATTATTGTTAATGAGATATCTGGTTGTTGGAAGAAGAAAGTAGGGAAGCTGTTTTTCCCAAACACAATTACGATGCTATGTAGCAGGG
CAGGAGTGCCCACAGATCCAGAGGATGTGATTCTGTTTGACAAGGGAATCATCGACACGCCTAACTTGGCACAGCTTCAGCGTAAGCAAGAGGCACGTCAGGGTGGGCTT
GCCTATGTCATCAACTCGATTTTAGAACAACTGGCACTGTCGGCCAGTAGGCAAGAGTATGGCGAGAGGCAAGCTTTGACCTTTTGGAACTATGTTAGAAATATTGATGC
CAATCTTAAGAAGGCGCTACAAGAGAATTTTTCCAAACCATATCCAGCCCTTCCTGCATTCCCTGAGGATTTATTTAACCCTTGGATACCACCCCCACCTGTTGAAAGAG
ATGAGGAGAATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTAGAGGTAGTGTTGACTTATGTGATCCAC
TTTAAGCTTAGGTTTAGTCCCACGCTTAGTGGTACAATTCAGAATTATTTTGCTGAAGCAGAGCTTGGTTTTGCAGAATGCTCAGAGGAAAAGCTGGAATTTCCCCAGAA
ATGCGAATGCGATCGCATTTCTGGGAAGGCTAAAATCAAATGCGACCGCATTTCTGGAAAAACTGAGGCAGTTTCGAGTCGTCCACAGGTCGTTTTGAACGAGACCTCTT
CGCACCTAACTGACTGTTTTGACCCTGAAACCGACTAG
Protein sequenceShow/hide protein sequence
MAKTRARKERDSEEEEVPVTPEVQKVRTKKKKTLEEKEAKRRRMQQRAEEQEAAQKATEDAATTVEEGNPKEPEVQNPEEVEPVEANTERVQEKNTEEIQEKQAEEVQEH
HAEVAPEEGNEQEQEARVEVIMLEVPKRHRIKRKAGRVKVVRTNTPSPPTSYSERENAEREEQHKEEAEKKAREEVEKDVEEERLLKRRAEKGKSVVEASEKPDEIEESQ
FSYDRFVNNSARAKYAELLKRDFLFERGFSSDLPHFLTTGIADHGWELFCAKPESVNAQVVREFYANIDKEVGFQVIVRGVEVDWSPSAINALYHLQNFPHAAYNEMVVA
PSNEQLSDAVREMVLPTTHDSIVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCSRAGVPTDPEDVILFDKGIIDTPNLAQLQRKQEARQGGL
AYVINSILEQLALSASRQEYGERQALTFWNYVRNIDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPVERDEENDEEQETFCLSIFSGLVVAAAKKILEVVLTYVIH
FKLRFSPTLSGTIQNYFAEAELGFAECSEEKLEFPQKCECDRISGKAKIKCDRISGKTEAVSSRPQVVLNETSSHLTDCFDPETD