; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004693 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004693
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFanconi-associated nuclease
Genome locationscaffold5:20280496..20290576
RNA-Seq ExpressionSpg004693
SyntenySpg004693
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]1.4e-1925.31Show/hide
Query:  FVNNLVRAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNF
        FV+   +  Y  +  R   FE GF      + +L   +   +  H W++F   P  VNA +V+EFY+NI +      +VRGI + ++P AIN  + LQ  
Subjt:  FVNNLVRAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNF

Query:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQ------------------------------ERVLLAFAILRSLSIDVGKIIADEISGC
                A    +E     + ++ + G +W   + +++T                                +R+LL  +IL   +ID+GKII +    C
Subjt:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQ------------------------------ERVLLAFAILRSLSIDVGKIIADEISGC

Query:  WKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFW
         K++   L FPN IT LC++ +V E   D IL     ++   +  L   +EA+                V   +T LEQ A+  + Q   +   + + ++
Subjt:  WKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFW

Query:  NYVRNRDANLKKALQENFSK
         Y + RDA L  AL E+  +
Subjt:  NYVRNRDANLKKALQENFSK

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]4.2e-1925.76Show/hide
Query:  LPYDRFVNNLVRAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALY
        + + +F N+  +A++     R+  FE GF       G     +   +    W +F   P SVNA +V+EFYANI K       VRG ++ ++  AIN  +
Subjt:  LPYDRFVNNLVRAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALY

Query:  NLQNF--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQ------------------------------ERVLLAFAILRSLSIDVGKII
        +LQ     HA + E A    + +    + ++  E  +W   +T + +                                 R+LL  +++ S  IDVG+II
Subjt:  NLQNF--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQ------------------------------ERVLLAFAILRSLSIDVGKII

Query:  ADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL--
          ++  C  KK   L FPN IT LC++ +V EN  D IL     I    L  L   +  +    V+         N  +  LAL  A  Q  A+  AL  
Subjt:  ADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL--

Query:  ---TFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQE
            F+ YV++RD  ++   QE         P F +++L  +      E E D  + P  +
Subjt:  ---TFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.0e-2030.77Show/hide
Query:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI
        E++    R+ NN        +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+         VRG++V WS  AI
Subjt:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI

Query:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK
        NA++ L + P   ++E     +   L   +  V + GA+W +S                                 K   ++R+LL  ++L   SI+VG+
Subjt:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK

Query:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQE
        +I  EI  C  +K G LFFP+ IT LC+ AR P    +  L + G ID   +AR+  TQE
Subjt:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.4e-3028.34Show/hide
Query:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI
        E++    R+ NN        +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E     VRG++V WS  AI
Subjt:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI

Query:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK
        NA++ L + P   ++E     + + L   +  V   GA+W +S                                 K   ++R+LL  ++L   SI+VG+
Subjt:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK

Query:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSAS
        +I  EI  C  +K G LFFP+ IT LC+ AR P    +  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    
Subjt:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSAS

Query:  RQ-------EFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ
        +Q       +   +Q   FW Y + RD  LKKALQ NF++P P  PAF +++L         E + DG  +  +
Subjt:  RQ-------EFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.6e-2331.31Show/hide
Query:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------
        +VREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S                          
Subjt:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------

Query:  ------EKRTFQERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARL------QRTQE---
               K   ++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A    NE    L + G ID   +AR+      + TQ+   
Subjt:  ------EKRTFQERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARL------QRTQE---

Query:  --------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ
                +R  G V      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAF +++L         E + DG  +  +
Subjt:  --------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.8e-2130.77Show/hide
Query:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI
        E++    R+ NN        +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+         VRG++V WS  AI
Subjt:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI

Query:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK
        NA++ L + P   ++E     +   L   +  V + GA+W +S                                 K   ++R+LL  ++L   SI+VG+
Subjt:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK

Query:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQE
        +I  EI  C  +K G LFFP+ IT LC+ AR P    +  L + G ID   +AR+  TQE
Subjt:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.1e-3028.34Show/hide
Query:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI
        E++    R+ NN        +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E     VRG++V WS  AI
Subjt:  ESQLPYDRFVNNLVRAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAI

Query:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK
        NA++ L + P   ++E     + + L   +  V   GA+W +S                                 K   ++R+LL  ++L   SI+VG+
Subjt:  NALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------------EKRTFQERVLLAFAILRSLSIDVGK

Query:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSAS
        +I  EI  C  +K G LFFP+ IT LC+ AR P    +  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    
Subjt:  IIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSAS

Query:  RQ-------EFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ
        +Q       +   +Q   FW Y + RD  LKKALQ NF++P P  PAF +++L         E + DG  +  +
Subjt:  RQ-------EFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ

A0A2P5DXM3 Uncharacterized protein8.0e-2431.31Show/hide
Query:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------
        +VREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S                          
Subjt:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKT------------------------

Query:  ------EKRTFQERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARL------QRTQE---
               K   ++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A    NE    L + G ID   +AR+      + TQ+   
Subjt:  ------EKRTFQERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARL------QRTQE---

Query:  --------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ
                +R  G V      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAF +++L         E + DG  +  +
Subjt:  --------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPVEREGDGEEDPGQ

A0A6A2ZUE4 Uncharacterized protein7.0e-2025.31Show/hide
Query:  FVNNLVRAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNF
        FV+   +  Y  +  R   FE GF      + +L   +   +  H W++F   P  VNA +V+EFY+NI +      +VRGI + ++P AIN  + LQ  
Subjt:  FVNNLVRAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALYNLQNF

Query:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQ------------------------------ERVLLAFAILRSLSIDVGKIIADEISGC
                A    +E     + ++ + G +W   + +++T                                +R+LL  +IL   +ID+GKII +    C
Subjt:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQ------------------------------ERVLLAFAILRSLSIDVGKIIADEISGC

Query:  WKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFW
         K++   L FPN IT LC++ +V E   D IL     ++   +  L   +EA+                V   +T LEQ A+  + Q   +   + + ++
Subjt:  WKKKVGKLFFPNTITMLCKRARVPENEGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFW

Query:  NYVRNRDANLKKALQENFSK
         Y + RDA L  AL E+  +
Subjt:  NYVRNRDANLKKALQENFSK

A0A6J1DYG0 uncharacterized protein LOC1110257643.2e-1753.23Show/hide
Query:  EQNEQQNNQAENPILVLPQ----QNKQALPQQNAKSSLETMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPSDTEHPRREGKEQVKAV
        +QN+Q         +  PQ    Q  Q  P QN  S+LE MMKEYMARTDA IQS  ASMR  E Q+GQLANELK RPQG  P  TE P+REGKEQ KAV
Subjt:  EQNEQQNNQAENPILVLPQ----QNKQALPQQNAKSSLETMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPSDTEHPRREGKEQVKAV

Query:  TLRSGKPLEERVEPSKTQAVEKNG
        TLRSG   +E   P+    +   G
Subjt:  TLRSGKPLEERVEPSKTQAVEKNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGGAGAGAGCAGCGTAGACAGCAAAATCAAATGGCTGACGTGCC
GCGTCCCCCGCAGGGTCCAGAAGATCCAGCTGATCCCCAGCAGAATCGTGTGCTGCAGCAAAACCCGCCGCTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATC
CTATCTTGGTATTGCCCCAGCAAAATAAGCAAGCTTTACCCCAGCAAAATGCAAAGAGTTCTCTTGAGACGATGATGAAAGAATATATGGCTCGCACAGATGCCGCTATT
CAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGCCAGCTAGCCAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTAG
AAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGTGGTAAGCCATTGGAAGAAAGAGTAGAGCCTAGTAAAACCCAAGCTGTAGAAAAAAATGGTGATAAAA
ATGTTGTTGTTGAGAAAGAGTTGGAGACTGTGGGACCTAATGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATCGGCTTAAACTCATTAACCAAGTTAGTCTT
CATTCGTTAACTGTGGGTCATTCCACTAAAGACCCTCAGCTGCACTCTTCTCACTGCAGAATATTTCTGTGTCCACGGATATCGACCAATACTACAAGTCAGTCCTTCAC
GTGTGTTCCACCGAAAGTAAAGGCGAAGAAGAAGAAGACACCAGAAGAAAAAGAAGCTAAAAGAAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAGTTGTAGAGAAGG
TGGTGGAAGATGTCGCTGCCACGGTGGTTGAAGAGGATCCGAAAGAACAAGAAGAACAAAACCCAGAGCAGACTGAGCCAGGTGTTGCGGATACCGAGGAAGTTCGAGAG
GAAAATACAGAGGAAGTTCGAGAAGAAATTACAGAGGAAGTTCGAGAAGAAATTACAAAGGAAGTTCCAGAAAAGCAGGCCGAGGTTGTGCAAGAAGAACAGGTAGAGGT
TGCACCTGAGGAAGTTAATGAGCAAGAACGGGAGGCTCGGGTGGAGGTGATCATGTCGGAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGGTAG
TCCGAACTGATACCCCCTCGCCTCCAACTACTGATTCTGAAAGAGAGAATGCAGAAAGAGAAGAGCGTGAGAAGAAGGAGGCCGAGGATAAAGCAAGAGAGGAAGCAGAG
AAAAAGGCTGAAGAAGAAAGATTGTGCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCTGCAGCAACCGAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGA
TCGCTTTGTCAACAATCTTGTCAGAGCAAAGTATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACCGGTA
TTGCAGACCACTGTTGGGAACGGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGGTGCGCGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTCCTAGCGATT
GTTCGAGGTATTGAGGTCGACTGGAGTCCTGGTGCTATTAATGCACTGTATAACCTTCAAAATTTCCCCCACGCAGCATATAATGAGATGGCTGTAGCGCCATCCAATGA
GCAGCTGAGTGACGCTGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCGGCTTTCGAAAACAGAGAAGAGGACGTTCCAGGAACGAGTGCTTCTGGCTTTCGCTATTT
TGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATCTGGATGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGC
AAGCGAGCAAGGGTTCCAGAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGG
TGGGCTGGTCTACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGCGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATC
GTGATGCCAATCTGAAGAAGGCGCTACAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCTTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCCTGTC
GAGAGAGAAGGAGATGGAGAAGAAGATCCTGGTCAGGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGGAGAGAGCAGCGTAGACAGCAAAATCAAATGGCTGACGTGCC
GCGTCCCCCGCAGGGTCCAGAAGATCCAGCTGATCCCCAGCAGAATCGTGTGCTGCAGCAAAACCCGCCGCTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATC
CTATCTTGGTATTGCCCCAGCAAAATAAGCAAGCTTTACCCCAGCAAAATGCAAAGAGTTCTCTTGAGACGATGATGAAAGAATATATGGCTCGCACAGATGCCGCTATT
CAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGCCAGCTAGCCAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTAG
AAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGTGGTAAGCCATTGGAAGAAAGAGTAGAGCCTAGTAAAACCCAAGCTGTAGAAAAAAATGGTGATAAAA
ATGTTGTTGTTGAGAAAGAGTTGGAGACTGTGGGACCTAATGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATCGGCTTAAACTCATTAACCAAGTTAGTCTT
CATTCGTTAACTGTGGGTCATTCCACTAAAGACCCTCAGCTGCACTCTTCTCACTGCAGAATATTTCTGTGTCCACGGATATCGACCAATACTACAAGTCAGTCCTTCAC
GTGTGTTCCACCGAAAGTAAAGGCGAAGAAGAAGAAGACACCAGAAGAAAAAGAAGCTAAAAGAAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAGTTGTAGAGAAGG
TGGTGGAAGATGTCGCTGCCACGGTGGTTGAAGAGGATCCGAAAGAACAAGAAGAACAAAACCCAGAGCAGACTGAGCCAGGTGTTGCGGATACCGAGGAAGTTCGAGAG
GAAAATACAGAGGAAGTTCGAGAAGAAATTACAGAGGAAGTTCGAGAAGAAATTACAAAGGAAGTTCCAGAAAAGCAGGCCGAGGTTGTGCAAGAAGAACAGGTAGAGGT
TGCACCTGAGGAAGTTAATGAGCAAGAACGGGAGGCTCGGGTGGAGGTGATCATGTCGGAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGGTAG
TCCGAACTGATACCCCCTCGCCTCCAACTACTGATTCTGAAAGAGAGAATGCAGAAAGAGAAGAGCGTGAGAAGAAGGAGGCCGAGGATAAAGCAAGAGAGGAAGCAGAG
AAAAAGGCTGAAGAAGAAAGATTGTGCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCTGCAGCAACCGAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGA
TCGCTTTGTCAACAATCTTGTCAGAGCAAAGTATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACCGGTA
TTGCAGACCACTGTTGGGAACGGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGGTGCGCGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTCCTAGCGATT
GTTCGAGGTATTGAGGTCGACTGGAGTCCTGGTGCTATTAATGCACTGTATAACCTTCAAAATTTCCCCCACGCAGCATATAATGAGATGGCTGTAGCGCCATCCAATGA
GCAGCTGAGTGACGCTGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCGGCTTTCGAAAACAGAGAAGAGGACGTTCCAGGAACGAGTGCTTCTGGCTTTCGCTATTT
TGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATCTGGATGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGC
AAGCGAGCAAGGGTTCCAGAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGG
TGGGCTGGTCTACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGCGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATC
GTGATGCCAATCTGAAGAAGGCGCTACAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCTTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCCTGTC
GAGAGAGAAGGAGATGGAGAAGAAGATCCTGGTCAGGAGGATTGA
Protein sequenceShow/hide protein sequence
MNDPPGVRFELDPEIERTFRIRRREQRRQQNQMADVPRPPQGPEDPADPQQNRVLQQNPPLEQNEQQNNQAENPILVLPQQNKQALPQQNAKSSLETMMKEYMARTDAAI
QSNQASMRALELQMGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERVEPSKTQAVEKNGDKNVVVEKELETVGPNGPTDQKLQRYETNRLKLINQVSL
HSLTVGHSTKDPQLHSSHCRIFLCPRISTNTTSQSFTCVPPKVKAKKKKTPEEKEAKRRRRQQRAEDQEVVEKVVEDVAATVVEEDPKEQEEQNPEQTEPGVADTEEVRE
ENTEEVREEITEEVREEITKEVPEKQAEVVQEEQVEVAPEEVNEQEREARVEVIMSEVPKRRRIKRKAGRVKVVRTDTPSPPTTDSERENAEREEREKKEAEDKAREEAE
KKAEEERLCKQRADRGKSVAAATEEPDEIEESQLPYDRFVNNLVRAKYAELLKRDFLFERGFSGDLPHFLRTGIADHCWERFCSKPESVNAQVVREFYANIDKEEGFLAI
VRGIEVDWSPGAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLC
KRARVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPFPALPAFLEDLLNPWIPPPPV
EREGDGEEDPGQED