; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006020 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006020
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFanconi-associated nuclease
Genome locationscaffold7:31465947..31473920
RNA-Seq ExpressionSpg006020
SyntenySpg006020
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]3.4e-2329.34Show/hide
Query:  FVNDLARAKYQEVLKRDFLFERGFSNDLPRFLEAGIVNLG-----------WRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMF
        FV++ A+  YQ +  R   FE GF      F EA   NLG           W++F   P PVNA IV+EFY+N+   N   V+VRG+ ++++P  IN  F
Subjt:  FVNDLARAKYQEVLKRDFLFERGFSNDLPRFLEAGIVNLG-----------WRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMF

Query:  DLQ-------DFPHAVFNEMVAAPSSDQLSAAVREMG----------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
         LQ        F   V +E       D      R  G                      F++ +L+PT+H++TVS  R+LL  +IL   +ID+GKII   
Subjt:  DLQ-------DFPHAVFNEMVAAPSSDQLSAAVREMG----------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVYGVNQILEQLSVLTSRLEFAERQAQT------------
           C K++   L FPN IT LC +  V     D I++    ++   +  L   +E +        +++     V  S  +  +   +T            
Subjt:  IVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVYGVNQILEQLSVLTSRLEFAERQAQT------------

Query:  -YWTYAKRRDDALRGAL
         Y+ YAKRRD  L  AL
Subjt:  -YWTYAKRRDDALRGAL

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]5.8e-2328.2Show/hide
Query:  RFVNDLARAKYQEVLKRDFLFERGF-------SNDLPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQ
        +F ND A+A++Q    R+  FE GF           P  ++  ++ L W +F   P  VNA++V+EFYAN+   N   + VRG  ++++   IN  F LQ
Subjt:  RFVNDLARAKYQEVLKRDFLFERGF-------SNDLPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQ

Query:  DF--PHAVFNEMVAAPSSDQL------------------SAAVREM---------GFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD
        +    HA+F E   +   D +                   +  RE           F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++ D
Subjt:  DF--PHAVFNEMVAAPSSDQL------------------SAAVREM---------GFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD

Query:  CWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVY----GVNQILEQLSVLTSRLEFAERQAQ---------TYW
        C  KK   L FPN IT LC +  V     D I+     I    L  L  ++  +    V+    G  +   ++ +L       + QAQ          ++
Subjt:  CWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVY----GVNQILEQLSVLTSRLEFAERQAQ---------TYW

Query:  TYAKRRDDALRGALQTNFSTPYPAFPMFPDDL---FNLWIPPPP
         Y K RD  +    Q         FP FPD++   FN    P P
Subjt:  TYAKRRDDALRGALQTNFSTPYPAFPMFPDDL---FNLWIPPPP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.4e-2434.54Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS E IN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD

Query:  LQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F E +       +   V   G                           F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   + R+
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]7.8e-3634.39Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS E IN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD

Query:  LQD--FPHAVF------------NEMVAAPSSD---------------QLSAAVREMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F             E VAA  ++                  AA     F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVF------------NEMVAAPSSD---------------QLSAAVREMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLTSR------------
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   + R+ +        Q           N+    IL+QL  L  R            
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLTSR------------

Query:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL
          L+   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++
Subjt:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.2e-2935.42Show/hide
Query:  IVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRL
        +VREFYANL    +  + VRGV V WS E IN +F L D    H+ F E +  P    +   V   G                           F++ RL
Subjt:  IVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRL

Query:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL---------QRMQEV
        LPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A  P +  +  + + G ID   + R+         Q+    
Subjt:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL---------QRMQEV

Query:  RQGGLVYG--VNQILEQLSVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL
        R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++
Subjt:  RQGGLVYG--VNQILEQLSVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)6.7e-2534.54Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS E IN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD

Query:  LQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F E +       +   V   G                           F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   + R+
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL

A0A2P5BCG4 Uncharacterized protein (Fragment)3.8e-3634.39Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS E IN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFSND-------LPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFD

Query:  LQD--FPHAVF------------NEMVAAPSSD---------------QLSAAVREMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F             E VAA  ++                  AA     F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVF------------NEMVAAPSSD---------------QLSAAVREMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLTSR------------
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   + R+ +        Q           N+    IL+QL  L  R            
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLTSR------------

Query:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL
          L+   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++
Subjt:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL

A0A2P5DXM3 Uncharacterized protein2.0e-2935.42Show/hide
Query:  IVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRL
        +VREFYANL    +  + VRGV V WS E IN +F L D    H+ F E +  P    +   V   G                           F++ RL
Subjt:  IVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQD--FPHAVFNEMVAAPSSDQLSAAVREMG---------------------------FIRLRL

Query:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL---------QRMQEV
        LPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A  P +  +  + + G ID   + R+         Q+    
Subjt:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRL---------QRMQEV

Query:  RQGGLVYG--VNQILEQLSVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL
        R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++
Subjt:  RQGGLVYG--VNQILEQLSVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDL

A0A6A2ZUE4 Uncharacterized protein1.7e-2329.34Show/hide
Query:  FVNDLARAKYQEVLKRDFLFERGFSNDLPRFLEAGIVNLG-----------WRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMF
        FV++ A+  YQ +  R   FE GF      F EA   NLG           W++F   P PVNA IV+EFY+N+   N   V+VRG+ ++++P  IN  F
Subjt:  FVNDLARAKYQEVLKRDFLFERGFSNDLPRFLEAGIVNLG-----------WRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMF

Query:  DLQ-------DFPHAVFNEMVAAPSSDQLSAAVREMG----------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
         LQ        F   V +E       D      R  G                      F++ +L+PT+H++TVS  R+LL  +IL   +ID+GKII   
Subjt:  DLQ-------DFPHAVFNEMVAAPSSDQLSAAVREMG----------------------FIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVYGVNQILEQLSVLTSRLEFAERQAQT------------
           C K++   L FPN IT LC +  V     D I++    ++   +  L   +E +        +++     V  S  +  +   +T            
Subjt:  IVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVYGVNQILEQLSVLTSRLEFAERQAQT------------

Query:  -YWTYAKRRDDALRGAL
         Y+ YAKRRD  L  AL
Subjt:  -YWTYAKRRDDALRGAL

A0A6A3BU96 Uncharacterized protein2.8e-2328.2Show/hide
Query:  RFVNDLARAKYQEVLKRDFLFERGF-------SNDLPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQ
        +F ND A+A++Q    R+  FE GF           P  ++  ++ L W +F   P  VNA++V+EFYAN+   N   + VRG  ++++   IN  F LQ
Subjt:  RFVNDLARAKYQEVLKRDFLFERGF-------SNDLPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQ

Query:  DF--PHAVFNEMVAAPSSDQL------------------SAAVREM---------GFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD
        +    HA+F E   +   D +                   +  RE           F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++ D
Subjt:  DF--PHAVFNEMVAAPSSDQL------------------SAAVREM---------GFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD

Query:  CWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVY----GVNQILEQLSVLTSRLEFAERQAQ---------TYW
        C  KK   L FPN IT LC +  V     D I+     I    L  L  ++  +    V+    G  +   ++ +L       + QAQ          ++
Subjt:  CWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNLVRLQRMQEVRQGGLVY----GVNQILEQLSVLTSRLEFAERQAQ---------TYW

Query:  TYAKRRDDALRGALQTNFSTPYPAFPMFPDDL---FNLWIPPPP
         Y K RD  +    Q         FP FPD++   FN    P P
Subjt:  TYAKRRDDALRGALQTNFSTPYPAFPMFPDDL---FNLWIPPPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTTGCGGTTCAAGAAGGGAATGTCGAGAAAAATCAAGAAACAGAGGGTGAAGAGCAGGTCGAAGGTGAGCTTGGCGAGAAGAAAACACCGGAACAGGAGACTCA
TGTGGAAATCATTATGCTGGAGCCACCAAAGCGCCGCCGCATCAAGAGGAAGGCGGGTCGCGTGAAGATGGTTCGTACCATGCCATCACCTCCGACGTCAGACTTTGAGA
ACGAAAGAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCGAGAAAGGCAGAGGAAGAGCGGTTGCGCATCCAGAGAGAAAACAGGGGCAAAGGTATTGCCGAAGCA
TCAGGAGAAATTGAGGAGTTGAGGGCGCCATTCATTCGCTTCGTCAACGATCTTGCCCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTTGTTCGAACGTGGATT
TAGCAATGACTTACCTAGGTTTTTGGAGGCTGGAATAGTGAACCTCGGTTGGAGGCAATTCTGTGCGAAGCCAGAACCTGTGAATGCCAACATAGTTCGAGAGTTTTACG
CCAACCTTGACATTAAGAATGATTTTGAGGTCATCGTGCGAGGAGTGCCTGTACAGTGGAGCCCTGAGGTCATCAATGAAATGTTCGATCTCCAAGATTTTCCGCATGCT
GTTTTTAATGAGATGGTGGCTGCACCATCTAGTGATCAGCTGAGTGCGGCTGTCCGGGAGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACAGT
ATCTCGGGATAGGGTATTGCTTGCCTTTGCCATTCTTCGCTCGATGAGTATAGATGTAGGAAAGATTATTTCTTCTGAGATTGTTGATTGCTGGAAGAAGAAGGTGGGGA
AGCTGTTCTTTCCGAACACTATCACGATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGGCTGATAAGGGAATAATTGACACACCTAATCTG
GTGCGGCTTCAGCGTATGCAAGAGGTTCGCCAGGGTGGGCTTGTGTATGGCGTCAATCAAATCCTAGAGCAACTATCAGTGTTGACCAGTAGGTTAGAGTTTGCTGAAAG
GCAAGCTCAGACCTACTGGACTTATGCTAAGAGGAGAGATGATGCGCTCAGAGGGGCCTTGCAAACTAATTTCTCAACACCGTATCCGGCCTTTCCAATGTTTCCCGATG
ATTTGTTTAATCTTTGGATACCACCCCCGCCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTTGCGGTTCAAGAAGGGAATGTCGAGAAAAATCAAGAAACAGAGGGTGAAGAGCAGGTCGAAGGTGAGCTTGGCGAGAAGAAAACACCGGAACAGGAGACTCA
TGTGGAAATCATTATGCTGGAGCCACCAAAGCGCCGCCGCATCAAGAGGAAGGCGGGTCGCGTGAAGATGGTTCGTACCATGCCATCACCTCCGACGTCAGACTTTGAGA
ACGAAAGAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCGAGAAAGGCAGAGGAAGAGCGGTTGCGCATCCAGAGAGAAAACAGGGGCAAAGGTATTGCCGAAGCA
TCAGGAGAAATTGAGGAGTTGAGGGCGCCATTCATTCGCTTCGTCAACGATCTTGCCCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTTGTTCGAACGTGGATT
TAGCAATGACTTACCTAGGTTTTTGGAGGCTGGAATAGTGAACCTCGGTTGGAGGCAATTCTGTGCGAAGCCAGAACCTGTGAATGCCAACATAGTTCGAGAGTTTTACG
CCAACCTTGACATTAAGAATGATTTTGAGGTCATCGTGCGAGGAGTGCCTGTACAGTGGAGCCCTGAGGTCATCAATGAAATGTTCGATCTCCAAGATTTTCCGCATGCT
GTTTTTAATGAGATGGTGGCTGCACCATCTAGTGATCAGCTGAGTGCGGCTGTCCGGGAGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACAGT
ATCTCGGGATAGGGTATTGCTTGCCTTTGCCATTCTTCGCTCGATGAGTATAGATGTAGGAAAGATTATTTCTTCTGAGATTGTTGATTGCTGGAAGAAGAAGGTGGGGA
AGCTGTTCTTTCCGAACACTATCACGATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGGCTGATAAGGGAATAATTGACACACCTAATCTG
GTGCGGCTTCAGCGTATGCAAGAGGTTCGCCAGGGTGGGCTTGTGTATGGCGTCAATCAAATCCTAGAGCAACTATCAGTGTTGACCAGTAGGTTAGAGTTTGCTGAAAG
GCAAGCTCAGACCTACTGGACTTATGCTAAGAGGAGAGATGATGCGCTCAGAGGGGCCTTGCAAACTAATTTCTCAACACCGTATCCGGCCTTTCCAATGTTTCCCGATG
ATTTGTTTAATCTTTGGATACCACCCCCGCCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGAAGACTGA
Protein sequenceShow/hide protein sequence
MSVAVQEGNVEKNQETEGEEQVEGELGEKKTPEQETHVEIIMLEPPKRRRIKRKAGRVKMVRTMPSPPTSDFENERREAENKEKEEEARKAEEERLRIQRENRGKGIAEA
SGEIEELRAPFIRFVNDLARAKYQEVLKRDFLFERGFSNDLPRFLEAGIVNLGWRQFCAKPEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEVINEMFDLQDFPHA
VFNEMVAAPSSDQLSAAVREMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVPTVPEDMIMADKGIIDTPNL
VRLQRMQEVRQGGLVYGVNQILEQLSVLTSRLEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPMFPDDLFNLWIPPPPVEREEDVDEEQGQED