; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030364 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030364
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionstructure-specific endonuclease subunit SLX1
Genome locationtig00153640:2813400..2831791
RNA-Seq ExpressionSgr030364
SyntenySgr030364
Gene Ontology termsGO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0033557 - Slx1-Slx4 complex (cellular component)
GO:0008821 - crossover junction endodeoxyribonuclease activity (molecular function)
GO:0017108 - 5'-flap endonuclease activity (molecular function)
InterPro domainsIPR000305 - GIY-YIG endonuclease
IPR035901 - GIY-YIG endonuclease superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151512.1 structure-specific endonuclease subunit SLX1 [Cucumis sativus]6.0e-5569.77Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKP----WCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEG
        MR+LS TFRC K PISNP + K SSSS++DP   TL VKS+PKP    WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+G
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKP----WCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEG

Query:  GAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        GAKATRAGRPWICACTIHGFKDQSQACEFESKWK+VSRK+ +++K+ D+GK +DDQTLRLLKHRE AL KVK
Subjt:  GAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

XP_008461838.1 PREDICTED: structure-specific endonuclease subunit slx1 [Cucumis melo]5.4e-5668.45Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKA
        MR+LSQTFRC K PISNP +PK SSSST+ P  +    + KPK WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+GGAKA
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKA

Query:  TRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        T+AGRPWICACTIHGFKDQSQACEFESKWK+VSRK+ +++K++D+GKQ+D+QTLRLLKHR+ AL KVK
Subjt:  TRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

XP_022152567.1 structure-specific endonuclease subunit SLX1 [Momordica charantia]1.4e-5671.35Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKP-SSSSTEDPAAVTLSVKS--KPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGG
        MR+LS TFRCHK PISN   PKP SSSST+DP+A TLS +S  KPKPWCVYLIISSN P KTYVGVTLNF+RR                LKQHNGEI+GG
Subjt:  MRVLSQTFRCHKRPISNPDVPKP-SSSSTEDPAAVTLSVKS--KPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGG

Query:  AKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        AKATRAGRPW+CAC IHGFKDQSQACEFESKWKEVSRK+ H+RKK+D  +Q D+Q LRLLK+R+ ALEKVK
Subjt:  AKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

XP_022982643.1 structure-specific endonuclease subunit slx1 isoform X1 [Cucurbita maxima]1.4e-4866.47Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKP--KPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGA
        MR+LSQTFR  K PIS+P+V K SSSST+ P   TL++KS+P  K WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+GGA
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKP--KPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGA

Query:  KATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        KATRAGRPWICAC IHGFK+QSQACEFESKWKE+ RK+ H RK     ++VDDQTLRLLK R+ AL KVK
Subjt:  KATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

XP_038903984.1 structure-specific endonuclease subunit slx1 isoform X1 [Benincasa hispida]9.6e-5367.86Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKA
        MR+LSQTFR  K   SN  + K SSSST+DP  +  S + KPK WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+GGAKA
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKA

Query:  TRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        TR GRPW+CACTIHGFKDQ QACEFESKWKEVSRK+ +++K++DMGKQVDDQTLRLLKHR+ AL KVK
Subjt:  TRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

TrEMBL top hitse value%identityAlignment
A0A0A0LCZ9 GIY-YIG domain-containing protein2.9e-5569.77Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKP----WCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEG
        MR+LS TFRC K PISNP + K SSSS++DP   TL VKS+PKP    WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+G
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKP----WCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEG

Query:  GAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        GAKATRAGRPWICACTIHGFKDQSQACEFESKWK+VSRK+ +++K+ D+GK +DDQTLRLLKHRE AL KVK
Subjt:  GAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

A0A1S3CFH5 structure-specific endonuclease subunit slx12.6e-5668.45Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKA
        MR+LSQTFRC K PISNP +PK SSSST+ P  +    + KPK WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+GGAKA
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKA

Query:  TRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        T+AGRPWICACTIHGFKDQSQACEFESKWK+VSRK+ +++K++D+GKQ+D+QTLRLLKHR+ AL KVK
Subjt:  TRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

A0A6J1DGL6 structure-specific endonuclease subunit SLX16.9e-5771.35Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKP-SSSSTEDPAAVTLSVKS--KPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGG
        MR+LS TFRCHK PISN   PKP SSSST+DP+A TLS +S  KPKPWCVYLIISSN P KTYVGVTLNF+RR                LKQHNGEI+GG
Subjt:  MRVLSQTFRCHKRPISNPDVPKP-SSSSTEDPAAVTLSVKS--KPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGG

Query:  AKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        AKATRAGRPW+CAC IHGFKDQSQACEFESKWKEVSRK+ H+RKK+D  +Q D+Q LRLLK+R+ ALEKVK
Subjt:  AKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

A0A6J1F1V2 structure-specific endonuclease subunit slx11.2e-4867.06Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKP--WCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGA
        MR+LSQTFR  K PIS+P+V K SSSST+ P   TL VKS+P+P  WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+GGA
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKP--WCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGA

Query:  KATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        KATRAGRPWICAC IHGFKDQSQACEFESKWKE+ RK+ H RK     ++VDD TLRLLK R+ AL KVK
Subjt:  KATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

A0A6J1J5C9 structure-specific endonuclease subunit slx1 isoform X17.0e-4966.47Show/hide
Query:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKP--KPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGA
        MR+LSQTFR  K PIS+P+V K SSSST+ P   TL++KS+P  K WCVYLIISSN PIKTYVGVTL+F RR                LKQHNGEI+GGA
Subjt:  MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKP--KPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGA

Query:  KATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        KATRAGRPWICAC IHGFK+QSQACEFESKWKE+ RK+ H RK     ++VDDQTLRLLK R+ AL KVK
Subjt:  KATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK

SwissProt top hitse value%identityAlignment
A8PWH1 Structure-specific endonuclease subunit SLX11.0e-0429.41Show/hide
Query:  PKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRK
        P+ +  Y + S + P +TY+G T +  RR                L+QHNG ++ GA  TR  RPW     ++GF  +  A +FE  W++      H R 
Subjt:  PKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRK

Query:  KNDMGKQVDDQTLRLLKHREGALEKVKSPVTVFTYAPRAAYSSNILTSALNLR
         +D G + +  +     +  G   K   P T  + AP +A S +   S LNLR
Subjt:  KNDMGKQVDDQTLRLLKHREGALEKVKSPVTVFTYAPRAAYSSNILTSALNLR

B6JY16 Structure-specific endonuclease subunit slx14.4e-0833.33Show/hide
Query:  WCVYLIISSNFPIKT-YVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKN
        +C YL++S     ++ Y+G T + ARR                L+QHNGEI+GGA  T+  RPW  AC +HGF  +  A +FE  W+          +++
Subjt:  WCVYLIISSNFPIKT-YVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKN

Query:  DMGKQVDDQTL
             V  QTL
Subjt:  DMGKQVDDQTL

Q5CT62 Structure-specific endonuclease subunit SLX1 homolog2.0e-0533.73Show/hide
Query:  YLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWK
        Y ++S      +Y+G ++N  RR                L+QHNGEI+ GAK T++G PW     + GF D+  A  FE  W+
Subjt:  YLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWK

Q9P7M3 Structure-specific endonuclease subunit slx14.9e-0734.44Show/hide
Query:  WCVYLIISSNFPIK--TYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEV
        +C YL+ S+        Y+G T +  RR                L+QHNGEI GGA  T+ GRPW  +C ++GF ++  A +FE  W+ +
Subjt:  WCVYLIISSNFPIK--TYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEV

Arabidopsis top hitse value%identityAlignment
AT2G30350.2 Excinuclease ABC, C subunit, N-terminal1.6e-0535.62Show/hide
Query:  KTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWK
        +TY+G T+N  RR                ++QHNGEI  GA  T+  RPW     I+GF     A +FE  W+
Subjt:  KTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWK

AT5G43210.1 Excinuclease ABC, C subunit, N-terminal3.8e-3146.67Show/hide
Query:  RVLSQTFRCHKRPISNP-------------DVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLK
        R+LS+TF   K   SNP              VP PSSSS ++ + +      K K W VYLI+S+  PIKTYVG+T +F+RR                LK
Subjt:  RVLSQTFRCHKRPISNP-------------DVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLK

Query:  QHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK
        QHNGEI GGAKA+ AGRPW+CAC I GF   SQA  FESKWK  +RKLP ++K  DM      Q+  LL+HR  AL KV+
Subjt:  QHNGEIEGGAKATRAGRPWICACTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGTGCTTTCTCAGACATTTCGTTGCCATAAGCGCCCAATTTCAAATCCAGACGTTCCGAAACCTTCTTCTTCGTCAACCGAGGATCCAGCTGCTGTGACATTGAG
CGTCAAATCGAAGCCGAAGCCGTGGTGTGTGTATCTCATCATCTCCTCCAATTTCCCCATCAAAACCTATGTCGGGGTCACCCTCAACTTCGCTCGCCGGTTTGTCGAAT
TTCTTCTCTTGGGGTTCTTGTTTAAAGCACAGATTTCTTTGAAACAACATAATGGTGAAATAGAAGGAGGTGCAAAAGCAACTCGTGCAGGACGACCCTGGATCTGTGCG
TGCACAATTCATGGTTTTAAAGACCAAAGTCAAGCTTGTGAGTTTGAATCAAAATGGAAAGAAGTTTCAAGGAAATTGCCCCATCAGAGGAAGAAGAATGATATGGGAAA
GCAGGTGGATGATCAGACATTAAGATTGCTCAAGCACAGGGAAGGAGCTCTGGAAAAAGTGAAGTCACCAGTCACAGTCTTCACATACGCGCCTAGAGCCGCGTACTCCT
CAAACATTCTTACGTCCGCTTTGAACCTCAGAAACTGTGGAAAAAACAGCCAAAAGCTGGTGACCATCACGAACCCGATCGTCAACGGCGTCGATATTATCGGCGGCAGC
CGGAACCTACCTCCGACCGCCCTCTTCACCACCATCTCCGCCGGCAAACACAGCCCATGCCGGAGACGACAAACGTTGCGGCCACCGCCATCAGGGACGCCCGTTTTTTC
CCCACGACGGCGGCCGCCAAGCTCCGGGACGGCTCGTAGACGGTCAGCCGGAGGATTTTGGTGGCCATCAGGTTCCATCTCCGGCCCCAGAAGTCCTGTAATGAAGTGGA
TAGGTAGGGCTCGTCGAAATTGCGAAGATGGTGACGAGAAGAAAGCCAAACTTGGCAGGGAAGCTCAGCGGCAGAACCGGACCATTGTTTGGTTGGCCGGAAATTGGAGG
TGGGTCGTCGGGGATTTCGATGGGGAGGGAGCCGACGGCGAGGAAACGGCCGAGGGAGGAAGAAGAAGCGTGGGAACAAAGAGGGCCTTTGCCGAAAGCAAAGAGGAGAA
GCTTGAAAATCGCGTATTCTTCAACCACCCTGACGTCAGCCTTACAAGTGTTGATGAACGGCGGAAAGAACAGCCAGAAACAGGTGGAGAAGAAGAATCCGACGGTCAGC
GGAGTGGACAACGGAGGAGGCAACGGCGGCAGCCGCCACGTTTCGATCCACACCTTCTTCAACAAGATCTCCGCCGTCAAACAAACGCCGTGGAGGACGAAGAAGCTCGA
CACCTCCCACGTTGGCGGCGCCCGGGACATGTAAAAGAACATGAGCTCATGCATCACCGCCGATACCAGGAAAGTGGCCACGACCGCAACGAGTGGAGCCCACCGCCGTC
CGACGACGCGGGCAGCCGCGAGAAGTGCTGGTCGGTACACGGTGGGGCGGAGGATTCCGGTGACGGAGAGGTTCCATCTGTTTCCCCAGAAGTCTTGGAGCGACGTGGAG
AGGTACGGCCGGTTGAACTGCGGCTGGAGCTCAACTCCGAGAGAGGATCGAACCAGAGTCGCCGCCATGGCAAGGCTGAGCTCGAGGAGGAAATAGATATGGAAGCAGAG
AAGAAGCCAGACGAGCTTCTGAGGGAGGATCTCACTGTAATCATACGCACGCACCAGCGCCGCCACAAGCAAGATCTTCACGCCATAGTTCACCGGCGATTCGTGGCCTT
GCTTTTTCGAGGTGGGTTCATTTGGTTGAACTGGATGGGAAGAGAAGCCAGAGCCCAGAAACTCCCGAATGACATTCCGGCGGCGGATAAAGGACCTTGGCCGACGGCGA
AGAGGAGGAGCTTGAAGTTGGCGAGCCAGGCGACGAAGAACGCCCAGAGAGAGATCCATATGGAAAACCAGACTTTGAAGAAGTTCGGAATCTCATCCTCCATTTTCTAC
TGTTCGCCGGTTTTGTCCGTCGCCGGAACGCCGATGGGTTTTTCAACTCCTGCTCGTTGCCCTGTGATTCAGGTCAGAAAATCAACTCACCGGCGAGAAAAAAAACATTT
TCTAAGAAAATCGCTCAACGCAGCGTACTCGGCCAATGCCTTGACGTCGATCCCACAACGGACGAACTGCGGGAAGAACATCCAGATTCCGGTGACGAACACGAACCCCA
CCGTCAATGGCACCGAGACCACCGCCGGCAGCTGCCACTTCTCCGGCAAGGCCTTCTTCAGCAAGATCTCCGCCGTTAAAGCAACCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGTGCTTTCTCAGACATTTCGTTGCCATAAGCGCCCAATTTCAAATCCAGACGTTCCGAAACCTTCTTCTTCGTCAACCGAGGATCCAGCTGCTGTGACATTGAG
CGTCAAATCGAAGCCGAAGCCGTGGTGTGTGTATCTCATCATCTCCTCCAATTTCCCCATCAAAACCTATGTCGGGGTCACCCTCAACTTCGCTCGCCGGTTTGTCGAAT
TTCTTCTCTTGGGGTTCTTGTTTAAAGCACAGATTTCTTTGAAACAACATAATGGTGAAATAGAAGGAGGTGCAAAAGCAACTCGTGCAGGACGACCCTGGATCTGTGCG
TGCACAATTCATGGTTTTAAAGACCAAAGTCAAGCTTGTGAGTTTGAATCAAAATGGAAAGAAGTTTCAAGGAAATTGCCCCATCAGAGGAAGAAGAATGATATGGGAAA
GCAGGTGGATGATCAGACATTAAGATTGCTCAAGCACAGGGAAGGAGCTCTGGAAAAAGTGAAGTCACCAGTCACAGTCTTCACATACGCGCCTAGAGCCGCGTACTCCT
CAAACATTCTTACGTCCGCTTTGAACCTCAGAAACTGTGGAAAAAACAGCCAAAAGCTGGTGACCATCACGAACCCGATCGTCAACGGCGTCGATATTATCGGCGGCAGC
CGGAACCTACCTCCGACCGCCCTCTTCACCACCATCTCCGCCGGCAAACACAGCCCATGCCGGAGACGACAAACGTTGCGGCCACCGCCATCAGGGACGCCCGTTTTTTC
CCCACGACGGCGGCCGCCAAGCTCCGGGACGGCTCGTAGACGGTCAGCCGGAGGATTTTGGTGGCCATCAGGTTCCATCTCCGGCCCCAGAAGTCCTGTAATGAAGTGGA
TAGGTAGGGCTCGTCGAAATTGCGAAGATGGTGACGAGAAGAAAGCCAAACTTGGCAGGGAAGCTCAGCGGCAGAACCGGACCATTGTTTGGTTGGCCGGAAATTGGAGG
TGGGTCGTCGGGGATTTCGATGGGGAGGGAGCCGACGGCGAGGAAACGGCCGAGGGAGGAAGAAGAAGCGTGGGAACAAAGAGGGCCTTTGCCGAAAGCAAAGAGGAGAA
GCTTGAAAATCGCGTATTCTTCAACCACCCTGACGTCAGCCTTACAAGTGTTGATGAACGGCGGAAAGAACAGCCAGAAACAGGTGGAGAAGAAGAATCCGACGGTCAGC
GGAGTGGACAACGGAGGAGGCAACGGCGGCAGCCGCCACGTTTCGATCCACACCTTCTTCAACAAGATCTCCGCCGTCAAACAAACGCCGTGGAGGACGAAGAAGCTCGA
CACCTCCCACGTTGGCGGCGCCCGGGACATGTAAAAGAACATGAGCTCATGCATCACCGCCGATACCAGGAAAGTGGCCACGACCGCAACGAGTGGAGCCCACCGCCGTC
CGACGACGCGGGCAGCCGCGAGAAGTGCTGGTCGGTACACGGTGGGGCGGAGGATTCCGGTGACGGAGAGGTTCCATCTGTTTCCCCAGAAGTCTTGGAGCGACGTGGAG
AGGTACGGCCGGTTGAACTGCGGCTGGAGCTCAACTCCGAGAGAGGATCGAACCAGAGTCGCCGCCATGGCAAGGCTGAGCTCGAGGAGGAAATAGATATGGAAGCAGAG
AAGAAGCCAGACGAGCTTCTGAGGGAGGATCTCACTGTAATCATACGCACGCACCAGCGCCGCCACAAGCAAGATCTTCACGCCATAGTTCACCGGCGATTCGTGGCCTT
GCTTTTTCGAGGTGGGTTCATTTGGTTGAACTGGATGGGAAGAGAAGCCAGAGCCCAGAAACTCCCGAATGACATTCCGGCGGCGGATAAAGGACCTTGGCCGACGGCGA
AGAGGAGGAGCTTGAAGTTGGCGAGCCAGGCGACGAAGAACGCCCAGAGAGAGATCCATATGGAAAACCAGACTTTGAAGAAGTTCGGAATCTCATCCTCCATTTTCTAC
TGTTCGCCGGTTTTGTCCGTCGCCGGAACGCCGATGGGTTTTTCAACTCCTGCTCGTTGCCCTGTGATTCAGGTCAGAAAATCAACTCACCGGCGAGAAAAAAAACATTT
TCTAAGAAAATCGCTCAACGCAGCGTACTCGGCCAATGCCTTGACGTCGATCCCACAACGGACGAACTGCGGGAAGAACATCCAGATTCCGGTGACGAACACGAACCCCA
CCGTCAATGGCACCGAGACCACCGCCGGCAGCTGCCACTTCTCCGGCAAGGCCTTCTTCAGCAAGATCTCCGCCGTTAAAGCAACCCCATGA
Protein sequenceShow/hide protein sequence
MRVLSQTFRCHKRPISNPDVPKPSSSSTEDPAAVTLSVKSKPKPWCVYLIISSNFPIKTYVGVTLNFARRFVEFLLLGFLFKAQISLKQHNGEIEGGAKATRAGRPWICA
CTIHGFKDQSQACEFESKWKEVSRKLPHQRKKNDMGKQVDDQTLRLLKHREGALEKVKSPVTVFTYAPRAAYSSNILTSALNLRNCGKNSQKLVTITNPIVNGVDIIGGS
RNLPPTALFTTISAGKHSPCRRRQTLRPPPSGTPVFSPRRRPPSSGTARRRSAGGFWWPSGSISGPRSPVMKWIGRARRNCEDGDEKKAKLGREAQRQNRTIVWLAGNWR
WVVGDFDGEGADGEETAEGGRRSVGTKRAFAESKEEKLENRVFFNHPDVSLTSVDERRKEQPETGGEEESDGQRSGQRRRQRRQPPRFDPHLLQQDLRRQTNAVEDEEAR
HLPRWRRPGHVKEHELMHHRRYQESGHDRNEWSPPPSDDAGSREKCWSVHGGAEDSGDGEVPSVSPEVLERRGEVRPVELRLELNSERGSNQSRRHGKAELEEEIDMEAE
KKPDELLREDLTVIIRTHQRRHKQDLHAIVHRRFVALLFRGGFIWLNWMGREARAQKLPNDIPAADKGPWPTAKRRSLKLASQATKNAQREIHMENQTLKKFGISSSIFY
CSPVLSVAGTPMGFSTPARCPVIQVRKSTHRREKKHFLRKSLNAAYSANALTSIPQRTNCGKNIQIPVTNTNPTVNGTETTAGSCHFSGKAFFSKISAVKATP