; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg024548 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg024548
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFanconi-associated nuclease
Genome locationscaffold12:15885961..15887611
RNA-Seq ExpressionSpg024548
SyntenySpg024548
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.0e-2133.07Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D          + T ++++       ++ A   V  +G                                VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.2e-3231.99Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D          + T  +++        + A   V  +G                                VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-S

Query:  SRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL
        S ++   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L
Subjt:  SRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]3.0e-1838.42Show/hide
Query:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE-----------A
        VS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR+      + TQ+           +
Subjt:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE-----------A

Query:  RQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL
        R  G +  +QQ+         QE+ Q H  S ++   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP +LL
Subjt:  RQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.2e-2736Show/hide
Query:  IVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHATFNEMVVAPSNDQLSAAVREVGIE----------------GPSG-----------
        +VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P    +   V   G E                 P+            
Subjt:  IVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHATFNEMVVAPSNDQLSAAVREVGIE----------------GPSG-----------

Query:  --------VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE----
                VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+    
Subjt:  --------VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL
               +R  G V  +QQ++ L Q   S+ E   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L
Subjt:  -------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL

XP_004514776.1 uncharacterized protein LOC101493401 isoform X3 [Cicer arietinum]2.6e-1724.81Show/hide
Query:  QGLPFIRFVNYLARAKYQEMLK-RDFLFERGF---GNE----LPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAV
        +G  F +F+N   + K+  ++K R+F  E GF    NE    LP  L + I+   W  F        ++IVREFY+ + ++++  V+VRGV V ++P+ +
Subjt:  QGLPFIRFVNYLARAKYQEMLK-RDFLFERGF---GNE----LPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAV

Query:  NDLFDLQDFPHATFNEMV-------VAPSNDQLSAAVREVGIEGPS---------------------------------------GVSRDRVLLAFAILR
        N  F+L        N++V          S+++L++ ++ + + G +                                        V +DR+LL + ++ 
Subjt:  NDLFDLQDFPHATFNEMV-------VAPSNDQLSAAVREVGIEGPS---------------------------------------GVSRDRVLLAFAILR

Query:  SMSIDVGKIISSEILDC--WRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL--QRTQEARQGGLV-------CGIQQ-----IQEL
          SI+VGKII  EI+ C   +KK  +L FP+ I+ LC R GV   +DD ++ ++  I   +L R       ++++ G V        G ++      +E 
Subjt:  SMSIDVGKIISSEILDC--WRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL--QRTQEARQGGLV-------CGIQQ-----IQEL

Query:  LQLHSSRMEFA--------------ERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPPPMEGGEEEDENEPG
          + + + +F                +Q + FW + K+     R   + NF +     P FPD++L P++  P  E G+ +D  EPG
Subjt:  LQLHSSRMEFA--------------ERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPPPMEGGEEEDENEPG

TrEMBL top hitse value%identityAlignment
A0A1S2Z475 uncharacterized protein LOC101493401 isoform X31.2e-1724.81Show/hide
Query:  QGLPFIRFVNYLARAKYQEMLK-RDFLFERGF---GNE----LPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAV
        +G  F +F+N   + K+  ++K R+F  E GF    NE    LP  L + I+   W  F        ++IVREFY+ + ++++  V+VRGV V ++P+ +
Subjt:  QGLPFIRFVNYLARAKYQEMLK-RDFLFERGF---GNE----LPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAV

Query:  NDLFDLQDFPHATFNEMV-------VAPSNDQLSAAVREVGIEGPS---------------------------------------GVSRDRVLLAFAILR
        N  F+L        N++V          S+++L++ ++ + + G +                                        V +DR+LL + ++ 
Subjt:  NDLFDLQDFPHATFNEMV-------VAPSNDQLSAAVREVGIEGPS---------------------------------------GVSRDRVLLAFAILR

Query:  SMSIDVGKIISSEILDC--WRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL--QRTQEARQGGLV-------CGIQQ-----IQEL
          SI+VGKII  EI+ C   +KK  +L FP+ I+ LC R GV   +DD ++ ++  I   +L R       ++++ G V        G ++      +E 
Subjt:  SMSIDVGKIISSEILDC--WRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL--QRTQEARQGGLV-------CGIQQ-----IQEL

Query:  LQLHSSRMEFA--------------ERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPPPMEGGEEEDENEPG
          + + + +F                +Q + FW + K+     R   + NF +     P FPD++L P++  P  E G+ +D  EPG
Subjt:  LQLHSSRMEFA--------------ERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPPPMEGGEEEDENEPG

A0A2P5AGA5 Uncharacterized protein (Fragment)2.4e-2133.07Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D          + T ++++       ++ A   V  +G                                VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.0e-3231.99Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D          + T  +++        + A   V  +G                                VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--------FPHATFNEMVVAPSNDQLSAAVREVGIEGP-----------------------------SGVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-S

Query:  SRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL
        S ++   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L
Subjt:  SRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL

A0A2P5CEY2 Uncharacterized protein1.5e-1838.42Show/hide
Query:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE-----------A
        VS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR+      + TQ+           +
Subjt:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE-----------A

Query:  RQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL
        R  G +  +QQ+         QE+ Q H  S ++   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP +LL
Subjt:  RQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL

A0A2P5DXM3 Uncharacterized protein5.9e-2836Show/hide
Query:  IVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHATFNEMVVAPSNDQLSAAVREVGIE----------------GPSG-----------
        +VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P    +   V   G E                 P+            
Subjt:  IVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHATFNEMVVAPSNDQLSAAVREVGIE----------------GPSG-----------

Query:  --------VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE----
                VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+    
Subjt:  --------VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL
               +R  G V  +QQ++ L Q   S+ E   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L
Subjt:  -------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCGAGGAAAGAAAGAGAAAGTGAGGAGGAAGAGATACCCGTTACCCCCGAAGTTCGGAAAGTAAAAACCAAGAAGAAAAGAACGCCGGAAGAGAA
AGAAGTTAAGAGAAGGAGACGGCAACAGAGGGCTGAGGAACGAGAAAATGTCCAGAAAGCAGCAGAAGAAGTGGTTGAAGAAGATCCGCAAGAACCTGTCGTACAGAACC
CCGAGCAGGATGAGCCAAGAGTCGCGGATACAGAGGAAGTCCAAGAAACGGGACACACTGAGGAAAGTCAAGAGCAACAGAATAAGGATATGCAGGCAGAGGGTGCGACT
GAAGAGGAGCCAGTTCAAGAGGCTCGTGTTGAGGTTATCATGCCCGAACCGCCGAAACGTCGCCGCATAAAGCGAAAGGCCGGCCGTATTCCGGTGAATCGAACTGATAC
CCTATCACCGCCATCATCAGATTCTGAGAAAGAGCGAGAGGAAAGAGAGAAAAAAGAAGCTGGGGAAAAAGCGCGAGAAGAAGCAAAGAAGGCTGAGGAAGAGATTTTGC
GCAAGCAAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATCAGGTGCGGCTGACGAGGTTGAAGCACAAGGGTTACCTTTTATTCGCTTCGTCAACTACCTTGCTCGA
GCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAACGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACGGGAATAGAGAACCTCGGCTGGAGCCAATT
TTGTGCGAAACCAGAGCCTGTAAATTCCAACATTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTGGATTGGA
GCCCAGAAGCTGTTAATGACTTGTTTGATCTCCAGGATTTTCCGCATGCAACCTTCAATGAGATGGTGGTTGCCCCATCTAACGACCAGTTAAGTGCGGCTGTCCGAGAG
GTTGGCATTGAGGGGCCCAGTGGAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTTCGTCTGAAATTCT
TGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGATATGATATTACCAGATA
AGGGAATAATTGATACGCCAAATTTGGCTAGGCTTCAGAGGACACAGGAAGCACGCCAAGGGGGTTTGGTGTGCGGCATCCAACAAATTCAGGAGCTGTTGCAATTGCAT
TCCAGCAGAATGGAATTCGCTGAAAGGCAATTTCAGACTTTCTGGGACTATACAAAGAAAAGGGATGTTGCCTTAAGGGTGGCCTTGCAATCAAATTTTTCTGAACCATA
CCCGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCCCCACCAATGGAAGGAGGAGAAGAGGAAGATGAAAATGAACCAGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCGAGGAAAGAAAGAGAAAGTGAGGAGGAAGAGATACCCGTTACCCCCGAAGTTCGGAAAGTAAAAACCAAGAAGAAAAGAACGCCGGAAGAGAA
AGAAGTTAAGAGAAGGAGACGGCAACAGAGGGCTGAGGAACGAGAAAATGTCCAGAAAGCAGCAGAAGAAGTGGTTGAAGAAGATCCGCAAGAACCTGTCGTACAGAACC
CCGAGCAGGATGAGCCAAGAGTCGCGGATACAGAGGAAGTCCAAGAAACGGGACACACTGAGGAAAGTCAAGAGCAACAGAATAAGGATATGCAGGCAGAGGGTGCGACT
GAAGAGGAGCCAGTTCAAGAGGCTCGTGTTGAGGTTATCATGCCCGAACCGCCGAAACGTCGCCGCATAAAGCGAAAGGCCGGCCGTATTCCGGTGAATCGAACTGATAC
CCTATCACCGCCATCATCAGATTCTGAGAAAGAGCGAGAGGAAAGAGAGAAAAAAGAAGCTGGGGAAAAAGCGCGAGAAGAAGCAAAGAAGGCTGAGGAAGAGATTTTGC
GCAAGCAAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATCAGGTGCGGCTGACGAGGTTGAAGCACAAGGGTTACCTTTTATTCGCTTCGTCAACTACCTTGCTCGA
GCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAACGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACGGGAATAGAGAACCTCGGCTGGAGCCAATT
TTGTGCGAAACCAGAGCCTGTAAATTCCAACATTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTGGATTGGA
GCCCAGAAGCTGTTAATGACTTGTTTGATCTCCAGGATTTTCCGCATGCAACCTTCAATGAGATGGTGGTTGCCCCATCTAACGACCAGTTAAGTGCGGCTGTCCGAGAG
GTTGGCATTGAGGGGCCCAGTGGAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTTCGTCTGAAATTCT
TGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGATATGATATTACCAGATA
AGGGAATAATTGATACGCCAAATTTGGCTAGGCTTCAGAGGACACAGGAAGCACGCCAAGGGGGTTTGGTGTGCGGCATCCAACAAATTCAGGAGCTGTTGCAATTGCAT
TCCAGCAGAATGGAATTCGCTGAAAGGCAATTTCAGACTTTCTGGGACTATACAAAGAAAAGGGATGTTGCCTTAAGGGTGGCCTTGCAATCAAATTTTTCTGAACCATA
CCCGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCCCCACCAATGGAAGGAGGAGAAGAGGAAGATGAAAATGAACCAGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERESEEEEIPVTPEVRKVKTKKKRTPEEKEVKRRRRQQRAEERENVQKAAEEVVEEDPQEPVVQNPEQDEPRVADTEEVQETGHTEESQEQQNKDMQAEGAT
EEEPVQEARVEVIMPEPPKRRRIKRKAGRIPVNRTDTLSPPSSDSEKEREEREKKEAGEKAREEAKKAEEEILRKQREDKGKGIAEASGAADEVEAQGLPFIRFVNYLAR
AKYQEMLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNIVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQDFPHATFNEMVVAPSNDQLSAAVRE
VGIEGPSGVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQEARQGGLVCGIQQIQELLQLH
SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPPPMEGGEEEDENEPGQED