CuGenDBv2

Spg037290 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037290
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Fanconi-associated nuclease
Genome location: scaffold7:40074043..40075696
RNA-Seq Expression: Spg037290
Synteny: Spg037290
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits (columns: e value, % identity, alignment)

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 2.7e-26; % identity: 28.18)
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V  S  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNL

Query:  QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   + + L   +  V   GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQ---------------GGLVYDINMILEQLALSPSRQ----
          C  +K G LFFP+ IT LC+ A       +  L + G ID   +AR+ +    +  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQ---------------GGLVYDINMILEQLALSPSRQ----

Query:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ
           +   +Q   FW Y K RD  LKKALQ NF++P P  PAFP+++L         E  +KD  NE  +
Subjt:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e value: 2.0e-21; % identity: 30.17)
Query:  VVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNLQNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------
        +VREFYAN+   +   + VRGV+V  S  AINA++ L + P   ++E +   +  +L   +  V   GA W                             
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNLQNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------

Query:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARL--------------
                    +R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC+ A  L +E    L + G ID   +AR+              
Subjt:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARL--------------

Query:  HRMQEVRQGGLVYDINMILEQLALSPSRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ
         R           D+   L+ L    S+QE   +Q   FW Y K RD  LKKALQ NF++P P  PAFP+++L         E  +KD  NE  +
Subjt:  HRMQEVRQGGLVYDINMILEQLALSPSRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ

XP_004514775.1 uncharacterized protein LOC101493401 isoform X2 [Cicer arietinum] (e value: 1.4e-19; % identity: 24.8)
Query:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY
        + +F+N   + K+  L+K R+F  E GFS +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V  +P  +N  +
Subjt:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY

Query:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++ D +++++  I   +L R      M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS

Query:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E G+  D  EPG
Subjt:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG

XP_004514776.1 uncharacterized protein LOC101493401 isoform X3 [Cicer arietinum] (e value: 1.4e-19; % identity: 24.8)
Query:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY
        + +F+N   + K+  L+K R+F  E GFS +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V  +P  +N  +
Subjt:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY

Query:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++ D +++++  I   +L R      M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS

Query:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E G+  D  EPG
Subjt:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG

XP_012575515.1 uncharacterized protein LOC101493401 isoform X1 [Cicer arietinum] (e value: 1.4e-19; % identity: 24.8)
Query:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY
        + +F+N   + K+  L+K R+F  E GFS +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V  +P  +N  +
Subjt:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY

Query:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++ D +++++  I   +L R      M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS

Query:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E G+  D  EPG
Subjt:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG

TrEMBL top hits (columns: e value, % identity, alignment)

A0A1S2Z472 uncharacterized protein LOC101493401 isoform X2 (e value: 7.0e-20; % identity: 24.8)
Query:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY
        + +F+N   + K+  L+K R+F  E GFS +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V  +P  +N  +
Subjt:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY

Query:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++ D +++++  I   +L R      M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS

Query:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E G+  D  EPG
Subjt:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG

A0A1S2Z475 uncharacterized protein LOC101493401 isoform X3 (e value: 7.0e-20; % identity: 24.8)
Query:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY
        + +F+N   + K+  L+K R+F  E GFS +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V  +P  +N  +
Subjt:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY

Query:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++ D +++++  I   +L R      M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS

Query:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E G+  D  EPG
Subjt:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG

A0A1S3EI57 uncharacterized protein LOC101493401 isoform X1 (e value: 7.0e-20; % identity: 24.8)
Query:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY
        + +F+N   + K+  L+K R+F  E GFS +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V  +P  +N  +
Subjt:  YDRFVNNSARAKYAELLK-RDFLFERGFSGD-------LPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALY

Query:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++ D +++++  I   +L R      M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQGGLVYDINMIL-----------EQLALS

Query:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E G+  D  EPG
Subjt:  PSRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPG

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 1.3e-26; % identity: 28.18)
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V  S  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNL

Query:  QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   + + L   +  V   GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQ---------------GGLVYDINMILEQLALSPSRQ----
          C  +K G LFFP+ IT LC+ A       +  L + G ID   +AR+ +    +  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHR---MQEVRQ---------------GGLVYDINMILEQLALSPSRQ----

Query:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ
           +   +Q   FW Y K RD  LKKALQ NF++P P  PAFP+++L         E  +KD  NE  +
Subjt:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ

A0A2P5DXM3 Uncharacterized protein (e value: 9.8e-22; % identity: 30.17)
Query:  VVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNLQNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------
        +VREFYAN+   +   + VRGV+V  S  AINA++ L + P   ++E +   +  +L   +  V   GA W                             
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNLQNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------

Query:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARL--------------
                    +R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC+ A  L +E    L + G ID   +AR+              
Subjt:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVLEDEGDVILLDKGIIDTPNLARL--------------

Query:  HRMQEVRQGGLVYDINMILEQLALSPSRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ
         R           D+   L+ L    S+QE   +Q   FW Y K RD  LKKALQ NF++P P  PAFP+++L         E  +KD  NE  +
Subjt:  HRMQEVRQGGLVYDINMILEQLALSPSRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEIGEKDDENEPGQ

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGATATCGATTACCTCTGAGGTGCAAAAGGTAAAGGCGAAGAAGAAAAAGACACCGGAGGAGAA
AGAAGCTAAGAGAAGAATACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAAAGGATGATGCTGTCACAGTAGAAGAAAGAGACCCAAAAGAACTTGAAAATCAGA
ACCCTGAACGGACTAACCTGGCAGTTGAGGATACGCAAGAAATTCAAGAAAAACAGGCTGAGGAGGTGCGAGAACATGCAGAGGTTGCACTGGAAGAAGGAAACGAGCCA
GTTCAAGAAGCTCGTGTTGAGGTCATCATGCCAGAAGTACCGAAGCGATGCCGCATAAAGCGAAAGGCTGTGCGCGTCCAGCGAAGGGCAGAAAAGGGCAAAAGTGTTGC
TGAAGCATCAGAAGAACCTGATGAGATAGATGAGCCACAGTTGCCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGTTGCTCAAAAGAGACTTCC
TGTTTGAGAGAGGATTCAGCGGTGATCTTCCGCATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCAAAGCCTGAGTCTGTGAACGCACAGGTG
GTGCGCGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAGGTTGACTTGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCA
GAATTTCCCCCACACAGCGTATAATGAGATGGTTGTGGTGCCATCTAATGAGCAGCTGAGTGACGCTGTGCGGGAAGTGGGAATTGAAGGGGCACGGTGGGAACGAGTTC
TTCTGGCTTTCGCGATTTTGCGGTCTCTCAGCATCGATGTAGGGAAGATTGTTGTGAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCAAAT
ACTATTACCATGCTTTGCAAGAGAGCAGGGGTTCTAGAGGATGAAGGAGATGTTATTTTGTTAGACAAGGGGATCATCGACACGCCTAACTTGGCACGACTTCATCGTAT
GCAAGAGGTGCGTCAGGGTGGGCTTGTCTACGACATCAACATGATTTTAGAACAACTAGCACTTTCGCCCAGTAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCT
GGAACTATGTTAAAAATCGTGATGCCAATCTGAAGAAGGCGCTTCAAGAAAATTTTTCCAAGCCATATCCAGCCCTTCCTGCATTCCCTGAGGATTTATTGAACCCCTGG
ATACCACCCCCACCGGTTGAAATAGGAGAAAAGGATGATGAAAATGAGCCGGGCCAAGAGGACTGA
mRNA sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGATATCGATTACCTCTGAGGTGCAAAAGGTAAAGGCGAAGAAGAAAAAGACACCGGAGGAGAA
AGAAGCTAAGAGAAGAATACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAAAGGATGATGCTGTCACAGTAGAAGAAAGAGACCCAAAAGAACTTGAAAATCAGA
ACCCTGAACGGACTAACCTGGCAGTTGAGGATACGCAAGAAATTCAAGAAAAACAGGCTGAGGAGGTGCGAGAACATGCAGAGGTTGCACTGGAAGAAGGAAACGAGCCA
GTTCAAGAAGCTCGTGTTGAGGTCATCATGCCAGAAGTACCGAAGCGATGCCGCATAAAGCGAAAGGCTGTGCGCGTCCAGCGAAGGGCAGAAAAGGGCAAAAGTGTTGC
TGAAGCATCAGAAGAACCTGATGAGATAGATGAGCCACAGTTGCCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGTTGCTCAAAAGAGACTTCC
TGTTTGAGAGAGGATTCAGCGGTGATCTTCCGCATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCAAAGCCTGAGTCTGTGAACGCACAGGTG
GTGCGCGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAGGTTGACTTGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCA
GAATTTCCCCCACACAGCGTATAATGAGATGGTTGTGGTGCCATCTAATGAGCAGCTGAGTGACGCTGTGCGGGAAGTGGGAATTGAAGGGGCACGGTGGGAACGAGTTC
TTCTGGCTTTCGCGATTTTGCGGTCTCTCAGCATCGATGTAGGGAAGATTGTTGTGAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCAAAT
ACTATTACCATGCTTTGCAAGAGAGCAGGGGTTCTAGAGGATGAAGGAGATGTTATTTTGTTAGACAAGGGGATCATCGACACGCCTAACTTGGCACGACTTCATCGTAT
GCAAGAGGTGCGTCAGGGTGGGCTTGTCTACGACATCAACATGATTTTAGAACAACTAGCACTTTCGCCCAGTAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCT
GGAACTATGTTAAAAATCGTGATGCCAATCTGAAGAAGGCGCTTCAAGAAAATTTTTCCAAGCCATATCCAGCCCTTCCTGCATTCCCTGAGGATTTATTGAACCCCTGG
ATACCACCCCCACCGGTTGAAATAGGAGAAAAGGATGATGAAAATGAGCCGGGCCAAGAGGACTGA
Protein sequence
MAKTRARKERENEEEEISITSEVQKVKAKKKKTPEEKEAKRRIRQQRAEEQEKATKDDAVTVEERDPKELENQNPERTNLAVEDTQEIQEKQAEEVREHAEVALEEGNEP
VQEARVEVIMPEVPKRCRIKRKAVRVQRRAEKGKSVAEASEEPDEIDEPQLPYDRFVNNSARAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWELFCAKPESVNAQV
VREFYANIDKEDGFQVIVRGVEVDLSPSAINALYNLQNFPHTAYNEMVVVPSNEQLSDAVREVGIEGARWERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPN
TITMLCKRAGVLEDEGDVILLDKGIIDTPNLARLHRMQEVRQGGLVYDINMILEQLALSPSRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPAFPEDLLNPW
IPPPPVEIGEKDDENEPGQED