; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027306 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027306
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold7:39899673..39901325
RNA-Seq ExpressionSpg027306
SyntenySpg027306
Gene Ontology termsGO:0009987 - cellular process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]4.2e-1725.16Show/hide
Query:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQ--
        FV+ +A+  Y  +  R   FE GF      + +L   +   +  H W+ F   PV VN  +V+EFY+NI + +   V+VRG+ + ++P+AIN  + LQ  
Subjt:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQ--

Query:  NFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEIS
        +  + ++ + V    +E     + ++ + G RW                                         +R+LL  +IL   +ID+GKI+V    
Subjt:  NFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEIS

Query:  GCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQE------------VRQGGLVYDINMILEQLALSPNRQEFAE--RQALT
         C K++   L FPN IT LC+   V E+    IL     ++   +  L   +E            V     V   +  LEQ A+    Q   +   + + 
Subjt:  GCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQE------------VRQGGLVYDINMILEQLALSPNRQEFAE--RQALT

Query:  FWNYVKNRDANLKKALQENFSK
        ++ Y K RDA L  AL E+  +
Subjt:  FWNYVKNRDANLKKALQENFSK

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.4e-1725.37Show/hide
Query:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY
        + + +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVN  +V+EFYANI K +   + VRG ++ ++  AIN  +
Subjt:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY

Query:  NLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIV
        +LQ     HA + E      + +    + ++  E   W                                          R+LL  +++ S  IDVG+I+
Subjt:  NLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIV

Query:  VNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFAERQAL--
        V ++  C  KK   L FPN IT LC+   V E+    IL     I    L  L  ++  +    V+       + N  +  LAL     Q  A+  AL  
Subjt:  VNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFAERQAL--

Query:  ---TFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL
            F+ YVK+RD  ++   QE          GFP ++L
Subjt:  ---TFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.3e-1829.72Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA P      +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   +   L   +  V + GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LC++A  P       L + G ID   +AR+
Subjt:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.9e-2828.53Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA P      +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   + + L   +  V   GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----
          C  +K G LFFP+ IT LC++A  P       L + G ID   +AR+         Q+    R          G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----

Query:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL
           +   +Q   FW Y K RD  LKKALQ NF++P P    FP+++L
Subjt:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]7.4e-2229.67Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E +   +  +L   +  V   GA W                             
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------

Query:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL--------------
                    +R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC++A    +E    L + G ID   +AR+              
Subjt:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL--------------

Query:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL
         R           D+   L+ L    ++QE   +Q   FW Y K RD  LKKALQ NF++P P    FP+++L
Subjt:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)6.3e-1929.72Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA P      +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   +   L   +  V + GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LC++A  P       L + G ID   +AR+
Subjt:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)3.3e-2828.53Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA P      +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   + + L   +  V   GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----
          C  +K G LFFP+ IT LC++A  P       L + G ID   +AR+         Q+    R          G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----

Query:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL
           +   +Q   FW Y K RD  LKKALQ NF++P P    FP+++L
Subjt:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL

A0A2P5DXM3 Uncharacterized protein3.6e-2229.67Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E +   +  +L   +  V   GA W                             
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------

Query:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL--------------
                    +R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC++A    +E    L + G ID   +AR+              
Subjt:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARL--------------

Query:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL
         R           D+   L+ L    ++QE   +Q   FW Y K RD  LKKALQ NF++P P    FP+++L
Subjt:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL

A0A6A2ZUE4 Uncharacterized protein2.0e-1725.16Show/hide
Query:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQ--
        FV+ +A+  Y  +  R   FE GF      + +L   +   +  H W+ F   PV VN  +V+EFY+NI + +   V+VRG+ + ++P+AIN  + LQ  
Subjt:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQ--

Query:  NFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEIS
        +  + ++ + V    +E     + ++ + G RW                                         +R+LL  +IL   +ID+GKI+V    
Subjt:  NFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEIS

Query:  GCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQE------------VRQGGLVYDINMILEQLALSPNRQEFAE--RQALT
         C K++   L FPN IT LC+   V E+    IL     ++   +  L   +E            V     V   +  LEQ A+    Q   +   + + 
Subjt:  GCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQE------------VRQGGLVYDINMILEQLALSPNRQEFAE--RQALT

Query:  FWNYVKNRDANLKKALQENFSK
        ++ Y K RDA L  AL E+  +
Subjt:  FWNYVKNRDANLKKALQENFSK

A0A6A3BU96 Uncharacterized protein7.0e-1825.37Show/hide
Query:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY
        + + +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVN  +V+EFYANI K +   + VRG ++ ++  AIN  +
Subjt:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY

Query:  NLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIV
        +LQ     HA + E      + +    + ++  E   W                                          R+LL  +++ S  IDVG+I+
Subjt:  NLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIV

Query:  VNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFAERQAL--
        V ++  C  KK   L FPN IT LC+   V E+    IL     I    L  L  ++  +    V+       + N  +  LAL     Q  A+  AL  
Subjt:  VNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFAERQAL--

Query:  ---TFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL
            F+ YVK+RD  ++   QE          GFP ++L
Subjt:  ---TFWNYVKNRDANLKKALQENFSKPYPALLGFPKDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGATATCGATTACCTCTGAGGTGCAAAAGGTAAAGGCGAAGAAGAAAAAGACACCGGAGGAGAA
AGAAGCAAAGAGAAGAATACGACAACAGAGGGTTGAGGAACAAGAAAAGGCAACAGAGGATGATGCTGTCACAGTAGAAGAAGGAGACCCAAAAGAACCTGAAAATCAGA
ACCCTGAACGGACTAACCCGGCAGTTGAGGATACGCAAGAAATTCAAGAAAAGCAGGCTGAGGAGGTGCGAGAACATGAAGAGGTTGCACTGGAAGAAGGAAACGAGCCA
GTTCAAGAAGCTCGTGTTGAGGTCATCATGCCTGAAGTACCGAAGCGATGCCGCATAAAGCGAAAGGCTGTGCGCGTCCAGAAGGCAGAGCGAGAGGAACGAGAGAAAAA
AGAAGCTGAGGAAAAAATAAGAGAAGAAGCAGAGAAGAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCAGAAAAGGGCAAAAGTGTTGCTGAAGCATCAGAAGAAC
CTGATGAGATAGATGAGCCACAGTTGCCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGTTGCTGAAAAGAGACTTCCTGTTTGAGAGAGGATTC
AGCGGTGATCTTCCGCATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCAAAGCCTGTGTCTGTGAACACAAAGGTGGTGCGCGAATTTTATGC
TAATATTGACAAAGAAGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAGGTTGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATTTCCCCCACGCAG
CGTATAATGAGATGGTTGTGGTGCCATCTAATGAGCAGCTGAGTGACGCTGTGCGGGAAGTGGGAATTGAAGGGGCACGGTGGGAACGAGTTCTTCTGGCTTTCGCGATT
TTGCGGTCTCTCAGCATCGATGTAGGGAAGATTGTTGTGAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCAAATACTATTACCATGCTTTG
CAAGAGTGCAGGGGTTCCAGAGGATGAAGGAGGTGTTATTTTGTTTGACAAGGGGATCATCGACACGCCTAACTTGGCACGACTTCAGCGTATGCAAGAGGTGCGTCAGG
GTGGGCTTGTCTACGACATCAACATGATTTTAGAACAACTAGCACTTTCGCCCAATAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCTGGAACTATGTTAAAAAT
CGTGATGCCAATCTGAAGAAGGCGCTTCAAGAAAATTTTTCCAAGCCATATCCAGCCCTTCTTGGATTCCCTAAGGATTTATTGAACCCCTGGATACCACCCCCACCGGT
TGAAAGAGGAGAAGAGGATGATGAAAATGAGCTGGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGATATCGATTACCTCTGAGGTGCAAAAGGTAAAGGCGAAGAAGAAAAAGACACCGGAGGAGAA
AGAAGCAAAGAGAAGAATACGACAACAGAGGGTTGAGGAACAAGAAAAGGCAACAGAGGATGATGCTGTCACAGTAGAAGAAGGAGACCCAAAAGAACCTGAAAATCAGA
ACCCTGAACGGACTAACCCGGCAGTTGAGGATACGCAAGAAATTCAAGAAAAGCAGGCTGAGGAGGTGCGAGAACATGAAGAGGTTGCACTGGAAGAAGGAAACGAGCCA
GTTCAAGAAGCTCGTGTTGAGGTCATCATGCCTGAAGTACCGAAGCGATGCCGCATAAAGCGAAAGGCTGTGCGCGTCCAGAAGGCAGAGCGAGAGGAACGAGAGAAAAA
AGAAGCTGAGGAAAAAATAAGAGAAGAAGCAGAGAAGAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCAGAAAAGGGCAAAAGTGTTGCTGAAGCATCAGAAGAAC
CTGATGAGATAGATGAGCCACAGTTGCCGTATGATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGTTGCTGAAAAGAGACTTCCTGTTTGAGAGAGGATTC
AGCGGTGATCTTCCGCATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCAAAGCCTGTGTCTGTGAACACAAAGGTGGTGCGCGAATTTTATGC
TAATATTGACAAAGAAGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAGGTTGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATTTCCCCCACGCAG
CGTATAATGAGATGGTTGTGGTGCCATCTAATGAGCAGCTGAGTGACGCTGTGCGGGAAGTGGGAATTGAAGGGGCACGGTGGGAACGAGTTCTTCTGGCTTTCGCGATT
TTGCGGTCTCTCAGCATCGATGTAGGGAAGATTGTTGTGAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCAAATACTATTACCATGCTTTG
CAAGAGTGCAGGGGTTCCAGAGGATGAAGGAGGTGTTATTTTGTTTGACAAGGGGATCATCGACACGCCTAACTTGGCACGACTTCAGCGTATGCAAGAGGTGCGTCAGG
GTGGGCTTGTCTACGACATCAACATGATTTTAGAACAACTAGCACTTTCGCCCAATAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCTGGAACTATGTTAAAAAT
CGTGATGCCAATCTGAAGAAGGCGCTTCAAGAAAATTTTTCCAAGCCATATCCAGCCCTTCTTGGATTCCCTAAGGATTTATTGAACCCCTGGATACCACCCCCACCGGT
TGAAAGAGGAGAAGAGGATGATGAAAATGAGCTGGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEISITSEVQKVKAKKKKTPEEKEAKRRIRQQRVEEQEKATEDDAVTVEEGDPKEPENQNPERTNPAVEDTQEIQEKQAEEVREHEEVALEEGNEP
VQEARVEVIMPEVPKRCRIKRKAVRVQKAEREEREKKEAEEKIREEAEKKAEEERLLKRRAEKGKSVAEASEEPDEIDEPQLPYDRFVNNSARAKYAELLKRDFLFERGF
SGDLPHFLRTGIADHGWELFCAKPVSVNTKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARWERVLLAFAI
LRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKSAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKN
RDANLKKALQENFSKPYPALLGFPKDLLNPWIPPPPVERGEEDDENELGQED