CuGenDBv2

Spg027911 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg027911
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RT_RNaseH_2 domain-containing protein
Genome location: scaffold2:37605469..37611944
RNA-Seq Expression: Spg027911
Synteny: Spg027911
Gene Ontology terms: NA
InterPro domains: NA
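
The genome location field uses a `scaffold:start..end` layout. The following is a minimal Python sketch for splitting it into parts and computing the span. It assumes the coordinates are 1-based and inclusive, which the page does not state; the names `GenomeLocation` and `parse_location` are illustrative and not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'scaffold:start..end' location (hypothetical helper type)."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


# Scaffold name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<scaffold>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold2:37605469..37611944'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["scaffold"], int(match["start"]), int(match["end"]))


if __name__ == "__main__":
    loc = parse_location("scaffold2:37605469..37611944")
    print(loc.scaffold, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the location listed above spans 6476 bases on scaffold2.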


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus] | e-value: 1.7e-24 | identity: 28.3%
Query:  FINELARAKYQEMLKRDFLFERGFGDDLPHFLRVG------ITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQ--
        F++E A+  YQ +  R   FE GF         +G      +T H W +F   P PVN+ I++EFY+N+    +   +VRG+++ ++P AIN  F LQ  
Subjt:  FINELARAKYQEMLKRDFLFERGFGDDLPHFLRVG------ITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQ--

Query:  DFPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCW
        D  +  F + V   +   +   + L G                        W  F+K +L+PT+H++TVS  R+LL+ +IL   +ID+GKII      C 
Subjt:  DFPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCW

Query:  RKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL-------QRTQEACQGGLVCGIHQILEQLALSASRQEFAKRQAQ------TYWTY
        +++   L FPN IT LC +  V     D IL     ++   +  L        +  EA    +    H       L  + Q   +   Q       Y+ Y
Subjt:  RKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL-------QRTQEACQGGLVCGIHQILEQLALSASRQEFAKRQAQ------TYWTY

Query:  AKRRDDTLRRALQSNFSK
        AKRRD  L  AL  +  +
Subjt:  AKRRDDTLRRALQSNFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e-value: 1.3e-29 | identity: 37.4%
Query:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y+  ++ R    E+GF  D       LP   +V IT H W QFCA P+     ++REFYAN+ +  E    VRGV V WS  AIN++F 
Subjt:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN

Query:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI
        L D    H+ F E +           VA +  + N            AL      W  F+K  LLPTTH  TVS+DR+LL+ ++L   SI+VG++I +EI
Subjt:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LC  A  P +  +  L   G ID   +AR+  TQE
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e-value: 1.4e-39 | identity: 34.68%
Query:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y+  ++ R    E+GF  D       LP   +V IT H W QFCA P+     ++REFYAN+ + EE    VRGV V WS  AIN++F 
Subjt:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN

Query:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI
        L D    H+ F + +           VA +  + N            AL      W  F+K RLLPTTH  TVS+DR+LL+ ++L   SI+VG++I +EI
Subjt:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL--------------QRTQEACQGGLVCGIHQILEQLALSASRQEFAK----
          C  +K G LFFP+ IT LC  A  P +  +  L   G ID   +AR+               R   A        I Q L+ L    S+QE  +    
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL--------------QRTQEACQGGLVCGIHQILEQLALSASRQEFAK----

Query:  -------RQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
               +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  -------RQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] | e-value: 1.7e-24 | identity: 31.86%
Query:  IRFINELARAKYQE-------MLKRDFLFERGFGDDLPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNL
        ++F ++ A  +Y+E        ++++F+++     + P F+   I  H W  FCA P+     ++REFY N+ N ++    +RGV V  S  AIN++F+L
Subjt:  IRFINELARAKYQE-------MLKRDFLFERGFGDDLPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNL

Query:  QD--FPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIF
         D    H+ F E +  P    +   VA+ G                        W  F+K RLLPTTH  TVS++ V L++++L   SI+VG++I  EI 
Subjt:  QD--FPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIF

Query:  NCWRKKVGKLFFPNTITMLCSRAGVP
         C  +K G LFFP+ IT +C     P
Subjt:  NCWRKKVGKLFFPNTITMLCSRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e-value: 1.3e-32 | identity: 36.53%
Query:  IIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAVVALRG--------------------LTNTWLGFIKLRL
        ++REFYAN+ + EE    VRGV V WS  AIN++F L D    H+ F E +  P    +   VA  G                        W  F+K RL
Subjt:  IIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAVVALRG--------------------LTNTWLGFIKLRL

Query:  LPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL------QRTQE----
        LPTTH   VS+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L   G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL------QRTQE----

Query:  ---ACQGGLVCG-IHQILEQLALSASRQEFAKRQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
           A       G + Q L+ L    S+QE   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  ---ACQGGLVCG-IHQILEQLALSASRQEFAKRQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

TrEMBL top hits (e-value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) | e-value: 6.5e-30 | identity: 37.4%
Query:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y+  ++ R    E+GF  D       LP   +V IT H W QFCA P+     ++REFYAN+ +  E    VRGV V WS  AIN++F 
Subjt:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN

Query:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI
        L D    H+ F E +           VA +  + N            AL      W  F+K  LLPTTH  TVS+DR+LL+ ++L   SI+VG++I +EI
Subjt:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LC  A  P +  +  L   G ID   +AR+  TQE
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment) | e-value: 6.9e-40 | identity: 34.68%
Query:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y+  ++ R    E+GF  D       LP   +V IT H W QFCA P+     ++REFYAN+ + EE    VRGV V WS  AIN++F 
Subjt:  IRFINELARAKYQEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFN

Query:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI
        L D    H+ F + +           VA +  + N            AL      W  F+K RLLPTTH  TVS+DR+LL+ ++L   SI+VG++I +EI
Subjt:  LQD--FPHAGFNEMV-----------VAPSSDQLNAVV---------ALRGLTNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL--------------QRTQEACQGGLVCGIHQILEQLALSASRQEFAK----
          C  +K G LFFP+ IT LC  A  P +  +  L   G ID   +AR+               R   A        I Q L+ L    S+QE  +    
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL--------------QRTQEACQGGLVCGIHQILEQLALSASRQEFAK----

Query:  -------RQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
               +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  -------RQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

A0A2P5DAQ2 Uncharacterized protein | e-value: 8.2e-25 | identity: 31.86%
Query:  IRFINELARAKYQE-------MLKRDFLFERGFGDDLPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNL
        ++F ++ A  +Y+E        ++++F+++     + P F+   I  H W  FCA P+     ++REFY N+ N ++    +RGV V  S  AIN++F+L
Subjt:  IRFINELARAKYQE-------MLKRDFLFERGFGDDLPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNL

Query:  QD--FPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIF
         D    H+ F E +  P    +   VA+ G                        W  F+K RLLPTTH  TVS++ V L++++L   SI+VG++I  EI 
Subjt:  QD--FPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIF

Query:  NCWRKKVGKLFFPNTITMLCSRAGVP
         C  +K G LFFP+ IT +C     P
Subjt:  NCWRKKVGKLFFPNTITMLCSRAGVP

A0A2P5DXM3 Uncharacterized protein | e-value: 6.3e-33 | identity: 36.53%
Query:  IIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAVVALRG--------------------LTNTWLGFIKLRL
        ++REFYAN+ + EE    VRGV V WS  AIN++F L D    H+ F E +  P    +   VA  G                        W  F+K RL
Subjt:  IIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAVVALRG--------------------LTNTWLGFIKLRL

Query:  LPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL------QRTQE----
        LPTTH   VS+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L   G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL------QRTQE----

Query:  ---ACQGGLVCG-IHQILEQLALSASRQEFAKRQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
           A       G + Q L+ L    S+QE   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  ---ACQGGLVCG-IHQILEQLALSASRQEFAKRQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

A0A6A2ZUE4 Uncharacterized protein | e-value: 8.2e-25 | identity: 28.3%
Query:  FINELARAKYQEMLKRDFLFERGFGDDLPHFLRVG------ITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQ--
        F++E A+  YQ +  R   FE GF         +G      +T H W +F   P PVN+ I++EFY+N+    +   +VRG+++ ++P AIN  F LQ  
Subjt:  FINELARAKYQEMLKRDFLFERGFGDDLPHFLRVG------ITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQ--

Query:  DFPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCW
        D  +  F + V   +   +   + L G                        W  F+K +L+PT+H++TVS  R+LL+ +IL   +ID+GKII      C 
Subjt:  DFPHAGFNEMVVAPSSDQLNAVVALRGL--------------------TNTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCW

Query:  RKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL-------QRTQEACQGGLVCGIHQILEQLALSASRQEFAKRQAQ------TYWTY
        +++   L FPN IT LC +  V     D IL     ++   +  L        +  EA    +    H       L  + Q   +   Q       Y+ Y
Subjt:  RKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARL-------QRTQEACQGGLVCGIHQILEQLALSASRQEFAKRQAQ------TYWTY

Query:  AKRRDDTLRRALQSNFSK
        AKRRD  L  AL  +  +
Subjt:  AKRRDDTLRRALQSNFSK

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCGAAAACAAGAGCAAGAAAGGAGCGAGATAATGAAGAAGAAGAGGTACCAGTTACCCCTAAGGCACAGAAAGTGAAAACGAAGAAAAAGAAAACGCCGGAGGAAAA
AGAAGCCAAACGAAGGAGAAGGCAGCAGAGGGCTGAGGACCAAGAAGCTGTCCAGAAGGCGGCGGAAGATGTTGCTGCTACGATAATTGAAGAAGGAAATCAGAAGGAAC
CAGAGGGACAGAACACTAAGCTGAGTGACCCAGTAGTTGCAGATACAGAGGGAGTTCAAGAAGAACAAACAGAGGAAGTTCAAGAAAAACAGGCCGAAGATACGCAAGAA
GGTAGGACAGAGGATGTTCAGGAAACAGTACCAAAACATCGCCGTGTGAAGCGAAAAGCTAGACGCGTCAAGGTAGTCCGAACTGATACCCCATCACCACCATCGACGGA
TTCTGAGAAAGAGAATGCAAAGAGAGAGGAACGGGAGAAAAAGGAGGCTGAGGACAGAGAGAGAGAAGAAGCAGGAAAGAAAGCAGCGGAAGAAACTTTGACAAAGCATC
AAGAAGACAGAGGCAAAGGAATTGTTGAAGCATCGGATGAACCTATAGAAGAAGCAGAAGAAGGACCATTCATCCGCTTCATCAATGAACTTGCCCGAGCAAAATACCAG
GAGATGCTAAAAAGGGATTTCTTATTTGAAAGAGGGTTTGGTGACGATCTGCCACATTTCTTAAGGGTAGGGATCACGAATCATGGTTGGAGTCAGTTTTGTGCGAAACC
AGATCCAGTGAATTCGAACATTATTCGAGAATTTTATGCGAATGTTGATAATGCAGAGGAATTTCAGGCCATAGTCCGAGGAGTGACTGTTGACTGGAGCCCAGGAGCTA
TTAATTCACTATTCAACCTTCAGGACTTCCCACATGCAGGCTTTAATGAGATGGTGGTGGCACCATCGAGTGACCAGTTAAATGCGGTGGTGGCATTGAGGGGGCTCACC
AATACTTGGTTGGGCTTCATCAAGCTGCGTTTGCTTCCAACTACGCATGATTCAACGGTGTCTCGCGACCGAGTGCTTCTGATATTCGCAATTCTTCGATCCTTAAGTAT
TGATGTTGGAAAAATCATTTCGAATGAAATCTTTAATTGCTGGCGCAAAAAGGTGGGGAAGCTATTTTTCCCGAATACGATCACTATGTTATGCAGCAGGGCAGGAGTGC
CCACGGTTCCAGAGGATGTAATTTTGCCTTACAAGGGAATCATAGATACGCCTAATCTGGCGCGGCTTCAGCGAACGCAAGAGGCATGCCAGGGTGGGCTTGTGTGTGGA
ATTCATCAAATCCTAGAGCAACTAGCACTGTCGGCCAGTAGGCAAGAGTTTGCTAAAAGGCAAGCTCAAACCTATTGGACCTATGCTAAAAGAAGAGATGACACACTTCG
GAGGGCCTTGCAGTCCAATTTCTCCAAACCATATCAGGCCTTCCCTATGTTTCCCGATGATTTATTTAACCTTTGGATACCGCCCCCACCTGTCGAAAGAGAAGAAGAGG
ATGATGAAAATGAGCAGGAAACCTTTTGCTTGAGCATTTCTTCTAGCCTGGTCACAGCTGTGGCAAAGAAGATTCTGAGAACTGTTTTGCTGCAGCAGAGCTTGGTTTTG
CAGAATGCTGAGGTAGAGGTTGAAGGTAATGTTGGATTATCTGGTTCGATTAAGCTATATTATAGCGTGTGTCTCTATACACTTTATGAACTAATTCGCTTTAGGCGTTG
GCGTCGAGATGCCAAGGCCAGCGTCTCGACGCCGCCATCATGGCGTCGAGACGCCTACTCTCGAGTTGCTCCGAAAATTGCAATTTCTTGCTTGATCCTTTTGGCTCCGG
AAATGATGTCCACACTAGATAGGTACAGGAGAAGGAGAAAGAAAATGAAAAGGGCAGGATTTGCGATGCAAATGCGGTCGCAAATCTACCTTGTGCAATTTGCGCCGCAT
TTGCAGCACATTCCGTTCAAAACAAGTCGGCGAACCTTTCTGTCACTCACCCATAAAGAAAATCGTGGAAAGATGAAGCCTCCATCCGCTTCTCTACCGAAGAACTCCGC
TTCAGCCCACAAACCATCATCAGATTCTTCTTCTAAGAGGTCAAAAACTCAAGGTACACTTCCCCGCCCATCTACTCGTGCAGGTTTGCTCTCAAAAGCATGGGAAGAAG
ACTTGAGACGGGTTGAAGAGAAAAGTTGTAAAGCCATTGAAGACACAGCTATGGAGGAAGAGGAGGAAGAAGAAGAGGAAACACCCTTTATCTTAAACCAAAAGCTATCT
GAATCAAGGAAAAGGAGGAGGAAGAGAAGAGAGTGGGATCGACCTGAAATTGCGACTGCATATATGGCCAAACTTGAGGAGAGAGTGCCAGTTCTGGAGGCACAAGTCAA
GGATAATGCACCTGTATATGGGGCGCAAGGGGCGCAAGTGCAGCCGCTCGAGCAAAAAGATTGGAATAAAGTTGTGGTGGATACAATATGGCTTGAAAAGGCCAGGGCTG
CCCAAATAAAGGATGAGCTATGGGCGGAAGAAAGAAGAAATAAAAAGGAGGCCAAGAGAAAACAGAGGGATGAAGAAGAACAAGTTGCACTTGATTTTGCAGCTTTGTGT
CTCAAGAACTCAGTGGATGCCACTGAGAAAGAAGAATAA
mRNA sequence
ATGGCGAAAACAAGAGCAAGAAAGGAGCGAGATAATGAAGAAGAAGAGGTACCAGTTACCCCTAAGGCACAGAAAGTGAAAACGAAGAAAAAGAAAACGCCGGAGGAAAA
AGAAGCCAAACGAAGGAGAAGGCAGCAGAGGGCTGAGGACCAAGAAGCTGTCCAGAAGGCGGCGGAAGATGTTGCTGCTACGATAATTGAAGAAGGAAATCAGAAGGAAC
CAGAGGGACAGAACACTAAGCTGAGTGACCCAGTAGTTGCAGATACAGAGGGAGTTCAAGAAGAACAAACAGAGGAAGTTCAAGAAAAACAGGCCGAAGATACGCAAGAA
GGTAGGACAGAGGATGTTCAGGAAACAGTACCAAAACATCGCCGTGTGAAGCGAAAAGCTAGACGCGTCAAGGTAGTCCGAACTGATACCCCATCACCACCATCGACGGA
TTCTGAGAAAGAGAATGCAAAGAGAGAGGAACGGGAGAAAAAGGAGGCTGAGGACAGAGAGAGAGAAGAAGCAGGAAAGAAAGCAGCGGAAGAAACTTTGACAAAGCATC
AAGAAGACAGAGGCAAAGGAATTGTTGAAGCATCGGATGAACCTATAGAAGAAGCAGAAGAAGGACCATTCATCCGCTTCATCAATGAACTTGCCCGAGCAAAATACCAG
GAGATGCTAAAAAGGGATTTCTTATTTGAAAGAGGGTTTGGTGACGATCTGCCACATTTCTTAAGGGTAGGGATCACGAATCATGGTTGGAGTCAGTTTTGTGCGAAACC
AGATCCAGTGAATTCGAACATTATTCGAGAATTTTATGCGAATGTTGATAATGCAGAGGAATTTCAGGCCATAGTCCGAGGAGTGACTGTTGACTGGAGCCCAGGAGCTA
TTAATTCACTATTCAACCTTCAGGACTTCCCACATGCAGGCTTTAATGAGATGGTGGTGGCACCATCGAGTGACCAGTTAAATGCGGTGGTGGCATTGAGGGGGCTCACC
AATACTTGGTTGGGCTTCATCAAGCTGCGTTTGCTTCCAACTACGCATGATTCAACGGTGTCTCGCGACCGAGTGCTTCTGATATTCGCAATTCTTCGATCCTTAAGTAT
TGATGTTGGAAAAATCATTTCGAATGAAATCTTTAATTGCTGGCGCAAAAAGGTGGGGAAGCTATTTTTCCCGAATACGATCACTATGTTATGCAGCAGGGCAGGAGTGC
CCACGGTTCCAGAGGATGTAATTTTGCCTTACAAGGGAATCATAGATACGCCTAATCTGGCGCGGCTTCAGCGAACGCAAGAGGCATGCCAGGGTGGGCTTGTGTGTGGA
ATTCATCAAATCCTAGAGCAACTAGCACTGTCGGCCAGTAGGCAAGAGTTTGCTAAAAGGCAAGCTCAAACCTATTGGACCTATGCTAAAAGAAGAGATGACACACTTCG
GAGGGCCTTGCAGTCCAATTTCTCCAAACCATATCAGGCCTTCCCTATGTTTCCCGATGATTTATTTAACCTTTGGATACCGCCCCCACCTGTCGAAAGAGAAGAAGAGG
ATGATGAAAATGAGCAGGAAACCTTTTGCTTGAGCATTTCTTCTAGCCTGGTCACAGCTGTGGCAAAGAAGATTCTGAGAACTGTTTTGCTGCAGCAGAGCTTGGTTTTG
CAGAATGCTGAGGTAGAGGTTGAAGGTAATGTTGGATTATCTGGTTCGATTAAGCTATATTATAGCGTGTGTCTCTATACACTTTATGAACTAATTCGCTTTAGGCGTTG
GCGTCGAGATGCCAAGGCCAGCGTCTCGACGCCGCCATCATGGCGTCGAGACGCCTACTCTCGAGTTGCTCCGAAAATTGCAATTTCTTGCTTGATCCTTTTGGCTCCGG
AAATGATGTCCACACTAGATAGGTACAGGAGAAGGAGAAAGAAAATGAAAAGGGCAGGATTTGCGATGCAAATGCGGTCGCAAATCTACCTTGTGCAATTTGCGCCGCAT
TTGCAGCACATTCCGTTCAAAACAAGTCGGCGAACCTTTCTGTCACTCACCCATAAAGAAAATCGTGGAAAGATGAAGCCTCCATCCGCTTCTCTACCGAAGAACTCCGC
TTCAGCCCACAAACCATCATCAGATTCTTCTTCTAAGAGGTCAAAAACTCAAGGTACACTTCCCCGCCCATCTACTCGTGCAGGTTTGCTCTCAAAAGCATGGGAAGAAG
ACTTGAGACGGGTTGAAGAGAAAAGTTGTAAAGCCATTGAAGACACAGCTATGGAGGAAGAGGAGGAAGAAGAAGAGGAAACACCCTTTATCTTAAACCAAAAGCTATCT
GAATCAAGGAAAAGGAGGAGGAAGAGAAGAGAGTGGGATCGACCTGAAATTGCGACTGCATATATGGCCAAACTTGAGGAGAGAGTGCCAGTTCTGGAGGCACAAGTCAA
GGATAATGCACCTGTATATGGGGCGCAAGGGGCGCAAGTGCAGCCGCTCGAGCAAAAAGATTGGAATAAAGTTGTGGTGGATACAATATGGCTTGAAAAGGCCAGGGCTG
CCCAAATAAAGGATGAGCTATGGGCGGAAGAAAGAAGAAATAAAAAGGAGGCCAAGAGAAAACAGAGGGATGAAGAAGAACAAGTTGCACTTGATTTTGCAGCTTTGTGT
CTCAAGAACTCAGTGGATGCCACTGAGAAAGAAGAATAA
Protein sequence
MAKTRARKERDNEEEEVPVTPKAQKVKTKKKKTPEEKEAKRRRRQQRAEDQEAVQKAAEDVAATIIEEGNQKEPEGQNTKLSDPVVADTEGVQEEQTEEVQEKQAEDTQE
GRTEDVQETVPKHRRVKRKARRVKVVRTDTPSPPSTDSEKENAKREEREKKEAEDREREEAGKKAAEETLTKHQEDRGKGIVEASDEPIEEAEEGPFIRFINELARAKYQ
EMLKRDFLFERGFGDDLPHFLRVGITNHGWSQFCAKPDPVNSNIIREFYANVDNAEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAVVALRGLT
NTWLGFIKLRLLPTTHDSTVSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILPYKGIIDTPNLARLQRTQEACQGGLVCG
IHQILEQLALSASRQEFAKRQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDLFNLWIPPPPVEREEEDDENEQETFCLSISSSLVTAVAKKILRTVLLQQSLVL
QNAEVEVEGNVGLSGSIKLYYSVCLYTLYELIRFRRWRRDAKASVSTPPSWRRDAYSRVAPKIAISCLILLAPEMMSTLDRYRRRRKKMKRAGFAMQMRSQIYLVQFAPH
LQHIPFKTSRRTFLSLTHKENRGKMKPPSASLPKNSASAHKPSSDSSSKRSKTQGTLPRPSTRAGLLSKAWEEDLRRVEEKSCKAIEDTAMEEEEEEEEETPFILNQKLS
ESRKRRRKRREWDRPEIATAYMAKLEERVPVLEAQVKDNAPVYGAQGAQVQPLEQKDWNKVVVDTIWLEKARAAQIKDELWAEERRNKKEAKRKQRDEEEQVALDFAALC
LKNSVDATEKEE
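
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard genetic code applies and that the CDS is in frame from its first base; the function name `translate` is illustrative and not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon)."""
    # Remove line breaks and other whitespace left over from the page layout.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing characters outside ACGT (for example N) become 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First 24 bases of the CDS listed above.
    print(translate("ATGGCGAAAACAAGAGCAAGAAAG"))
```

For the first 24 bases of the CDS this gives `MAKTRARK`, which matches the first eight residues of the protein sequence. Translating the full CDS and stripping the trailing `*` should allow the same comparison over the whole record.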