; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034315 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034315
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNase H domain-containing protein
Genome locationscaffold12:21437354..21440943
RNA-Seq ExpressionSpg034315
SyntenySpg034315
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG71894.1 hypothetical protein EZV62_000473 [Acer yangbiense]4.8e-1726.7Show/hide
Query:  YKIQEGLKLMANNYVSSLIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLA------------QANCPLQETSQSD
        +K+    +L   + VS+LIE    W E LI+ +F PE A+ IL IPL     +D  +W ++K GTF+VKSAYR+A             A CPL      +
Subjt:  YKIQEGLKLMANNYVSSLIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLA------------QANCPLQETSQSD

Query:  TSKTVDW---SPMDYWN---------------------WLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKN---LIARKP
        T +   W   S    WN                     W+    + E++   +   W +WN RN         +S++        ++ TK    L   K 
Subjt:  TSKTVDW---SPMDYWN---------------------WLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKN---LIARKP

Query:  VRLRSQMSQVP------WNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADS
        +  ++  S +P      W PP     K+N +A+     N   +G V+ DSSG ++           ST   E K + EG+    ++ +Q    L IE+DS
Subjt:  VRLRSQMSQVP------WNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADS

Query:  LELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR
        L ++   NG      ++ NV+  I  ++         +  R  N  AH VAR
Subjt:  LELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]9.7e-1828.64Show/hide
Query:  VDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPES--IYSSIEAILEENETKNLIARKP--------VRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN
        +D   +E+  + ++I W IW  RNKS    + PE+  I  +I+  +  +  +N   +          +R     +   W PP  N WKLN+NA+W   +N
Subjt:  VDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPES--IYSSIEAILEENETKNLIARKP--------VRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN

Query:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC
        +GG+GW++ D  G +I    +  +   + + LE  A+ EG+  IR    ++   + +E+DSLE I+ ++   ++ TE++ ++E I  ++  + +   RH 
Subjt:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC

Query:  VRGANAKAHCVAR
         R AN  AH +AR
Subjt:  VRGANAKAHCVAR

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia]5.7e-1831.92Show/hide
Query:  MDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEE---NETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN
        MDY++W+  +  +     G++++WSIW +RN+     ++ +     I A  E      T NL     +  ++    V W PP  + WKLN +A+W+D  +
Subjt:  MDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEE---NETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN

Query:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC
        +GGLGW+V DS G  I   M +C +  S S                  ++  I +E+E+D LE++N IN  S  LTE+  +VE I   + SL +  F+H 
Subjt:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC

Query:  VRGANAKAHCVAR
           AN  AH +AR
Subjt:  VRGANAKAHCVAR

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]3.7e-1727.43Show/hide
Query:  INSKDKII----WAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKS--RALKLKPESIYSS
        +  K KI+    W +DK      +      +ANC   E          +W+  +YW WL+D   +E+  + ++I   IW  RNKS  + +  +   I  +
Subjt:  INSKDKII----WAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKS--RALKLKPESIYSS

Query:  IE--AILEENETKNLIARK----PVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINC
        I+   I    +  NL  +     P+R     ++  W PP  N WKLN++A+W   +N+ G+GW++ D  G +I  G +  +   + + LE  A+ EG+  
Subjt:  IE--AILEENETKNLIARK----PVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINC

Query:  IRNTCIQYNIGLEIEADSLELINAIN
        IR    ++   + +E+DSLE I+ ++
Subjt:  IRNTCIQYNIGLEIEADSLELINAIN

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber]2.2e-1726.51Show/hide
Query:  LIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGS--KEDLSK
        + +E   WKE++I+  F P   + IL IPL     +D++IWA    G FSV+SAYR+A      +    + ++  +       W+  V + S  +E +  
Subjt:  LIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGS--KEDLSK

Query:  GVLIMWSIWNFRNKSR--ALKLKPESIYSSIEAILEENETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVG
         V + W+ W  RN+ R  A K  PE+I   +   L E       A + V    +   V WNPP P+  K+N + +     N  G+G VV D  G ++   
Subjt:  GVLIMWSIWNFRNKSR--ALKLKPESIYSSIEAILEENETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVG

Query:  MKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVARHVAS
         ++         +E KA   G+   ++   Q    + +E DSL ++ A+ GIS + + + +++  +             H  R  N  AH +A++  S
Subjt:  MKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVARHVAS

TrEMBL top hitse value%identityAlignment
A0A5C7IST2 RNase H domain-containing protein2.3e-1726.7Show/hide
Query:  YKIQEGLKLMANNYVSSLIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLA------------QANCPLQETSQSD
        +K+    +L   + VS+LIE    W E LI+ +F PE A+ IL IPL     +D  +W ++K GTF+VKSAYR+A             A CPL      +
Subjt:  YKIQEGLKLMANNYVSSLIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLA------------QANCPLQETSQSD

Query:  TSKTVDW---SPMDYWN---------------------WLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKN---LIARKP
        T +   W   S    WN                     W+    + E++   +   W +WN RN         +S++        ++ TK    L   K 
Subjt:  TSKTVDW---SPMDYWN---------------------WLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKN---LIARKP

Query:  VRLRSQMSQVP------WNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADS
        +  ++  S +P      W PP     K+N +A+     N   +G V+ DSSG ++           ST   E K + EG+    ++ +Q    L IE+DS
Subjt:  VRLRSQMSQVP------WNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADS

Query:  LELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR
        L ++   NG      ++ NV+  I  ++         +  R  N  AH VAR
Subjt:  LELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR

A0A6J1CP26 uncharacterized protein LOC1110134124.7e-1828.64Show/hide
Query:  VDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPES--IYSSIEAILEENETKNLIARKP--------VRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN
        +D   +E+  + ++I W IW  RNKS    + PE+  I  +I+  +  +  +N   +          +R     +   W PP  N WKLN+NA+W   +N
Subjt:  VDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPES--IYSSIEAILEENETKNLIARKP--------VRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN

Query:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC
        +GG+GW++ D  G +I    +  +   + + LE  A+ EG+  IR    ++   + +E+DSLE I+ ++   ++ TE++ ++E I  ++  + +   RH 
Subjt:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC

Query:  VRGANAKAHCVAR
         R AN  AH +AR
Subjt:  VRGANAKAHCVAR

A0A6J1D4B6 uncharacterized protein LOC1110171812.8e-1831.92Show/hide
Query:  MDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEE---NETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN
        MDY++W+  +  +     G++++WSIW +RN+     ++ +     I A  E      T NL     +  ++    V W PP  + WKLN +A+W+D  +
Subjt:  MDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEE---NETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSN

Query:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC
        +GGLGW+V DS G  I   M +C +  S S                  ++  I +E+E+D LE++N IN  S  LTE+  +VE I   + SL +  F+H 
Subjt:  SGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHC

Query:  VRGANAKAHCVAR
           AN  AH +AR
Subjt:  VRGANAKAHCVAR

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X21.8e-1727.43Show/hide
Query:  INSKDKII----WAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKS--RALKLKPESIYSS
        +  K KI+    W +DK      +      +ANC   E          +W+  +YW WL+D   +E+  + ++I   IW  RNKS  + +  +   I  +
Subjt:  INSKDKII----WAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKS--RALKLKPESIYSS

Query:  IE--AILEENETKNLIARK----PVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINC
        I+   I    +  NL  +     P+R     ++  W PP  N WKLN++A+W   +N+ G+GW++ D  G +I  G +  +   + + LE  A+ EG+  
Subjt:  IE--AILEENETKNLIARK----PVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINC

Query:  IRNTCIQYNIGLEIEADSLELINAIN
        IR    ++   + +E+DSLE I+ ++
Subjt:  IRNTCIQYNIGLEIEADSLELINAIN

A0A803P9C5 Uncharacterized protein3.1e-1727.08Show/hide
Query:  NNYVSSLIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVD-----WSPM--------
        +N V+  I  N  W  +L+   FSP   + IL IPL  + + D+ IW YD  G +SV + Y  A +   L+E   S  S T +     W  +        
Subjt:  NNYVSSLIEENLSWKEDLIKASFSPEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVD-----WSPM--------

Query:  ---DYW---NWLVDNG------------------SKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKNLIARKPV-RLR-SQMSQVP
           D W    +++D                    SK+DL   + +MW IW+ RN             S I A              PV  LR S +    
Subjt:  ---DYW---NWLVDNG------------------SKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKNLIARKPV-RLR-SQMSQVP

Query:  WNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLE-IEADSLELINAINGISENLTE
        W PP  N +KLN +A+     +  G+G +V +S+G ++    K    N+ +  +E KAM  G++  +    QY I ++ +E D L L+NA+NG   + + 
Subjt:  WNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLE-IEADSLELINAINGISENLTE

Query:  LLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR
           +V  ++  + S +     H  R AN  AH +A+
Subjt:  LLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.6e-0524.64Show/hide
Query:  GVLIMWSIWNFRNKSRALKLK------PESIYSSIEAILEENETKNLIARKPVRLRSQMSQVPWNPPLPNRW-KLNSNASWVDRSNSGGLGWVVHDSSGS
        G L+ W +W        L  K      PE +  ++E   E +  + L  +       +   V W  P P +W K N++A+W   +   G+GW++ + SG 
Subjt:  GVLIMWSIWNFRNKSRALKLK------PESIYSSIEAILEENETKNLIARKPVRLRSQMSQVPWNPPLPNRW-KLNSNASWVDRSNSGGLGWVVHDSSGS

Query:  LICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNI-GLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR
        ++ +G +   R  + + LE  A LE +     T  ++N   +  E+D+  L+N +N   +    L   +E I  ++      +F    RG N  A  +AR
Subjt:  LICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNI-GLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR

Query:  HVASPSS
           S S+
Subjt:  HVASPSS

AT4G29090.1 Ribonuclease H-like superfamily protein2.4e-0625.34Show/hide
Query:  YWNWLVDNGSKEDLSKGVLI---MWSIWNFRNK--SRALKLKPESIYSSIEAILEE---NETKNLIARKPVRLRSQMSQVPWNPPLPNRW-KLNSNASWV
        YW + + NG+ +      L+   +W +W  RN+   R  +   + +    E  LEE            KP   RS   +  W PP P++W K N++A+W 
Subjt:  YWNWLVDNGSKEDLSKGVLI---MWSIWNFRNK--SRALKLKPESIYSSIEAILEE---NETKNLIARKPVRLRSQMSQVPWNPPLPNRW-KLNSNASWV

Query:  DRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAE
          +   G+GWV+ +  G +  +G +   +  S    E +AM   +  +     QYN  +  E+DS  LI  +N   E    L   ++ +  ++      +
Subjt:  DRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAE

Query:  FRHCVRGANAKAHCVARHVAS
        F    R  N  A  VAR   S
Subjt:  FRHCVRGANAKAHCVARHVAS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.7e-0525.13Show/hide
Query:  IMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKNLI-------ARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLIC
        +MW IW   N             +++E  L  N+TK  +        +   R         W+PP  ++ K N +AS  +R+   GLGW++ +S G++I 
Subjt:  IMWSIWNFRNKSRALKLKPESIYSSIEAILEENETKNLI-------ARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLIC

Query:  VGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIG---LEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR
         GM + Q   +T   E   ++  I         Y  G   +  E D+  +   IN  S N   L + ++ I S + S    EF    R  N  A  +A+
Subjt:  VGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIG---LEIEADSLELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGATAAGGCGGTGAAAGCTAGAGTCGAGCACTTAAACTTCCACCAGTCAGACCATAGGCCTATTATCATCAAAGTAAGCTGGAAAGTCCCTATTAATCTTTCCAG
ACCCAGCAATCGTTCAGTCAAATTCGAAGAAAGCTGGAAATCTTTCGAAGAAAGCAAAGACATTATCAAAAATTTCTGGAATACAGAGCTTAACATCTCGAGGTTTAATG
TTAACTACAAAATTCAAGAGGGCCTGAAGCTTATGGCAAATAATTATGTTAGCAGCCTTATTGAGGAGAATTTGTCCTGGAAAGAAGATCTTATCAAGGCTAGCTTCTCC
CCTGAAGCTGCTGACGACATCTTGAAGATCCCTTTGGGCGATATTAATTCAAAGGACAAGATTATCTGGGCTTATGACAAAAAAGGAACCTTCTCTGTGAAGAGTGCCTA
TAGGCTAGCCCAAGCCAATTGCCCTTTACAAGAGACTTCTCAATCTGACACTTCCAAGACGGTTGATTGGTCTCCCATGGATTACTGGAATTGGTTGGTGGACAACGGTA
GCAAAGAAGACCTTTCAAAAGGAGTTCTCATTATGTGGTCAATCTGGAATTTTAGAAACAAATCGAGGGCACTCAAGCTAAAACCAGAATCAATCTACTCATCTATTGAA
GCTATTTTAGAAGAAAATGAGACAAAGAACCTGATTGCTCGGAAGCCCGTCAGGCTGAGGAGCCAAATGAGTCAAGTTCCCTGGAATCCTCCCCTCCCTAACCGCTGGAA
GCTAAATTCGAACGCTTCCTGGGTTGATCGTTCCAACTCTGGAGGCCTGGGCTGGGTGGTGCATGACTCAAGTGGATCCCTAATCTGTGTAGGCATGAAGCAGTGTCAAA
GAAATTGGAGTACCAGTGCCCTTGAAGGAAAAGCTATGCTCGAAGGGATCAACTGCATCAGAAATACCTGTATTCAATACAACATTGGCTTGGAAATTGAAGCCGACTCA
CTGGAGCTTATCAACGCCATTAATGGCATTTCCGAGAACCTCACCGAATTGTTGAATGTGGTGGAGGCAATCTCTTCAGTGGTTTTGTCGCTTGCAGTGGCTGAATTTCG
TCACTGTGTAAGGGGAGCCAACGCAAAAGCCCACTGCGTTGCTCGTCACGTAGCTTCCCCGTCTTCTGTTTCTAATTCAGTTTCTTCTTCTTTTGTTTCCCAGAGGTGTT
CCTCTTCGTGGGAACAGGGCTGTATTTTTTGGGCCCCTGGTTTTCCTTCGTGGATTCTTTCCCTCCTTAATGAGGGTGGTTGTTTTGTTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGATAAGGCGGTGAAAGCTAGAGTCGAGCACTTAAACTTCCACCAGTCAGACCATAGGCCTATTATCATCAAAGTAAGCTGGAAAGTCCCTATTAATCTTTCCAG
ACCCAGCAATCGTTCAGTCAAATTCGAAGAAAGCTGGAAATCTTTCGAAGAAAGCAAAGACATTATCAAAAATTTCTGGAATACAGAGCTTAACATCTCGAGGTTTAATG
TTAACTACAAAATTCAAGAGGGCCTGAAGCTTATGGCAAATAATTATGTTAGCAGCCTTATTGAGGAGAATTTGTCCTGGAAAGAAGATCTTATCAAGGCTAGCTTCTCC
CCTGAAGCTGCTGACGACATCTTGAAGATCCCTTTGGGCGATATTAATTCAAAGGACAAGATTATCTGGGCTTATGACAAAAAAGGAACCTTCTCTGTGAAGAGTGCCTA
TAGGCTAGCCCAAGCCAATTGCCCTTTACAAGAGACTTCTCAATCTGACACTTCCAAGACGGTTGATTGGTCTCCCATGGATTACTGGAATTGGTTGGTGGACAACGGTA
GCAAAGAAGACCTTTCAAAAGGAGTTCTCATTATGTGGTCAATCTGGAATTTTAGAAACAAATCGAGGGCACTCAAGCTAAAACCAGAATCAATCTACTCATCTATTGAA
GCTATTTTAGAAGAAAATGAGACAAAGAACCTGATTGCTCGGAAGCCCGTCAGGCTGAGGAGCCAAATGAGTCAAGTTCCCTGGAATCCTCCCCTCCCTAACCGCTGGAA
GCTAAATTCGAACGCTTCCTGGGTTGATCGTTCCAACTCTGGAGGCCTGGGCTGGGTGGTGCATGACTCAAGTGGATCCCTAATCTGTGTAGGCATGAAGCAGTGTCAAA
GAAATTGGAGTACCAGTGCCCTTGAAGGAAAAGCTATGCTCGAAGGGATCAACTGCATCAGAAATACCTGTATTCAATACAACATTGGCTTGGAAATTGAAGCCGACTCA
CTGGAGCTTATCAACGCCATTAATGGCATTTCCGAGAACCTCACCGAATTGTTGAATGTGGTGGAGGCAATCTCTTCAGTGGTTTTGTCGCTTGCAGTGGCTGAATTTCG
TCACTGTGTAAGGGGAGCCAACGCAAAAGCCCACTGCGTTGCTCGTCACGTAGCTTCCCCGTCTTCTGTTTCTAATTCAGTTTCTTCTTCTTTTGTTTCCCAGAGGTGTT
CCTCTTCGTGGGAACAGGGCTGTATTTTTTGGGCCCCTGGTTTTCCTTCGTGGATTCTTTCCCTCCTTAATGAGGGTGGTTGTTTTGTTGGTTAG
Protein sequenceShow/hide protein sequence
MKDKAVKARVEHLNFHQSDHRPIIIKVSWKVPINLSRPSNRSVKFEESWKSFEESKDIIKNFWNTELNISRFNVNYKIQEGLKLMANNYVSSLIEENLSWKEDLIKASFS
PEAADDILKIPLGDINSKDKIIWAYDKKGTFSVKSAYRLAQANCPLQETSQSDTSKTVDWSPMDYWNWLVDNGSKEDLSKGVLIMWSIWNFRNKSRALKLKPESIYSSIE
AILEENETKNLIARKPVRLRSQMSQVPWNPPLPNRWKLNSNASWVDRSNSGGLGWVVHDSSGSLICVGMKQCQRNWSTSALEGKAMLEGINCIRNTCIQYNIGLEIEADS
LELINAINGISENLTELLNVVEAISSVVLSLAVAEFRHCVRGANAKAHCVARHVASPSSVSNSVSSSFVSQRCSSSWEQGCIFWAPGFPSWILSLLNEGGCFVG