CuGenDBv2

Spg006973 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg006973
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold10:39170671..39174819
RNA-Seq Expression: Spg006973
Synteny: Spg006973
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR005135 - Endonuclease/exonuclease/phosphatase
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
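
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, the span of the gene on its scaffold could be derived as shown here. The helper name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold10:39170671..39174819")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(scaffold, start, end, span_length)
```

Under that assumption the locus spans 4,149 bp on scaffold10.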


Homology
GenBank top hits (e-value, % identity, alignment)
KAA3453480.1 reverse transcriptase [Gossypium australe]; e-value 4.9e-57; identity 42.07%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGWW--
        MK  CWN RG+G+PRAVR LR+++++H+P LVFL ETK +S   ++++R  GF+N   + +EG+ GGL L W++ I +T+ S S+ HID  +K  G    
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGWW--

Query:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF
        WRFTGFYG+P    +K  W LLERL    + PW+V GDFNEIMFS EK+GG  +    +  FRDT+ +CGLMD+GF G  +TW +   ++   +ERLDR 
Subjt:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF

Query:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSS
          N     L    +++HL ++ SDH  ++L    T DS ++    +    E  W   E  + V + +W SS
Subjt:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSS

KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa]; e-value 3.9e-54; identity 42.8%
Query:  KILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGW-WWR
        K++  NV G+GNP A+ +LR V+RK++P LVFL+ETK     AE ++R++ FSN + ++  G  GGL+L+W +  +++V S S GHID+ VK  G   WR
Subjt:  KILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGW-WWR

Query:  FTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFM
        FTGFYGNP  S R DSW+LL RLK   +LPWI GGDFNEI+  +EKKGG  ++L++++ F+  ++ C L+D+GF G  +TW   +      +ERLDR+F 
Subjt:  FTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFM

Query:  NPDMPPLANELKVEHLNYLHSDHR---AIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNS
        N +   L   +KV + +++HSDHR   AI+ N+            +KS R E  WL    C+ +  + W S
Subjt:  NPDMPPLANELKVEHLNYLHSDHR---AIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNS

XP_024021734.1 uncharacterized protein LOC112091706 [Morus notabilis]; e-value 1.7e-57; identity 42.49%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVK-NSGWWW
        M ++ WNVRG+GNPRA   LR +IR  +P L FL ET+  S  AE +KR+ GF   + ++  G  GGL+LMW+  +++ + S S+ HID  V+   G WW
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVK-NSGWWW

Query:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFF
        RFTGFYGNP +S R  SW LL RLK  SNLPW+V GDFNEI+F  +K+GG+ +   S+N FRDT+  C L+D+GF G ++TW   +      +ERLDR  
Subjt:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFF

Query:  MNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYG---RKSVRLEESWLNHEGCKVVFQEVWNSS
           +   L     V ++++  SDHRA+ L +    D   +G G   RK  R E  W+  E  K   +  W ++
Subjt:  MNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYG---RKSVRLEESWLNHEGCKVVFQEVWNSS

XP_028075737.1 uncharacterized protein LOC114277953 [Camellia sinensis]; e-value 1.1e-53; identity 37.76%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGW--W
        MKILCWN RG+GNPR VR L+ +++K  P +VFL ETK ++   E+++ KLG    + ++  G  GGL L+W   I + + S SRGH+DS + +      
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGW--W

Query:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF
        W FTGFYGNP  S R DSW LL RL+D  +LPW+  GDFNEI+++HEK G + ++   ++ FR  ++DC L D+GF G  +TW   ++     +ERLDR 
Subjt:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF

Query:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQS-SGYGRKSVRLEESWLNHEGCKVVFQEVWN-----SSLWISNGAIHRAESEIKS
         +N          +V HL    SDH  I+L++   +   +     RK  R E  WL  E C+ +    W+     S + + +G +    S++++
Subjt:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQS-SGYGRKSVRLEESWLNHEGCKVVFQEVWN-----SSLWISNGAIHRAESEIKS

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]; e-value 1.1e-53; identity 39.55%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVK-NSGWWW
        MK +CWN RG+GNP  +R+LR +I +  P L+FL ETK ++   + LK KLGF N + ++SEG  GGL L+W + + + + S S+ HID  +K +    W
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVK-NSGWWW

Query:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFF
        RFTG YG+PD S+R  +W L+  L     LPW+VGGD NE++  HEK+GG  + ++ + AFR+ + +C L D+G+ G R+TW   +       E LDRF 
Subjt:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFF

Query:  MNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWN
         N  +  L   L V+H N  HSDH  I+     + + Q      K   LE  W+  E C+ + ++VW+
Subjt:  MNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWN

TrEMBL top hits (e-value, % identity, alignment)
A0A1U8HV94 uncharacterized protein LOC107889912; e-value 1.6e-53; identity 39.48%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNS--GWW
        MKIL WNVRG+GNPR V  LRH ++ +NP +VF  ETK   +  E+++R+ GF N   ++S G+ GGL L W++ + I++ S S+ HID  ++++  G  
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNS--GWW

Query:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF
        WRFTGFYG+     R +SW LL+ L+++  LPW V GDFNEIM+ HEK+GG P+    ++AFR  + DC L+D+G+ G+ +TW++    +   +ERLDR 
Subjt:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF

Query:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSS
          N D   L  +  ++HL +  SDH  +++N    ED++     ++S + E  W+  E      + +W++S
Subjt:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSS

A0A2K2CS29 Endo/exonuclease/phosphatase domain-containing protein; e-value 2.1e-53; identity 40.22%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHID-STVKNSGWWW
        MKI+ WN RG+GN  A+R L  + +K  P ++FL+ETK +    EK +  LG   M + + EG  GG+ L W+  +D+++    RGHI    ++  G+ W
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHID-STVKNSGWWW

Query:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEA-TKERLDRF
        R TG YG+P   ++K + RLL  L   ++LPW+  GDFNEI+F+HEK+GG+P+  + L+ FRD +  CGL D+GF GD +TWR N    +   +ERLD  
Subjt:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEA-TKERLDRF

Query:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSG--YGRKSVRLEESWLNHEGCKVVFQEVWN
          N        + +V +++  HSDHR I L IN  E S+ SG  +G++ +R E  WL  E C+ V Q  W+
Subjt:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSG--YGRKSVRLEESWLNHEGCKVVFQEVWN

A0A5B6U6G4 Reverse transcriptase; e-value 2.4e-57; identity 42.07%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGWW--
        MK  CWN RG+G+PRAVR LR+++++H+P LVFL ETK +S   ++++R  GF+N   + +EG+ GGL L W++ I +T+ S S+ HID  +K  G    
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGWW--

Query:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF
        WRFTGFYG+P    +K  W LLERL    + PW+V GDFNEIMFS EK+GG  +    +  FRDT+ +CGLMD+GF G  +TW +   ++   +ERLDR 
Subjt:  WRFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRF

Query:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSS
          N     L    +++HL ++ SDH  ++L    T DS ++    +    E  W   E  + V + +W SS
Subjt:  FMNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSS

A0A7J6DZ24 CCHC-type domain-containing protein; e-value 1.9e-54; identity 42.8%
Query:  KILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGW-WWR
        K++  NV G+GNP A+ +LR V+RK++P LVFL+ETK     AE ++R++ FSN + ++  G  GGL+L+W +  +++V S S GHID+ VK  G   WR
Subjt:  KILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGW-WWR

Query:  FTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFM
        FTGFYGNP  S R DSW+LL RLK   +LPWI GGDFNEI+  +EKKGG  ++L++++ F+  ++ C L+D+GF G  +TW   +      +ERLDR+F 
Subjt:  FTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFM

Query:  NPDMPPLANELKVEHLNYLHSDHR---AIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNS
        N +   L   +KV + +++HSDHR   AI+ N+            +KS R E  WL    C+ +  + W S
Subjt:  NPDMPPLANELKVEHLNYLHSDHR---AIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNS

A0A803PBM9 Uncharacterized protein; e-value 1.9e-54; identity 32.73%
Query:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTV-KNSGWWW
        MK+L WNV+G+GNP  VR+L+ ++ + +P LVF++E++     AE L+  LG+   +++ + G  GGLIL+W N +D  + S S  HIDS + K  G WW
Subjt:  MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTV-KNSGWWW

Query:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFF
        RFTGFYG+PD ++R +SW+LL R+    + PW++GGDFNEI+ + EK GG PK    +N FR  +N   L ++ + G  YTW   + N E   ERLDR  
Subjt:  RFTGFYGNPDQSKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFF

Query:  MNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKS-VRLEESWLNHEGCKVVFQEVW------NSSLWISN------------------
         NP+   L  + KV HL+ + SDH  ++L+          G    S    E +W + E C  + +E W      N+++ + +                  
Subjt:  MNPDMPPLANELKVEHLNYLHSDHRAIMLNINWTEDSQSSGYGRKS-VRLEESWLNHEGCKVVFQEVW------NSSLWISN------------------

Query:  ---GAIHRAESEIKSLTNSTDPADSERLLAEEKTLNDLLAEEEGYWRLRN----MRRADMED---CEKALGIKRTDSLGHIWDFKNKVEH-NQATPTIDS
             +   E +I  L+ ST+  D + L   E+  N LL +EE +WR R+    ++  D        KA   KR +++  + D   K  H N+    +  
Subjt:  ---GAIHRAESEIKSLTNSTDPADSERLLAEEKTLNDLLAEEEGYWRLRN----MRRADMED---CEKALGIKRTDSLGHIWDFKNKVEH-NQATPTIDS

Query:  IH--NFFNKNLKDVEDSYLKEQLVVRSRNQASRATWEPLK
        ++    F  N   + D    +++V    N+ SR T E LK
Subjt:  IH--NFFNKNLKDVEDSYLKEQLVVRSRNQASRATWEPLK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein; e-value 3.5e-05; identity 30.93%
Query:  QSKRKDSWRLLERLKDSS---NLPWIVGGDFNEI--MFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFMN
        +++R+  W  + RL  SS   N PW+V GDFN+I  +  H     S  +L  L   +  + D  L+D+   G  YTW  ++ +    + +LDR  +N
Subjt:  QSKRKDSWRLLERLKDSS---NLPWIVGGDFNEI--MFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFMN


Sequences
CDS sequence
ATGAAAATTCTATGCTGGAACGTTCGAGGGGTGGGGAATCCTCGGGCGGTCCGATCCTTGCGGCACGTTATCCGCAAGCATAACCCCATGTTAGTGTTCTTAGCAGAAAC
AAAGTGCAACAGTCATCTGGCTGAAAAGTTGAAAAGGAAGCTCGGCTTTAGTAACATGTACATCATTAACAGTGAGGGTAATGGTGGAGGTTTGATTCTTATGTGGCAAA
ACCATATTGATATAACAGTAAACTCTTCCTCAAGAGGTCACATTGATTCCACAGTCAAAAATTCGGGATGGTGGTGGAGATTCACGGGGTTCTACGGAAATCCGGATCAA
AGCAAAAGGAAGGATTCTTGGAGGCTCCTCGAACGGCTCAAGGATAGCTCCAATCTCCCTTGGATTGTTGGAGGAGATTTTAATGAGATTATGTTCAGCCACGAGAAAAA
AGGGGGATCTCCTAAAACTCTTACCTCTTTAAATGCTTTTCGTGACACGATAAATGACTGTGGTTTAATGGACATTGGTTTCTTTGGTGATAGGTACACTTGGAGAAAAA
ACAAGAGCAACAAGGAGGCAACAAAAGAACGACTTGACCGTTTCTTCATGAATCCCGATATGCCTCCCCTGGCAAATGAGTTGAAAGTGGAGCATCTTAACTACCTCCAT
TCTGATCATCGAGCTATCATGTTGAATATTAATTGGACTGAGGATTCCCAGTCGTCGGGGTATGGTAGAAAATCAGTGAGATTGGAGGAGAGTTGGCTGAATCATGAAGG
GTGTAAAGTTGTCTTTCAAGAGGTGTGGAATTCCAGCTTGTGGATCTCAAACGGAGCTATTCATAGAGCTGAATCCGAAATAAAAAGTCTCACAAACTCTACTGATCCTG
CTGATTCTGAGAGGCTACTTGCCGAAGAGAAAACGCTAAATGATCTGCTAGCTGAGGAAGAAGGATATTGGAGATTAAGGAATATGAGGAGGGCCGATATGGAAGATTGC
GAAAAAGCTCTGGGTATTAAAAGAACAGACTCGTTAGGCCATATCTGGGACTTCAAGAACAAAGTGGAGCACAATCAAGCTACTCCTACGATCGACTCAATCCATAATTT
TTTCAACAAAAATTTAAAGGATGTTGAGGACTCGTACCTGAAGGAGCAGCTCGTGGTTAGATCACGGAACCAAGCGAGTCGCGCGACCTGGGAGCCCCTGAAGCATAATA
CCTGGAAATTAAATACTGATGCAGCGTGGAATGAGAAAGATTGTTGTGGAGGTATCAGATGGGTGGTGCACGACTCGAATGGGTCCATGATCTTCAGCGGATCGAAGAAA
ATCCACACGAATTGGCCTATTAAATGGCTAGAAGCTAAAGCGATCCTAGAAGCTTTGAAGGAAATCGCAAGTACCTGTTCCCGGAAGCAAATCCCCCTTGTTATTGAATT
AGAAGCGCTCAAGATCATCAGAGTGTTGAGTGGAGAAGTCGAAGACCCGTCGGACGTGAAAACCATCACAGACGACATCATCAGCCACACTTCACGGCTACCTCGGGTGG
AATTCTGCCATTGCAGTAGGACCACGAACACAGTAGCCCACTGTGAAGCGAGATTTGAGTGTAATCTTCGTTTTGATGCTAGCCAGGCGTTTCCCCTCTCGCGGGAAAAT
GGGTTTCATTTTGTAGCCCCTGGCTTTTTTTATTGTTTGGATTCTGGTACTTTTGGCCCCTCCATTTTATCGAGGGTTTTGCGGTAG
mRNA sequence
ATGAAAATTCTATGCTGGAACGTTCGAGGGGTGGGGAATCCTCGGGCGGTCCGATCCTTGCGGCACGTTATCCGCAAGCATAACCCCATGTTAGTGTTCTTAGCAGAAAC
AAAGTGCAACAGTCATCTGGCTGAAAAGTTGAAAAGGAAGCTCGGCTTTAGTAACATGTACATCATTAACAGTGAGGGTAATGGTGGAGGTTTGATTCTTATGTGGCAAA
ACCATATTGATATAACAGTAAACTCTTCCTCAAGAGGTCACATTGATTCCACAGTCAAAAATTCGGGATGGTGGTGGAGATTCACGGGGTTCTACGGAAATCCGGATCAA
AGCAAAAGGAAGGATTCTTGGAGGCTCCTCGAACGGCTCAAGGATAGCTCCAATCTCCCTTGGATTGTTGGAGGAGATTTTAATGAGATTATGTTCAGCCACGAGAAAAA
AGGGGGATCTCCTAAAACTCTTACCTCTTTAAATGCTTTTCGTGACACGATAAATGACTGTGGTTTAATGGACATTGGTTTCTTTGGTGATAGGTACACTTGGAGAAAAA
ACAAGAGCAACAAGGAGGCAACAAAAGAACGACTTGACCGTTTCTTCATGAATCCCGATATGCCTCCCCTGGCAAATGAGTTGAAAGTGGAGCATCTTAACTACCTCCAT
TCTGATCATCGAGCTATCATGTTGAATATTAATTGGACTGAGGATTCCCAGTCGTCGGGGTATGGTAGAAAATCAGTGAGATTGGAGGAGAGTTGGCTGAATCATGAAGG
GTGTAAAGTTGTCTTTCAAGAGGTGTGGAATTCCAGCTTGTGGATCTCAAACGGAGCTATTCATAGAGCTGAATCCGAAATAAAAAGTCTCACAAACTCTACTGATCCTG
CTGATTCTGAGAGGCTACTTGCCGAAGAGAAAACGCTAAATGATCTGCTAGCTGAGGAAGAAGGATATTGGAGATTAAGGAATATGAGGAGGGCCGATATGGAAGATTGC
GAAAAAGCTCTGGGTATTAAAAGAACAGACTCGTTAGGCCATATCTGGGACTTCAAGAACAAAGTGGAGCACAATCAAGCTACTCCTACGATCGACTCAATCCATAATTT
TTTCAACAAAAATTTAAAGGATGTTGAGGACTCGTACCTGAAGGAGCAGCTCGTGGTTAGATCACGGAACCAAGCGAGTCGCGCGACCTGGGAGCCCCTGAAGCATAATA
CCTGGAAATTAAATACTGATGCAGCGTGGAATGAGAAAGATTGTTGTGGAGGTATCAGATGGGTGGTGCACGACTCGAATGGGTCCATGATCTTCAGCGGATCGAAGAAA
ATCCACACGAATTGGCCTATTAAATGGCTAGAAGCTAAAGCGATCCTAGAAGCTTTGAAGGAAATCGCAAGTACCTGTTCCCGGAAGCAAATCCCCCTTGTTATTGAATT
AGAAGCGCTCAAGATCATCAGAGTGTTGAGTGGAGAAGTCGAAGACCCGTCGGACGTGAAAACCATCACAGACGACATCATCAGCCACACTTCACGGCTACCTCGGGTGG
AATTCTGCCATTGCAGTAGGACCACGAACACAGTAGCCCACTGTGAAGCGAGATTTGAGTGTAATCTTCGTTTTGATGCTAGCCAGGCGTTTCCCCTCTCGCGGGAAAAT
GGGTTTCATTTTGTAGCCCCTGGCTTTTTTTATTGTTTGGATTCTGGTACTTTTGGCCCCTCCATTTTATCGAGGGTTTTGCGGTAG
Protein sequence
MKILCWNVRGVGNPRAVRSLRHVIRKHNPMLVFLAETKCNSHLAEKLKRKLGFSNMYIINSEGNGGGLILMWQNHIDITVNSSSRGHIDSTVKNSGWWWRFTGFYGNPDQ
SKRKDSWRLLERLKDSSNLPWIVGGDFNEIMFSHEKKGGSPKTLTSLNAFRDTINDCGLMDIGFFGDRYTWRKNKSNKEATKERLDRFFMNPDMPPLANELKVEHLNYLH
SDHRAIMLNINWTEDSQSSGYGRKSVRLEESWLNHEGCKVVFQEVWNSSLWISNGAIHRAESEIKSLTNSTDPADSERLLAEEKTLNDLLAEEEGYWRLRNMRRADMEDC
EKALGIKRTDSLGHIWDFKNKVEHNQATPTIDSIHNFFNKNLKDVEDSYLKEQLVVRSRNQASRATWEPLKHNTWKLNTDAAWNEKDCCGGIRWVVHDSNGSMIFSGSKK
IHTNWPIKWLEAKAILEALKEIASTCSRKQIPLVIELEALKIIRVLSGEVEDPSDVKTITDDIISHTSRLPRVEFCHCSRTTNTVAHCEARFECNLRFDASQAFPLSREN
GFHFVAPGFFYCLDSGTFGPSILSRVLR
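
The protein sequence is expected to correspond to a translation of the CDS sequence. A minimal sketch of how that correspondence could be checked is given here, assuming the standard genetic code (the page does not say which translation table was used). The helper name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nucleotides of the CDS listed on this page.
cds_start = "ATGAAAATTCTATGCTGGAACGTTCGAGGG"
protein_start = translate(cds_start)
print(protein_start)
```

The translated prefix can then be compared with the first ten residues of the protein sequence (`MKILCWNVRG`). Applying the same function to the full CDS would include the final stop codon as a trailing `*`.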