CuGenDBv2

Spg008556 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008556
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold10:36941031..36944311
RNA-Seq Expression: Spg008556
Synteny: Spg008556
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
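
The genome location in this record is written as a sequence identifier followed by a start..end range. A minimal sketch of reading that string and computing the span it covers is shown below. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

# Assumed layout: "<sequence id>:<start>..<end>" (1-based, inclusive coordinates).
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (sequence id, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Number of bases covered, under the inclusive-coordinate assumption."""
    return end - start + 1


seqid, start, end = parse_location("scaffold10:36941031..36944311")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 3281 bases on scaffold10.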


Homology
GenBank top hits
Hit | E-value | % identity
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | 2.8e-22 | 26.99
Query:  NIINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQF
        N+  +  +  P+C  C+  VE  SH+  +CK ++ +W     + +   D N+ F++A    +W   + ++ EL   ++  W +W  RN+ +     +D  
Subjt:  NIINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQF

Query:  RISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIE
         ++ + D  + A + +  S    +    ++ +  Q    +W+PP  N  KLNVDA+ S      G+G I+RD+ G  L +G K       V   E +AI 
Subjt:  RISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIE

Query:  EGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAISDPPVSV
         GL+        + + S + ++VESD   VV +LN  +   +EI ++  ++ R  +   +V F F PR+CN  AH LA+ A+ +    V
Subjt:  EGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAISDPPVSV

XP_015385738.1 uncharacterized protein LOC107177034 [Citrus sinensis] | 3.4e-23 | 28.27
Query:  NIINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFD-LNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQ
        N+  + I+  P C +C+   E T+H    CK +K +W +Y P      D +N+   +     + +   L+  ++E  V I W++W  RN+ L      + 
Subjt:  NIINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFD-LNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQ

Query:  FRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAI
             + +  + A   + Q          E +  S+   V W PPP N +K+NVDA+ +  R   G+G ++RDSS        K    +  V+T E +A+
Subjt:  FRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAI

Query:  EEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAI
        E GL         ++  +   I+VESD   VV+++N  E   +EI ++  EI  L +    +S++  PRSCN  AH LA++A+
Subjt:  EEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAI

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] | 1.8e-24 | 35.82
Query:  RGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNI--FAQEEIDQSHRSRLGRKSEQ--------RVKSQTSHVQW
        R  W  K  W WL++ LSD+E+  S++I W +W+ RN  +      D+    +Q+DR+I  F    ID+       R+S+Q        R       V+W
Subjt:  RGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNI--FAQEEIDQSHRSRLGRKSEQ--------RVKSQTSHVQW

Query:  QPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDL
          PP NCWKLN DASWSE R +GG+GWIL D  G  +  G   I +   +  LE+  I  GL+ +         QS +PI +ESD++ V+R++  E+ DL
Subjt:  QPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDL

Query:  S
        +
Subjt:  S

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] | 8.2e-30 | 33.18
Query:  RNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEI
        R   E T H+ W+CKV K++W+   P+  + F ++R  W  K YW WLMD   ++E  +S+II   +W+ RN+ +    +++   I   IDR I      
Subjt:  RNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEI

Query:  DQSHRSRLGRKSEQ----RVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLER
        D    + L RKS+     R     +  +W+PP  N WKLN DA+W       G+GWILRD  G  +  G ++I    ++  LE+ AI EGL+ +      
Subjt:  DQSHRSRLGRKSEQ----RVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLER

Query:  SREQSIAPILVESDAIGVVRILN
         R++   PI +ESD++  + +L+
Subjt:  SREQSIAPILVESDAIGVVRILN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | 3.7e-22 | 32.11
Query:  LMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGI
        ++D  SD++L+  +I  W +W HRN V+    ++    + +Q+ +  F  E   QS  S          K+  + ++W+PPP + W LN DASWS++   
Subjt:  LMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGI

Query:  GGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSF
        GG+GWI+R   G  +  G + +    +VK LE  AI EGL+ + +         + P+ +E+D+  V  +LN + EDL++  ++ EEIL L++S   ++F
Subjt:  GGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSF

Query:  HFCPRSCNEAAHRLARIA
            R  N  AH LA+ A
Subjt:  HFCPRSCNEAAHRLARIA
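
In the alignments above, the unlabeled line between Query and Subjt is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of counting identities and positives from one match line follows. It uses the final segment of the XP_022155262.1 alignment with the column padding removed; `midline_counts` is a hypothetical helper name.

```python
def midline_counts(midline):
    """Count identities (letters) and positives (letters plus '+') in a match line."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# Final segment of the XP_022155262.1 alignment, padding stripped.
query = "HFCPRSCNEAAHRLARIA"
midline = "    R  N  AH LA+ A"

identities, positives = midline_counts(midline)
print(identities, positives, len(query))
```

The % identity reported for a hit is computed over the whole alignment, so a single segment like this one will not generally reproduce it.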

TrEMBL top hits
Hit | E-value | % identity
A0A5B7BI33 Uncharacterized protein (Fragment) | 4.9e-20 | 26.33
Query:  IINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFR
        ++++ +  +PLC LC N  E   HLF  C   K +W       K  F       + + ++ ++ + L    +E   +I+W +W HRN +  +    D  R
Subjt:  IINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFR

Query:  ISKQIDRNIF-AQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIE
        +       +F AQ+ +++ H ++        V S+     W PPP + +KLNVD SW      GG+G ++RDS G  +    K +    S    E  A+ 
Subjt:  ISKQIDRNIF-AQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIE

Query:  EGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIA
         G+         ++E  I  +L+ESD + +V  +     D S I  + ++I R    L         RS N+ AH +A  A
Subjt:  EGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIA

A0A6J1CQG0 uncharacterized protein LOC111013216 | 8.6e-25 | 35.82
Query:  RGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNI--FAQEEIDQSHRSRLGRKSEQ--------RVKSQTSHVQW
        R  W  K  W WL++ LSD+E+  S++I W +W+ RN  +      D+    +Q+DR+I  F    ID+       R+S+Q        R       V+W
Subjt:  RGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNI--FAQEEIDQSHRSRLGRKSEQ--------RVKSQTSHVQW

Query:  QPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDL
          PP NCWKLN DASWSE R +GG+GWIL D  G  +  G   I +   +  LE+  I  GL+ +         QS +PI +ESD++ V+R++  E+ DL
Subjt:  QPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDL

Query:  S
        +
Subjt:  S

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 | 4.0e-30 | 33.18
Query:  RNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEI
        R   E T H+ W+CKV K++W+   P+  + F ++R  W  K YW WLMD   ++E  +S+II   +W+ RN+ +    +++   I   IDR I      
Subjt:  RNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEI

Query:  DQSHRSRLGRKSEQ----RVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLER
        D    + L RKS+     R     +  +W+PP  N WKLN DA+W       G+GWILRD  G  +  G ++I    ++  LE+ AI EGL+ +      
Subjt:  DQSHRSRLGRKSEQ----RVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLER

Query:  SREQSIAPILVESDAIGVVRILN
         R++   PI +ESD++  + +L+
Subjt:  SREQSIAPILVESDAIGVVRILN

A0A6J1DNV9 uncharacterized protein LOC111022403 | 1.8e-22 | 32.11
Query:  LMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGI
        ++D  SD++L+  +I  W +W HRN V+    ++    + +Q+ +  F  E   QS  S          K+  + ++W+PPP + W LN DASWS++   
Subjt:  LMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGI

Query:  GGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSF
        GG+GWI+R   G  +  G + +    +VK LE  AI EGL+ + +         + P+ +E+D+  V  +LN + EDL++  ++ EEIL L++S   ++F
Subjt:  GGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSF

Query:  HFCPRSCNEAAHRLARIA
            R  N  AH LA+ A
Subjt:  HFCPRSCNEAAHRLARIA

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X2 | 3.7e-20 | 32.97
Query:  WNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQ----RVKSQTSHVQWQPPPGNCWK
        W  K YW WLMD   ++E  +S+II   +W+ RN+ +    +++   I   IDR I      D    + L RKS+     R     +  +W+PP  N WK
Subjt:  WNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQ----RVKSQTSHVQWQPPPGNCWK

Query:  LNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILN
        LN DA+W       G+GWILRD  G  +  G ++I    ++  LE+ AI EGL+ +       R++   PI +ESD++  + +L+
Subjt:  LNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILN

SwissProt top hits
No hits found

Arabidopsis top hits
Hit | E-value | % identity
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.3e-04 | 26.21
Query:  IINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFD---LNRGFWNAKAYWLWLMDNLSDKELEKSVII-----LWSLWQHRNEVLTN
        +I+ G +  PLCL C  H E   HLF+DC+ ++ +W+ Y      +F       G         WL +   DK +   + +     ++++W+ RN  L +
Subjt:  IINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFD---LNRGFWNAKAYWLWLMDNLSDKELEKSVII-----LWSLWQHRNEVLTN

Query:  SSN
        S++
Subjt:  SSN

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 2.2e-20 | 28.32
Query:  CLLCRNHVEWTSHLFWDCKVSKNLW-LKYIPLTKDLFDLNRGFWN----AKAYWLWLMDNLSDKELEKSVII---LWSLWQHRNEVLTNSSNADQFRISK
        C+ C +  E  +HL + C  ++ +W +  IP   +      G W     A  YW+  ++    K  +   ++   LW LW+ RNE++      D    + 
Subjt:  CLLCRNHVEWTSHLFWDCKVSKNLW-LKYIPLTKDLFDLNRGFWN----AKAYWLWLMDNLSDKELEKSVII---LWSLWQHRNEVLTNSSNADQFRISK

Query:  QIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLK
        ++ R      E   + R   G+ S  +V+   S VQW+ PP    K N DA+W       G+GWILR+ SG  L MG + + ++ +V   E++A+   + 
Subjt:  QIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLK

Query:  CVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAIS
         +  F       +   I+ ESDA  +V +LN  ++    +    E+I +L     EV F F PR  N+ A R+AR +IS
Subjt:  CVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAIS

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 4.4e-05 | 26.16
Query:  QEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLER
        +E +D  H++    K  Q   S+T   +WQ P     K N D S    R   G+ WI+R+S G+ L  G        ++K  E  A+   ++C      R
Subjt:  QEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLER

Query:  SREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAISD
          E        E D I V R++  +E +   + +  E I +  ++   V F F  R  N     LA+ A+++
Subjt:  SREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAISD

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 8.8e-06 | 27.93
Query:  IINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSV--IILWSLWQHRNEVLTNSSNADQ
        +++ G+  +PLC LC   VE   HL   C  S ++W   +     L  +    W +   W+ L    S   + K V    + ++W+ RN +L N  +   
Subjt:  IINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSV--IILWSLWQHRNEVLTNSSNADQ

Query:  FRISKQIDRNI
          I K IDR I
Subjt:  FRISKQIDRNI

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 3.0e-06 | 21.36
Query:  ILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSL
        ++W +W+  N+++ N +   +F+ + ++  N   +E +D +  +   +++  R    + + +W PP  +  K N DAS  E   + G+GWILR+S G+ +
Subjt:  ILWSLWQHRNEVLTNSSNADQFRISKQIDRNIFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSL

Query:  CMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLA
          G        + +  E   +   ++    F  +        ++ E D   + R++N +  +   +    + I     S   + F F  R  N  A  LA
Subjt:  CMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIAPILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLA

Query:  RIAISD
        + AI +
Subjt:  RIAISD


Sequences
CDS sequence
ATGAACATTATTAACAAAGGAATTGTGACTAACCCTTTATGCCTTTTGTGCAGGAATCACGTGGAATGGACTTCCCATCTCTTTTGGGATTGCAAGGTATCAAAAAATTT
GTGGCTCAAATACATTCCTCTAACAAAAGATTTATTTGATCTCAACAGGGGATTTTGGAATGCAAAGGCATATTGGCTTTGGCTCATGGACAATCTGAGCGACAAAGAAT
TGGAAAAATCCGTAATAATTCTCTGGAGCTTATGGCAACATAGAAATGAAGTTCTCACTAATTCCTCCAACGCAGATCAGTTCAGAATTTCAAAGCAAATAGACAGAAAT
ATTTTCGCGCAAGAAGAAATTGACCAATCTCACCGTTCGCGGCTAGGAAGGAAGTCTGAGCAAAGAGTGAAGAGCCAGACGAGTCATGTGCAGTGGCAGCCCCCGCCTGG
AAATTGCTGGAAACTCAACGTCGACGCCTCTTGGAGCGAAGCCAGGGGCATTGGAGGGGTGGGGTGGATCCTTCGTGACTCCTCAGGATCTTCACTATGCATGGGTTACA
AACTGATCAGCAAAAGCTGGTCTGTCAAAACGCTTGAAATGAAAGCAATTGAAGAAGGCCTCAAATGTGTTCCTTCTTTTCTCGAGCGATCCAGGGAGCAATCCATTGCC
CCAATCTTGGTGGAATCTGACGCCATCGGAGTCGTTCGCATCCTTAACGGGGAAGAAGAGGACCTCTCGGAAATCTCTTTTTTGGCCGAGGAGATTCTTCGTCTTAAGGA
GTCTTTAGGGGAGGTGTCGTTTCATTTTTGTCCGAGATCTTGCAACGAGGCCGCCCATCGTTTGGCGCGAATTGCAATCTCTGATCCTCCGGTTTCAGTTTCTTTTTCTG
GTTTTGAGATCTCTTCAAATGCGGAAGAAGATCATGATTATCCTTTGATTTGCATGGGTGAGAGTGACCCAACATCGCCGACTCAATAA
mRNA sequence
ATGAACATTATTAACAAAGGAATTGTGACTAACCCTTTATGCCTTTTGTGCAGGAATCACGTGGAATGGACTTCCCATCTCTTTTGGGATTGCAAGGTATCAAAAAATTT
GTGGCTCAAATACATTCCTCTAACAAAAGATTTATTTGATCTCAACAGGGGATTTTGGAATGCAAAGGCATATTGGCTTTGGCTCATGGACAATCTGAGCGACAAAGAAT
TGGAAAAATCCGTAATAATTCTCTGGAGCTTATGGCAACATAGAAATGAAGTTCTCACTAATTCCTCCAACGCAGATCAGTTCAGAATTTCAAAGCAAATAGACAGAAAT
ATTTTCGCGCAAGAAGAAATTGACCAATCTCACCGTTCGCGGCTAGGAAGGAAGTCTGAGCAAAGAGTGAAGAGCCAGACGAGTCATGTGCAGTGGCAGCCCCCGCCTGG
AAATTGCTGGAAACTCAACGTCGACGCCTCTTGGAGCGAAGCCAGGGGCATTGGAGGGGTGGGGTGGATCCTTCGTGACTCCTCAGGATCTTCACTATGCATGGGTTACA
AACTGATCAGCAAAAGCTGGTCTGTCAAAACGCTTGAAATGAAAGCAATTGAAGAAGGCCTCAAATGTGTTCCTTCTTTTCTCGAGCGATCCAGGGAGCAATCCATTGCC
CCAATCTTGGTGGAATCTGACGCCATCGGAGTCGTTCGCATCCTTAACGGGGAAGAAGAGGACCTCTCGGAAATCTCTTTTTTGGCCGAGGAGATTCTTCGTCTTAAGGA
GTCTTTAGGGGAGGTGTCGTTTCATTTTTGTCCGAGATCTTGCAACGAGGCCGCCCATCGTTTGGCGCGAATTGCAATCTCTGATCCTCCGGTTTCAGTTTCTTTTTCTG
GTTTTGAGATCTCTTCAAATGCGGAAGAAGATCATGATTATCCTTTGATTTGCATGGGTGAGAGTGACCCAACATCGCCGACTCAATAA
Protein sequence
MNIINKGIVTNPLCLLCRNHVEWTSHLFWDCKVSKNLWLKYIPLTKDLFDLNRGFWNAKAYWLWLMDNLSDKELEKSVIILWSLWQHRNEVLTNSSNADQFRISKQIDRN
IFAQEEIDQSHRSRLGRKSEQRVKSQTSHVQWQPPPGNCWKLNVDASWSEARGIGGVGWILRDSSGSSLCMGYKLISKSWSVKTLEMKAIEEGLKCVPSFLERSREQSIA
PILVESDAIGVVRILNGEEEDLSEISFLAEEILRLKESLGEVSFHFCPRSCNEAAHRLARIAISDPPVSVSFSGFEISSNAEEDHDYPLICMGESDPTSPTQ
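
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check is given below, assuming the standard genetic code (the record does not say which translation table was used). Only the first and last few codons of the CDS are included, to keep the example short; `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 15 bases of the CDS listed in this record.
print(translate("ATGAACATTATTAACAAAGGAATT"))
print(translate("TCGCCGACTCAATAA"))
```

The two fragments translate to the start (MNIINKGI) and end (SPTQ) of the protein sequence listed in this record.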