; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027625 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027625
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPolynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome locationchr8:2627281..2628754
RNA-Seq ExpressionLag0027625
SyntenyLag0027625
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]2.9e-2535.05Show/hide
Query:  VFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVD-IPTQTLEGICVGTWAIWNDRN
        +F+W+ F++ +P M NL+   +  N +CLV  +++ET DHALF+C+RA+EVW +LLP           ++D  L  V+ + T   + + VG WAIWNDRN
Subjt:  VFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVD-IPTQTLEGICVGTWAIWNDRN

Query:  NLYHNRQIPSPTIRSEWIQEYLAEFWAAN-PNGGSLD-QSDEDVNKIILE-------GEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRC
         +   RQIP   IRS+WI  Y+ +F   + P+ G    Q D   N   +E          I +++DA+    + + GIGIV R ++G + A   H     
Subjt:  NLYHNRQIPSPTIRSEWIQEYLAEFWAAN-PNGGSLD-QSDEDVNKIILE-------GEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRC

Query:  QSPLGAEAITVLTG
          PL AE++ +  G
Subjt:  QSPLGAEAITVLTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]5.4e-2426.69Show/hide
Query:  WKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSIS-MDQWDQLEIKDRWLSFVD-IPTQTL
        W  + KL VP K+K+F+W+  H  IP   NL+   +    +C +  +  E+I HA F C+RAR++W  L P ++ +   D +   + W S  + +  + L
Subjt:  WKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSIS-MDQWDQLEIKDRWLSFVD-IPTQTL

Query:  EGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNGGS--LDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKAT
            +  W IWNDRN+L H +Q+     + EW+  +L     A  +  S     +   V +       + + ++       +    G ++R     L A 
Subjt:  EGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNGGS--LDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKAT

Query:  QIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR
            V    SPL AE   +L GL+ A       L V SD L  ++ I  +IH R      + +I+ +   F F   S + R
Subjt:  QIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]7.1e-2428.27Show/hide
Query:  WWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDI-PTQTL
        WWK   KL +P+KVK+F WK   +SIP+  +L +  +  + +C +     E+I HALF C  A+EVW+    SI     D+L+  D  +    I      
Subjt:  WWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDI-PTQTL

Query:  EGICVGTWAIWNDRNNLYHNRQIPSP----TIRSEWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLK
        E I    W IW+DRNN  H +++ +P    T    ++ +Y +   A  P   +        +          +++DA+    +SK GIG+++R   G +K
Subjt:  EGICVGTWAIWNDRNNLYHNRQIPSP----TIRSEWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLK

Query:  ATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR
        A               EA  +  GL  A+  ++    V +DCL LV ++NG +   S     + D+K   ++F  T +S   R
Subjt:  ATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]4.6e-2328.62Show/hide
Query:  WWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDIPTQT-L
        WW+    L +P+KV++F W+  ++++P+  NL +  V  + +C +     E+I HALF C  A+ VW+     +   +   ++  D  L    I T++ L
Subjt:  WWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDIPTQT-L

Query:  EGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFW----AANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLK
        E +    W IW+DRNN  H +Q+  P   S   + YLA F     A  P    +      V  +     ++ M++DA+    ++K GIG+++R   G + 
Subjt:  EGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFW----AANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLK

Query:  ATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR
        A     V         EA  +  GLQ A+ L+++   V +DCL LV ++ G+    SS    + DI    +SF    IS   R
Subjt:  ATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]6.4e-2529.21Show/hide
Query:  SKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFV
        S +G +  WWK+   L +P+KV++F WK  ++++P+  NL +  V  + +C +     E+I HALF C  A+ VW+     +   +   ++  D  +   
Subjt:  SKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFV

Query:  DIPTQT-LEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNG-GSLDQSDEDVNKIIL---EGEDIIMHIDASFLDDKSKCGIGIVM
         I T + LE +    W IW+DRNN  H +Q+  P   S   + YLA F +       +  +   DVN++         + M++DA+    +SK G+G+++
Subjt:  DIPTQT-LEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNG-GSLDQSDEDVNKIIL---EGEDIIMHIDASFLDDKSKCGIGIVM

Query:  RTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR
        R   G + A               EA  +  GLQLA  L+++   V +DCL LV +ING     SS    + DI    +S     IS   R
Subjt:  RTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPR

TrEMBL top hitse value%identityAlignment
A0A6J1CTE3 uncharacterized protein LOC1110145781.4e-2535.05Show/hide
Query:  VFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVD-IPTQTLEGICVGTWAIWNDRN
        +F+W+ F++ +P M NL+   +  N +CLV  +++ET DHALF+C+RA+EVW +LLP           ++D  L  V+ + T   + + VG WAIWNDRN
Subjt:  VFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVD-IPTQTLEGICVGTWAIWNDRN

Query:  NLYHNRQIPSPTIRSEWIQEYLAEFWAAN-PNGGSLD-QSDEDVNKIILE-------GEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRC
         +   RQIP   IRS+WI  Y+ +F   + P+ G    Q D   N   +E          I +++DA+    + + GIGIV R ++G + A   H     
Subjt:  NLYHNRQIPSPTIRSEWIQEYLAEFWAAN-PNGGSLD-QSDEDVNKIILE-------GEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRC

Query:  QSPLGAEAITVLTG
          PL AE++ +  G
Subjt:  QSPLGAEAITVLTG

A0A803NM27 Uncharacterized protein5.3e-2527.44Show/hide
Query:  MSKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSF
        +S +   V WWK   +L++P KVK+F WK  HN++P+   L       + SC +     E++ HA+F C+ AR VW++   S +      ++I+D     
Subjt:  MSKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSF

Query:  VDIPTQT-LEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANP---NGGSLDQSDEDVNKIILEGED--IIMHIDASFLDDKSKCGIGI
         +  T++ LE I    W+IW+DRNN+ H +    P++ S     +L+ F +A     + G         ++         + +++DA+F D + + G G 
Subjt:  VDIPTQT-LEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANP---NGGSLDQSDEDVNKIILEGED--IIMHIDASFLDDKSKCGIGI

Query:  VMRTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK
        ++R   G +KA   H +  C  P   EA  +   L+ AR L  +   V +D L LV ++       SS    ++D++
Subjt:  VMRTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK

A0A803Q185 Uncharacterized protein8.2e-2629.82Show/hide
Query:  SKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSC--LVFHEEMETIDHALFRCQRAREVWEVL---LPSISMDQWDQLEIKDR
        S + + V+WW+K+ +L++P K+KVFVWK  H  +P    L   HV     C     H   ETI HAL+ C++++  W++    L      Q D+L    R
Subjt:  SKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSC--LVFHEEMETIDHALFRCQRAREVWEVL---LPSISMDQWDQLEIKDR

Query:  WLSFVDIPTQTLEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVM
         LS V +     E   V TW++WN RN   H+  +P P    EW  + L +F         + + +E V K+   GE + +++DAS        G+G V+
Subjt:  WLSFVDIPTQTLEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVM

Query:  RTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK
        R +QG +       ++R  SPL  E + +L G+Q+     +R  ++  DCL  ++ I+ +  G   +   L  I+
Subjt:  RTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK

A0A803Q8J4 Uncharacterized protein1.2e-2428.36Show/hide
Query:  WWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDIPTQT-L
        WWK   +L++P KVK+F WK  HN++P+   L       + SC +     E++ HALF C+ AR VW+V   + +      + I+D      +  T++ L
Subjt:  WWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDIPTQT-L

Query:  EGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANP---NGGSLDQSDEDVNKIILEGED--IIMHIDASFLDDKSKCGIGIVMRTKQGYL
        E I    W+IW+DRNN+ H +    PT+ S   Q +L  + +        G    +   ++K         + +++DA+F +  +K G G ++R   G +
Subjt:  EGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANP---NGGSLDQSDEDVNKIILEGED--IIMHIDASFLDDKSKCGIGIVMRTKQGYL

Query:  KATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK
        KA   H +  C  P   EA  +   L+ AR L  +   V +D L L  ++      RSS    ++D++
Subjt:  KATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK

A0A803QAH8 Uncharacterized protein4.8e-2627.8Show/hide
Query:  MSKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSF
        +S +   V WWK   +L++P KVK+F WK  HN++P+   L       + SC +     E++ HA+F C+ AR VW+++  S +      ++I+D     
Subjt:  MSKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSF

Query:  VDIPTQT-LEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANP---NGGSLDQSDEDVNKIILEGED--IIMHIDASFLDDKSKCGIGI
         +  T++ LE I    W+IW+DRNN+ H +    P++ S     +L+ F +A+    + G         +K         + +++DA+F D + + G G 
Subjt:  VDIPTQT-LEGICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANP---NGGSLDQSDEDVNKIILEGED--IIMHIDASFLDDKSKCGIGI

Query:  VMRTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK
        ++R   G +KA   H +  C  P   EA  +   L+ AR L  +   V +D L LV ++       SS    ++D++
Subjt:  VMRTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.4e-1123.13Show/hide
Query:  HVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVL-LPSISMDQW-DQLEIKDRWL--SFVDIPTQTLEGICVG--TWAIWNDRNNL-YHNRQIPSPTIR
        H+    SC+   +  ET++H LF+C  AR VW +  +P+    +W D L     W+    V+IP     G  V    W +W  RN L +  ++  +P + 
Subjt:  HVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVL-LPSISMDQW-DQLEIKDRWL--SFVDIPTQTLEGICVG--TWAIWNDRNNL-YHNRQIPSPTIR

Query:  SEWIQEYLAEFWAANPN-----GGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLA
           ++++  E W+          G   + +  V       + +  + DA++  +  +CGIG ++R + G +       + R ++ L AE   +   +   
Subjt:  SEWIQEYLAEFWAANPN-----GGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLA

Query:  RNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPRGNTGLAEEKAVANTSNKQISNPNFSI
             +R+   SD   LV  +N       ++   L DI+++   F+      TPRG   +A+  A  + S        FSI
Subjt:  RNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPRGNTGLAEEKAVANTSNKQISNPNFSI

AT3G09510.1 Ribonuclease H-like superfamily protein4.3e-1125.71Show/hide
Query:  KVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQW---DQLEIKDRWLSFVDIPTQT--
        ++  L +  K+K F+W+    ++     L    + ++ SC   H E E+I+HALF C  A   W +   S+  +Q    D  E     L+FV   T +  
Subjt:  KVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQW---DQLEIKDRWLSFVDIPTQT--

Query:  --LEGICVGTWAIWNDRNNLYHN--RQIPSPTIRS------EWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMR
          L  + +  W IW  RNN+  N  R+ PS T+ S      +W+    +     +P     +   E  N        +  + DA F   K +   G ++R
Subjt:  --LEGICVGTWAIWNDRNNLYHN--RQIPSPTIRS------EWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMR

Query:  TKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASF
           G   +     +    +PL AE   +L  LQ        ++ +  DC  L+  ING I   SS++  L DI   A  F
Subjt:  TKQGYLKATQIHFVRRCQSPLGAEAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAGAATGGAGTGGAAGTTAGGTGGTGGAAAAAGGTTGGGAAATTAAGAGTTCCTAACAAAGTAAAAGTCTTTGTGTGGAAATATTTCCATAATTCCATTCCAAT
CATGGTTAACTTAATGAACCATCATGTACCAGTCAACATGAGTTGCCTGGTTTTTCATGAGGAGATGGAAACCATAGATCATGCTCTATTCAGATGCCAAAGAGCACGGG
AGGTTTGGGAGGTTCTTTTACCTTCAATTTCGATGGATCAGTGGGATCAGTTAGAAATTAAAGATCGATGGTTGAGCTTTGTTGACATTCCAACTCAGACATTAGAAGGC
ATTTGTGTAGGGACCTGGGCAATTTGGAATGATAGAAATAATTTGTACCACAATCGCCAAATTCCAAGTCCTACGATTAGAAGTGAATGGATCCAGGAATATCTAGCAGA
GTTCTGGGCGGCAAACCCGAATGGAGGATCGTTGGATCAATCGGATGAAGATGTTAATAAAATCATATTAGAAGGGGAAGACATTATTATGCACATCGATGCATCATTCC
TGGATGACAAGTCTAAATGTGGTATCGGCATAGTCATGCGTACTAAACAAGGTTATCTAAAGGCAACCCAGATTCATTTTGTTAGAAGATGCCAATCTCCTTTGGGAGCA
GAAGCTATAACTGTTTTGACAGGACTGCAATTAGCAAGGAATTTGAAGGTGAGGAGATTAACAGTTATGTCCGACTGTTTGAACCTGGTTAAATCTATAAATGGTCAGAT
TCATGGGCGGTCGAGTATATCTACAACACTTTGGGACATCAAAGAAATTGCAGCCTCTTTTGACTTTACCGACATCAGCTCCACTCCACGTGGCAACACAGGATTGGCCG
AAGAAAAGGCCGTGGCCAACACTTCTAACAAGCAAATCAGCAACCCGAATTTCTCGATTCGGCGCGCAGTGGGACATGAACGAATCTTAGCAACCAAGTTTATCGGAGAA
AAGAGATTTTGGGATTTTTTAAGAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCAAGAATGGAGTGGAAGTTAGGTGGTGGAAAAAGGTTGGGAAATTAAGAGTTCCTAACAAAGTAAAAGTCTTTGTGTGGAAATATTTCCATAATTCCATTCCAAT
CATGGTTAACTTAATGAACCATCATGTACCAGTCAACATGAGTTGCCTGGTTTTTCATGAGGAGATGGAAACCATAGATCATGCTCTATTCAGATGCCAAAGAGCACGGG
AGGTTTGGGAGGTTCTTTTACCTTCAATTTCGATGGATCAGTGGGATCAGTTAGAAATTAAAGATCGATGGTTGAGCTTTGTTGACATTCCAACTCAGACATTAGAAGGC
ATTTGTGTAGGGACCTGGGCAATTTGGAATGATAGAAATAATTTGTACCACAATCGCCAAATTCCAAGTCCTACGATTAGAAGTGAATGGATCCAGGAATATCTAGCAGA
GTTCTGGGCGGCAAACCCGAATGGAGGATCGTTGGATCAATCGGATGAAGATGTTAATAAAATCATATTAGAAGGGGAAGACATTATTATGCACATCGATGCATCATTCC
TGGATGACAAGTCTAAATGTGGTATCGGCATAGTCATGCGTACTAAACAAGGTTATCTAAAGGCAACCCAGATTCATTTTGTTAGAAGATGCCAATCTCCTTTGGGAGCA
GAAGCTATAACTGTTTTGACAGGACTGCAATTAGCAAGGAATTTGAAGGTGAGGAGATTAACAGTTATGTCCGACTGTTTGAACCTGGTTAAATCTATAAATGGTCAGAT
TCATGGGCGGTCGAGTATATCTACAACACTTTGGGACATCAAAGAAATTGCAGCCTCTTTTGACTTTACCGACATCAGCTCCACTCCACGTGGCAACACAGGATTGGCCG
AAGAAAAGGCCGTGGCCAACACTTCTAACAAGCAAATCAGCAACCCGAATTTCTCGATTCGGCGCGCAGTGGGACATGAACGAATCTTAGCAACCAAGTTTATCGGAGAA
AAGAGATTTTGGGATTTTTTAAGAGGTTAG
Protein sequenceShow/hide protein sequence
MSKNGVEVRWWKKVGKLRVPNKVKVFVWKYFHNSIPIMVNLMNHHVPVNMSCLVFHEEMETIDHALFRCQRAREVWEVLLPSISMDQWDQLEIKDRWLSFVDIPTQTLEG
ICVGTWAIWNDRNNLYHNRQIPSPTIRSEWIQEYLAEFWAANPNGGSLDQSDEDVNKIILEGEDIIMHIDASFLDDKSKCGIGIVMRTKQGYLKATQIHFVRRCQSPLGA
EAITVLTGLQLARNLKVRRLTVMSDCLNLVKSINGQIHGRSSISTTLWDIKEIAASFDFTDISSTPRGNTGLAEEKAVANTSNKQISNPNFSIRRAVGHERILATKFIGE
KRFWDFLRG