CuGenDBv2

Lag0041154 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041154
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Polynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome location: chr13:12950867..12955902
RNA-Seq Expression: Lag0041154
Synteny: Lag0041154
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e-value, % identity, alignment)
KAF4396512.1 hypothetical protein F8388_005782 [Cannabis sativa]; e-value 1.3e-12; identity 29.25%
Query:  EEFFGEACIALWSIWNDRNNILHNIPI---PSWTQ-RCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGV
        EEFF    +  W +W  RNN +    I     WT    + I ++ + +++ SK  N  M ++            ++TDA++     G G  A++ D  G 
Subjt:  EEFFGEACIALWSIWNDRNNILHNIPI---PSWTQ-RCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGV

Query:  MLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALF
        ++ A   ++P   S + AE  A+  G++L  ++ IT A V SDS ++I+ +N +    +D    V DI+  R  F +L F F +RS N  A+ LAK +  
Subjt:  MLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALF

Query:  SQRSILWINDFP
        +Q+S +W +  P
Subjt:  SQRSILWINDFP

XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]; e-value 1.3e-12; identity 26.24%
Query:  IALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQ----GVTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVY
        +  W IW +RN ++H     + +Q   W+ +Y++E +    S N  ++    +    V+    G TL  DAA+  N    G GA ++  +      L   
Subjt:  IALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQ----GVTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVY

Query:  VPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFSQRSILWI
        +    S + AE  A++ GL+  Q    T   VL+DS ++++ +N E +  +++   + D R   + F  +  S VSR+ N +A  LA+ AL     + W+
Subjt:  VPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFSQRSILWI

Query:  ND
         D
Subjt:  ND

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e-value 2.3e-20; identity 30.84%
Query:  ACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKS-------SNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGVMLG
        A I  W IWNDRN+++H   +     +CEW+  +   + +   S       SNH   +  +  +  V  + L+TDAA       + +G ++ D S  ++ 
Subjt:  ACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKS-------SNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGVMLG

Query:  ALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFS-Q
        A  + VP   SP+ AE + I+ GLK       T   V SDSL  I++I  E+    D  +WV++I+     F  ++FS  SR  N  A  LAK  + S  
Subjt:  ALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFS-Q

Query:  RSILWINDFPPWLISM------SNVAH
         +  W+ +FP WL+ +      SN AH
Subjt:  RSILWINDFPPWLISM------SNVAH

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]; e-value 8.4e-15; identity 30.24%
Query:  IALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQG--VTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVYVP
        I  W++WNDR+ +++   IP    + EWI  Y +E R     +     I +  E     G  V ++TDAA+     GSG G L+ + +  ++GA+ V   
Subjt:  IALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQG--VTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVYVP

Query:  TTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFSQRSILWIND
           +P+ A+  AI  GL L  +  + +  V +DSL  + +I  +     +   WV DIR F   F  + F  V R  N+ A  L ++ +  +   LW  D
Subjt:  TTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFSQRSILWIND

Query:  FPPWL
        FP WL
Subjt:  FPPWL

XP_030483444.1 uncharacterized protein LOC115700033 [Cannabis sativa]; e-value 2.3e-12; identity 28.84%
Query:  MQEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLS----TDAAIHPNALGSGYGALVLDVS
        +++E F +  I+  +IW +RN   H   I +  Q   WI SY+ E  R    +   ++     E  + +   LS     DAAI       G+GA++    
Subjt:  MQEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLS----TDAAIHPNALGSGYGALVLDVS

Query:  GVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDA
          M  AL   +  +FS   AE  A++ GL   Q   +    V SDSL+++  + G+    +++     DI+    SF  ++ S VSR+ N  A RLAK A
Subjt:  GVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDA

Query:  LFSQRSILWINDFPP
        L     + W+ + PP
Subjt:  LFSQRSILWINDFPP

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DX30 uncharacterized protein LOC111024874; e-value 1.1e-20; identity 30.84%
Query:  ACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKS-------SNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGVMLG
        A I  W IWNDRN+++H   +     +CEW+  +   + +   S       SNH   +  +  +  V  + L+TDAA       + +G ++ D S  ++ 
Subjt:  ACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKS-------SNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGVMLG

Query:  ALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFS-Q
        A  + VP   SP+ AE + I+ GLK       T   V SDSL  I++I  E+    D  +WV++I+     F  ++FS  SR  N  A  LAK  + S  
Subjt:  ALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFS-Q

Query:  RSILWINDFPPWLISM------SNVAH
         +  W+ +FP WL+ +      SN AH
Subjt:  RSILWINDFPPWLISM------SNVAH

A0A7J6HML3 Uncharacterized protein; e-value 6.5e-13; identity 29.25%
Query:  EEFFGEACIALWSIWNDRNNILHNIPI---PSWTQ-RCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGV
        EEFF    +  W +W  RNN +    I     WT    + I ++ + +++ SK  N  M ++            ++TDA++     G G  A++ D  G 
Subjt:  EEFFGEACIALWSIWNDRNNILHNIPI---PSWTQ-RCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGV

Query:  MLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALF
        ++ A   ++P   S + AE  A+  G++L  ++ IT A V SDS ++I+ +N +    +D    V DI+  R  F +L F F +RS N  A+ LAK +  
Subjt:  MLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALF

Query:  SQRSILWINDFP
        +Q+S +W +  P
Subjt:  SQRSILWINDFP

A0A803NML1 Uncharacterized protein; e-value 1.0e-13; identity 26.43%
Query:  QEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRG-------SKSSNHDMTIDKFVETEHVQ--------GVTLSTDAAIHPNALGS
        Q+  F      LW IW DRN ++H       T   ++   +++++ +        + +S H  +       +HVQ        G  L+ DAA +      
Subjt:  QEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRG-------SKSSNHDMTIDKFVETEHVQ--------GVTLSTDAAIHPNALGS

Query:  GYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDS---LNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSR
        G GA++ D  G +L AL   V  +F     E +A+ H +  + Q Q    H+ +D+    N +  +N +L   SD+   ++DIR    SF  +  + V R
Subjt:  GYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDS---LNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSR

Query:  SRNSRADRLAKDALFSQRSILWINDFP
        + N  A  LAK AL      +WI + P
Subjt:  SRNSRADRLAKDALFSQRSILWINDFP

A0A803P5M6 Uncharacterized protein; e-value 7.7e-14; identity 26.43%
Query:  QEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRG-------SKSSNHDMTIDKFVETEHVQ--------GVTLSTDAAIHPNALGS
        Q+  F      LW IW DRN ++H       T   ++   +++++ +        + +S+H  +       +HVQ        G  L+ DAA +      
Subjt:  QEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRG-------SKSSNHDMTIDKFVETEHVQ--------GVTLSTDAAIHPNALGS

Query:  GYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDS---LNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSR
        G GA++ D  G +L AL   V  +F     E +A+ H +  + Q Q    H+ +D+    N +  +N +L   SD+   ++DIR    SF  +  + V R
Subjt:  GYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDS---LNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSR

Query:  SRNSRADRLAKDALFSQRSILWINDFP
        + N  A  LAK AL      +WI + P
Subjt:  SRNSRADRLAKDALFSQRSILWINDFP

A0A803PQM1 Uncharacterized protein; e-value 1.5e-12; identity 27.75%
Query:  QEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSN--------HDMTIDKFV-ETEH---------VQGVTLSTDAAIHPNA
        QE+F    C+ LW IW DRN + H       +    +   + ++Y R    SN        H  T      +T H         + G  L+ DAA + + 
Subjt:  QEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSN--------HDMTIDKFV-ETEH---------VQGVTLSTDAAIHPNA

Query:  LGSGYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSR
           G GA++    G+++ AL   V  +F     E +A+ H L  + Q Q++  H+ +D+L V   +N      S     + DIR     F S+  S   R
Subjt:  LGSGYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSR

Query:  SRNSRADRLAKDALFSQRSILWINDFP
        S N  A  LAK AL     I W+ + P
Subjt:  SRNSRADRLAKDALFSQRSILWINDFP

SwissProt top hits (e-value, % identity, alignment)
P64956 Uncharacterized protein Mb2253c; e-value 2.4e-04; identity 29.23%
Query:  VTLSTDAAIHPNALGSGYGALV--LDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDIS-SDVYHWVLDIR
        V +  D     N   +GYGA+V   D S V+  + +     T +   AE + +I GL    +   T+A VL DS  V+  ++G   +   D+    +  +
Subjt:  VTLSTDAAIHPNALGSGYGALV--LDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDIS-SDVYHWVLDIR

Query:  GFRESFNSLTFSFVSRSRNSRADRLAKDAL
             F  + + +V R+RN+ ADRLA DA+
Subjt:  GFRESFNSLTFSFVSRSRNSRADRLAKDAL

P9WLH4 Uncharacterized protein MT2287; e-value 2.4e-04; identity 29.23%
Query:  VTLSTDAAIHPNALGSGYGALV--LDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDIS-SDVYHWVLDIR
        V +  D     N   +GYGA+V   D S V+  + +     T +   AE + +I GL    +   T+A VL DS  V+  ++G   +   D+    +  +
Subjt:  VTLSTDAAIHPNALGSGYGALV--LDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDIS-SDVYHWVLDIR

Query:  GFRESFNSLTFSFVSRSRNSRADRLAKDAL
             F  + + +V R+RN+ ADRLA DA+
Subjt:  GFRESFNSLTFSFVSRSRNSRADRLAKDAL

P9WLH5 Bifunctional protein Rv2228c; e-value 2.4e-04; identity 29.23%
Query:  VTLSTDAAIHPNALGSGYGALV--LDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDIS-SDVYHWVLDIR
        V +  D     N   +GYGA+V   D S V+  + +     T +   AE + +I GL    +   T+A VL DS  V+  ++G   +   D+    +  +
Subjt:  VTLSTDAAIHPNALGSGYGALV--LDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDIS-SDVYHWVLDIR

Query:  GFRESFNSLTFSFVSRSRNSRADRLAKDAL
             F  + + +V R+RN+ ADRLA DA+
Subjt:  GFRESFNSLTFSFVSRSRNSRADRLAKDAL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G04625.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e-value 2.4e-07; identity 30.33%
Query:  DAAIHPNALGSGYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNS
        D   H NA  S  G L+ D  G  LGA       T +P+ +E QA++  ++        K +   D+  V  ++NG    +  V++W+ DI  +R  F  
Subjt:  DAAIHPNALGSGYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNS

Query:  LTFSFVSRSRNSRADRLAKDAL
          F++  R  N  AD LAK  L
Subjt:  LTFSFVSRSRNSRADRLAKDAL

AT1G10000.1 Ribonuclease H-like superfamily protein; e-value 3.4e-06; identity 33.72%
Query:  SPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDAL
        SP+AAE  AI   +    Q + +   VLSDS +++  +N  + + ++++  +++IR  R  F S++F F+ R  NS AD  AK +L
Subjt:  SPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDAL

AT3G25270.1 Ribonuclease H-like superfamily protein; e-value 5.6e-09; identity 23.01%
Query:  QEEFFGEACIALWSIWNDRNNILHNIPIPSW----------TQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQG--VTLSTDAAIHPNALGSGYG
        Q + F  A   LW +W  RN ++      SW           Q  E   +Y +   +   SS H        + +      +  + D A +     +  G
Subjt:  QEEFFGEACIALWSIWNDRNNILHNIPIPSW----------TQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQG--VTLSTDAAIHPNALGSGYG

Query:  ALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRA
         L+ D +GV +G+ +    TT   + +E QA+I  ++        K     DS  V  ++N E  ++   ++W+ + R +++ F    F +V R+ N  A
Subjt:  ALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRA

Query:  DRLAKDALFSQRSILWINDFPPWLIS
        D LAK  L   +S  +    P ++ S
Subjt:  DRLAKDALFSQRSILWINDFPPWLIS

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e-value 4.8e-08; identity 32.56%
Query:  VTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFR
        VT+ TDAA        G+G ++ +   +     +        P+ AE  A+   L+  Q   ITK  + SDS  +I  I  E   S++ Y  + DI    
Subjt:  VTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVYVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFR

Query:  ESFNSLTFSFVSRSRNSRADRLAKDALFS
          F  ++FSFV RS N  AD LAK +L S
Subjt:  ESFNSLTFSFVSRSRNSRADRLAKDALFS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e-value 1.7e-05; identity 22.92%
Query:  LWSIWNDRNNILHNIPIPSWTQRCEWI---RSYWKEYRRGSKSSNHDMTIDKFVETEHV----QGVTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEV
        +W IW   N+++ N     +    E        W +    ++  N +   D    T+        +  + DA+ H     SG G ++ +  G ++     
Subjt:  LWSIWNDRNNILHNIPIPSWTQRCEWI---RSYWKEYRRGSKSSNHDMTIDKFVETEHV----QGVTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEV

Query:  YVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDAL
              +   AEC  +I  ++    F   K     D+  + RMIN +   +  + H++  I+ +  SF S+ FSF  R +N  AD LAK A+
Subjt:  YVPTTFSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDAL


Sequences
CDS sequence
ATGCAAGAGGAATTTTTTGGGGAAGCTTGTATTGCTTTATGGTCCATTTGGAATGATCGGAACAATATTCTCCACAACATTCCTATTCCAAGTTGGACTCAACGTTGTGA
ATGGATTCGTAGCTATTGGAAGGAATATAGACGCGGAAGCAAAAGCTCCAATCACGACATGACTATAGATAAGTTTGTCGAAACAGAACATGTACAAGGAGTGACTTTGT
CAACGGACGCTGCTATTCATCCGAATGCACTAGGATCGGGTTATGGGGCTTTGGTTTTGGATGTTTCTGGTGTTATGTTGGGTGCTCTTGAAGTTTATGTTCCTACCACT
TTTTCTCCAATAGCAGCAGAATGCCAAGCTATTATCCATGGGCTAAAGCTATTGCAGCAATTTCAAATTACGAAAGCTCATGTTCTGTCAGATTCGCTCAATGTCATTCG
AATGATTAATGGAGAGTTGGACATTTCCTCGGACGTCTATCACTGGGTTCTAGATATCAGAGGCTTTAGAGAAAGTTTTAATTCTTTGACTTTTTCTTTTGTTTCTCGAT
CTAGGAATTCCAGAGCTGATCGTCTCGCTAAAGACGCCTTGTTCTCTCAAAGGTCCATTTTGTGGATAAATGATTTCCCACCATGGTTAATCTCAATGAGTAATGTTGCT
CATTGCAAATATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCTATGCTGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCG
ACTTCTGTTGAGTTATTTTCGGGATAAAGGAGCAAGGAGAGCCCTACACGTGTCCAAGTTGACCTAA
mRNA sequence
ATGCAAGAGGAATTTTTTGGGGAAGCTTGTATTGCTTTATGGTCCATTTGGAATGATCGGAACAATATTCTCCACAACATTCCTATTCCAAGTTGGACTCAACGTTGTGA
ATGGATTCGTAGCTATTGGAAGGAATATAGACGCGGAAGCAAAAGCTCCAATCACGACATGACTATAGATAAGTTTGTCGAAACAGAACATGTACAAGGAGTGACTTTGT
CAACGGACGCTGCTATTCATCCGAATGCACTAGGATCGGGTTATGGGGCTTTGGTTTTGGATGTTTCTGGTGTTATGTTGGGTGCTCTTGAAGTTTATGTTCCTACCACT
TTTTCTCCAATAGCAGCAGAATGCCAAGCTATTATCCATGGGCTAAAGCTATTGCAGCAATTTCAAATTACGAAAGCTCATGTTCTGTCAGATTCGCTCAATGTCATTCG
AATGATTAATGGAGAGTTGGACATTTCCTCGGACGTCTATCACTGGGTTCTAGATATCAGAGGCTTTAGAGAAAGTTTTAATTCTTTGACTTTTTCTTTTGTTTCTCGAT
CTAGGAATTCCAGAGCTGATCGTCTCGCTAAAGACGCCTTGTTCTCTCAAAGGTCCATTTTGTGGATAAATGATTTCCCACCATGGTTAATCTCAATGAGTAATGTTGCT
CATTGCAAATATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCTATGCTGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCG
ACTTCTGTTGAGTTATTTTCGGGATAAAGGAGCAAGGAGAGCCCTACACGTGTCCAAGTTGACCTAA
Protein sequence
MQEEFFGEACIALWSIWNDRNNILHNIPIPSWTQRCEWIRSYWKEYRRGSKSSNHDMTIDKFVETEHVQGVTLSTDAAIHPNALGSGYGALVLDVSGVMLGALEVYVPTT
FSPIAAECQAIIHGLKLLQQFQITKAHVLSDSLNVIRMINGELDISSDVYHWVLDIRGFRESFNSLTFSFVSRSRNSRADRLAKDALFSQRSILWINDFPPWLISMSNVA
HCKYYAAERLEGANSMLQQNWEQKLPHHSSLANFMNRLLLSYFRDKGARRALHVSKLT
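
As a quick consistency check on the sequences listed for Lag0041154, the sketch below translates the first ten codons of the CDS with the standard genetic code and compares the result with the start of the protein sequence. The function name `translate` and the compact codon-table encoding are illustrative assumptions and are not part of CuGenDB. Only a short prefix of each sequence is used here.

```python
# Standard genetic code in compact form (assumed here; nuclear genes normally use it).
# Codons are ordered T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS and first 10 residues of the protein, as listed.
cds_start = "ATGCAAGAGGAATTTTTTGGGGAAGCTTGT"
protein_start = "MQEEFFGEAC"
print(translate(cds_start) == protein_start)  # prints True
```

Passing the full CDS to the same function would allow the whole protein sequence to be checked in the same way.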