; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr1:1241065..1243683
RNA-Seq ExpressionMoc01g01840
SyntenyMoc01g01840
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]5.6e-8163.18Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK
        MTIGERIEYG++H R+TS   E  A KKAS  KKKE EVQMV ADRHSWKQQPY RT +Y+ YYYPTPY YNQP+VNNATSH+ P   + F    SQ+ +
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK

Query:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ
            + N       H+   T  G  NNRGARKQTQF+PIPMT T+LLPQLFQNNQL PVPVDPIQPPYPRWYDAN RCDYHAGAI HSTEN T LKYRVQ
Subjt:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ

Query:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL
         LIKAGW NFKKENG DV+   L  HQN QINA+    IE + +   +  P    F++
Subjt:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]2.1e-2760.68Show/hide
Query:  HQN-QQINAVTAVIEEREQTGPLVFPCPNGFKLDNWSVLEIPSF------------------ELDTPIYNVNSDEKMDDEPSIELLRMLEEEEKMLGPRE
        HQ   + + V AV EEREQ GP V+ CP+GF+L NWSV+++PSF                  ELDTPIY + SDE++DDEPS ELLRMLEEEEKMLGP E
Subjt:  HQN-QQINAVTAVIEEREQTGPLVFPCPNGFKLDNWSVLEIPSF------------------ELDTPIYNVNSDEKMDDEPSIELLRMLEEEEKMLGPRE

Query:  ELTETVNLGSQAEAKEL
        ELTET+NLGSQAEAKE+
Subjt:  ELTETVNLGSQAEAKEL

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]7.9e-7559.53Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNAT------SHFSPMPLKIFDSQ
        +TIGERIEYG+ H R+TS   E+S  K  +S KKKE EVQMV ADRH W+Q PYG+T  YA YYYP+PY YNQPYVN AT       +F P   + F  +
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNAT------SHFSPMPLKIFDSQ

Query:  HVKTSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQT
          +       N +++      G  NNR  RKQ+QF+PIPMT T+LLPQLFQNNQL PVPVDPIQPPYP WYDAN RCDYHAGAI HSTEN TALKYRVQ 
Subjt:  HVKTSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQT

Query:  LIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL
        LIKAG L FKKEN PDV NNPLP H+N QINAV    IE R +   +  P    F++
Subjt:  LIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]1.8e-0277.78Show/hide
Query:  SIELLRMLEEEEKMLGPREELTETVNLGSQAEAKEL
        S ELLRMLEEEEK LGP  E TE VNLGSQAE KE+
Subjt:  SIELLRMLEEEEKMLGPREELTETVNLGSQAEAKEL

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]5.7e-4143.1Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIFDSQHVKTSN
        + IGERIEYGIKH RL   ++E    KK ++ KKKE EV  +        +  +G+      +        + PY N   +H      K+ +S   +   
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIFDSQHVKTSN

Query:  LGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKAGW
         G  +  +S T                +F+PIPMT T+LLPQL QN QL P+P++PIQPPYP+WYD N RCDYHAG + HSTEN  ALK +VQ+LI AGW
Subjt:  LGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKAGW

Query:  LNFKKE-NGPDVNNNPLPKHQNQQINAVTAVI
        L+FKK    PDVNNNPLP H+N ++NA+   +
Subjt:  LNFKKE-NGPDVNNNPLPKHQNQQINAVTAVI

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]8.4e-8565.12Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK
        MTIGERIEYG++H R+TS   E  A KKAS  KKKE EVQMV ADRHSWKQQPY RT RY  YYYPTPY YNQP+VNNATSH+SP   + F    SQ+ +
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK

Query:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ
            + N       H+   T      NRGARKQTQF+PIPMT T+LLPQLFQNNQL PVPVDPIQPPYPRWYD N RCDYHAGAI HSTEN TALKYRVQ
Subjt:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ

Query:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTA-VIEEREQTGPLVFPCPNGFKL
         LIKAGWLNFKKENGPDV+ NPLP HQN QINA+    IE + +   +  P    F++
Subjt:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTA-VIEEREQTGPLVFPCPNGFKL

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]1.7e-4042.67Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIFDSQHVKTSN
        + IGERIEYGIKH RL   ++E    KK ++ KKKE EV  +        +  +G+      +        + PY N   +H      K+ +S   +   
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIFDSQHVKTSN

Query:  LGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKAGW
         G  +  +S T                +F+PIPMT T+LLPQL  N QL P+P++PIQPPYP+WYD N RCDYHAG + HSTEN  ALK +VQ+LI AGW
Subjt:  LGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKAGW

Query:  LNFKKE-NGPDVNNNPLPKHQNQQINAVTAVI
        L+FKK    PDVNNNPLP H+N ++NA+   +
Subjt:  LNFKKE-NGPDVNNNPLPKHQNQQINAVTAVI

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]2.0e-0935.21Show/hide
Query:  AVTAVIEEREQTGPLVFPCPNGFKLDNWSVL---------EIPSF----------------ELDTPIYNVNSDEKMDDEP----SIELLRMLEEEEKMLG
        ++ AV +E       V+ CP  F+L+NW V          + P+F                 LDT IY + SD++ DDE     S ELLR++EEE+K+LG
Subjt:  AVTAVIEEREQTGPLVFPCPNGFKLDNWSVL---------EIPSF----------------ELDTPIYNVNSDEKMDDEP----SIELLRMLEEEEKMLG

Query:  PREELTETVNLGSQAEAKELLRKNNDGVWSEDCQAAFNRIKD
        P +EL E +NLGSQ E+KE+  K    + SE  +   N +++
Subjt:  PREELTETVNLGSQAEAKELLRKNNDGVWSEDCQAAFNRIKD

TrEMBL top hitse value%identityAlignment
A0A5A7ULC1 Uncharacterized protein2.3e-4042.51Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQP-YVNNATSHFSPMPLKIFDSQH-VKT
        + IGERIEYGIKH RL    +E    KK +  KKKE EV ++           +    ++   +    YE N P Y++N     S +P   +   H V  
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQP-YVNNATSHFSPMPLKIFDSQH-VKT

Query:  SNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKA
        +   VN+          G   N       +F+PIPMT T+LLPQL QN QL P+P+ PIQPPYP+WYD+N RCDYHAG + HSTEN  ALK  VQ+LI  
Subjt:  SNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKA

Query:  GWLNFKK-ENGPDVNNNPLPKHQNQQINAVTAVIEE-REQTGPLVFP
        GWL+FKK    P+VN NPLP H+N ++NAV +++E+ + +   +V P
Subjt:  GWLNFKK-ENGPDVNNNPLPKHQNQQINAVTAVIEE-REQTGPLVFP

A0A6J1D099 Ribonuclease H2.1e-8163.18Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK
        MTIGERIEYG++H R+TS   E  A KKAS  KKKE EVQMV ADRHSWKQQPY RT +Y+ YYYPTPY YNQP+VNNATSH+ P   + F    SQ+ +
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK

Query:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ
            + N       H+   T  G  NNRGARKQTQF+PIPMT T+LLPQLFQNNQL PVPVDPIQPPYPRWYDAN RCDYHAGAI HSTEN T LKYRVQ
Subjt:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ

Query:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL
         LIKAGW NFKKENG DV+   L  HQN QINA+    IE + +   +  P    F++
Subjt:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL

A0A6J1D099 Ribonuclease H1.0e-2760.68Show/hide
Query:  HQN-QQINAVTAVIEEREQTGPLVFPCPNGFKLDNWSVLEIPSF------------------ELDTPIYNVNSDEKMDDEPSIELLRMLEEEEKMLGPRE
        HQ   + + V AV EEREQ GP V+ CP+GF+L NWSV+++PSF                  ELDTPIY + SDE++DDEPS ELLRMLEEEEKMLGP E
Subjt:  HQN-QQINAVTAVIEEREQTGPLVFPCPNGFKLDNWSVLEIPSF------------------ELDTPIYNVNSDEKMDDEPSIELLRMLEEEEKMLGPRE

Query:  ELTETVNLGSQAEAKEL
        ELTET+NLGSQAEAKE+
Subjt:  ELTETVNLGSQAEAKEL

A0A6J1D099 Ribonuclease H3.8e-7559.53Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNAT------SHFSPMPLKIFDSQ
        +TIGERIEYG+ H R+TS   E+S  K  +S KKKE EVQMV ADRH W+Q PYG+T  YA YYYP+PY YNQPYVN AT       +F P   + F  +
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNAT------SHFSPMPLKIFDSQ

Query:  HVKTSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQT
          +       N +++      G  NNR  RKQ+QF+PIPMT T+LLPQLFQNNQL PVPVDPIQPPYP WYDAN RCDYHAGAI HSTEN TALKYRVQ 
Subjt:  HVKTSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQT

Query:  LIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL
        LIKAG L FKKEN PDV NNPLP H+N QINAV    IE R +   +  P    F++
Subjt:  LIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTAV-IEEREQTGPLVFPCPNGFKL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222318.7e-0377.78Show/hide
Query:  SIELLRMLEEEEKMLGPREELTETVNLGSQAEAKEL
        S ELLRMLEEEEK LGP  E TE VNLGSQAE KE+
Subjt:  SIELLRMLEEEEKMLGPREELTETVNLGSQAEAKEL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222318.0e-4142.51Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQP-YVNNATSHFSPMPLKIFDSQH-VKT
        + IGERIEYGIKH RL    +E    KK +  KKKE E+  +           +  + ++   +    YE N P Y++N     S +P   +   H V  
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQP-YVNNATSHFSPMPLKIFDSQH-VKT

Query:  SNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKA
        +   VN+          G   N       +F+PIPMT T+LLPQL QN QL P+P+ PIQPPYP+WYD+N RCDYHAG + HSTEN  ALK  VQ+LI A
Subjt:  SNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKA

Query:  GWLNFKKE-NGPDVNNNPLPKHQNQQINAVTAVIEE-REQTGPLVFP
        GWL+FKK    P+VN NPLP H+N ++NAV +++E+ + +   +V P
Subjt:  GWLNFKKE-NGPDVNNNPLPKHQNQQINAVTAVIEE-REQTGPLVFP

A0A6J1E2J7 Ribonuclease H4.1e-8565.12Show/hide
Query:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK
        MTIGERIEYG++H R+TS   E  A KKAS  KKKE EVQMV ADRHSWKQQPY RT RY  YYYPTPY YNQP+VNNATSH+SP   + F    SQ+ +
Subjt:  MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIF---DSQHVK

Query:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ
            + N       H+   T      NRGARKQTQF+PIPMT T+LLPQLFQNNQL PVPVDPIQPPYPRWYD N RCDYHAGAI HSTEN TALKYRVQ
Subjt:  ----TSNLGVNNIIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQ

Query:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTA-VIEEREQTGPLVFPCPNGFKL
         LIKAGWLNFKKENGPDV+ NPLP HQN QINA+    IE + +   +  P    F++
Subjt:  TLIKAGWLNFKKENGPDVNNNPLPKHQNQQINAVTA-VIEEREQTGPLVFPCPNGFKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATCGGAGAAAGAATCGAGTACGGTATCAAGCATCGTCGATTAACTAGTATTGCTAGTGAGACATCGGCCACGAAAAAGGCAAGTTCTTTGAAGAAGAAG
GAGGATGAGGTGCAAATGGTAAGAGCAGACCGACACTCTTGGAAACAACAACCGTATGGTCGGACAGCGCGATATGCTCTATATTATTACCCAACGCCATACGAG
TATAATCAACCATATGTTAATAATGCAACTTCACATTTCTCTCCTATGCCTCTCAAAATTTTCGACTCCCAGCATGTCAAAACTTCCAACCTAGGCGTCAACAAC
ATAATACATTCTATGACTTGTACTGACCTTGGGCATCACAATAACAGAGGAGCACGTAAACAGACTCAGTTCAACCCAATCCCTATGACTTGTACTGACCTTTTG
CCTCAGTTATTTCAAAATAATCAACTAACACCGGTACCTGTGGATCCAATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGTCCGTTGTGACTATCACGCA
GGAGCTATAGAGCATTCAACAGAGAACTACACCGCACTTAAATACAGAGTTCAAACTTTAATCAAGGCAGGATGGTTGAACTTTAAAAAAGAAAATGGACCTGAT
GTCAATAACAATCCTTTGCCAAAGCATCAGAACCAACAAATAAATGCAGTTACAGCCGTGATAGAAGAAAGAGAGCAAACTGGTCCTTTAGTCTTCCCGTGTCCA
AATGGTTTCAAGCTGGACAATTGGAGCGTGTTAGAGATACCATCTTTTGAGCTTGATACACCTATATACAACGTCAATTCTGATGAGAAAATGGATGATGAGCCC
TCTATTGAGTTATTGAGAATGTTAGAAGAAGAAGAAAAGATGTTGGGACCTCGTGAGGAATTAACTGAGACTGTTAACTTGGGGTCACAAGCGGAGGCCAAAGAG
TTGCTCCGCAAGAACAATGATGGGGTATGGAGTGAAGATTGTCAAGCAGCATTTAATAGGATTAAGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAAT
TCTATCAACACATGGGCGGAACTGACGGAGAAATTTTTGGCAAAGTACCATACTTTGACTAGGAACGCAGACCTTCGAGAGGACATTGTGTCTTTTAGACAGAAA
GAGAACGAAGCAGATCAAGAAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATCGGAGAAAGAATCGAGTACGGTATCAAGCATCGTCGATTAACTAGTATTGCTAGTGAGACATCGGCCACGAAAAAGGCAAGTTCTTTGAAGAAGAAG
GAGGATGAGGTGCAAATGGTAAGAGCAGACCGACACTCTTGGAAACAACAACCGTATGGTCGGACAGCGCGATATGCTCTATATTATTACCCAACGCCATACGAG
TATAATCAACCATATGTTAATAATGCAACTTCACATTTCTCTCCTATGCCTCTCAAAATTTTCGACTCCCAGCATGTCAAAACTTCCAACCTAGGCGTCAACAAC
ATAATACATTCTATGACTTGTACTGACCTTGGGCATCACAATAACAGAGGAGCACGTAAACAGACTCAGTTCAACCCAATCCCTATGACTTGTACTGACCTTTTG
CCTCAGTTATTTCAAAATAATCAACTAACACCGGTACCTGTGGATCCAATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGTCCGTTGTGACTATCACGCA
GGAGCTATAGAGCATTCAACAGAGAACTACACCGCACTTAAATACAGAGTTCAAACTTTAATCAAGGCAGGATGGTTGAACTTTAAAAAAGAAAATGGACCTGAT
GTCAATAACAATCCTTTGCCAAAGCATCAGAACCAACAAATAAATGCAGTTACAGCCGTGATAGAAGAAAGAGAGCAAACTGGTCCTTTAGTCTTCCCGTGTCCA
AATGGTTTCAAGCTGGACAATTGGAGCGTGTTAGAGATACCATCTTTTGAGCTTGATACACCTATATACAACGTCAATTCTGATGAGAAAATGGATGATGAGCCC
TCTATTGAGTTATTGAGAATGTTAGAAGAAGAAGAAAAGATGTTGGGACCTCGTGAGGAATTAACTGAGACTGTTAACTTGGGGTCACAAGCGGAGGCCAAAGAG
TTGCTCCGCAAGAACAATGATGGGGTATGGAGTGAAGATTGTCAAGCAGCATTTAATAGGATTAAGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAAT
TCTATCAACACATGGGCGGAACTGACGGAGAAATTTTTGGCAAAGTACCATACTTTGACTAGGAACGCAGACCTTCGAGAGGACATTGTGTCTTTTAGACAGAAA
GAGAACGAAGCAGATCAAGAAGCTTGA
Protein sequenceShow/hide protein sequence
MTIGERIEYGIKHRRLTSIASETSATKKASSLKKKEDEVQMVRADRHSWKQQPYGRTARYALYYYPTPYEYNQPYVNNATSHFSPMPLKIFDSQHVKTSNLGVNN
IIHSMTCTDLGHHNNRGARKQTQFNPIPMTCTDLLPQLFQNNQLTPVPVDPIQPPYPRWYDANVRCDYHAGAIEHSTENYTALKYRVQTLIKAGWLNFKKENGPD
VNNNPLPKHQNQQINAVTAVIEEREQTGPLVFPCPNGFKLDNWSVLEIPSFELDTPIYNVNSDEKMDDEPSIELLRMLEEEEKMLGPREELTETVNLGSQAEAKE
LLRKNNDGVWSEDCQAAFNRIKDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEADQEA