CuGenDBv2

Moc04g29030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g29030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Genome location: chr4:21523149..21524495
RNA-Seq Expression: Moc04g29030
Synteny: Moc04g29030
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
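The genome location above is given as a `chromosome:start..end` string. A minimal sketch of how such a string could be parsed and its span computed is shown here; the function names are hypothetical, and the arithmetic assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

def span_length(location: str) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1

if __name__ == "__main__":
    loc = "chr4:21523149..21524495"
    print(parse_location(loc))
    print(span_length(loc))
```

Under that assumption the gene spans 1,347 bp on chr4.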


Homology
GenBank top hits (e value, % identity, alignment):
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] (e value: 2.4e-104; % identity: 68.99)
Query:  MRHRLRTMEEMYAEATRANRTASPS---IAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSK
        MR ++ TME+MY+E  +A    S S   +A     E+    + P  +         +Y+ +  DLR+HL  K+  + R+   SPS S    NS+ +A+S 
Subjt:  MRHRLRTMEEMYAEATRANRTASPS---IAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSK

Query:  YKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
        Y P+ PE VITREEFD +K KFD QVEALKA+CEKK+ SFDDG LGES FT+DILEA IP KFKTP +KPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
Subjt:  YKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR

Query:  AFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLG
         FQIALTGSA LWYRRL ARSISTYSQLRKEFI QFSSRHYDRKTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD S MCYFLT LADETLTVKL 
Subjt:  AFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLG

Query:  EEAPATFAEVLQKAKK
        EEAPATF EVLQKAKK
Subjt:  EEAPATFAEVLQKAKK

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia] (e value: 9.2e-104; % identity: 70.36)
Query:  DYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILE
        +Y+ +  DLR+HL  K+  + R+   SPS S    NS+ +A+S Y P+ P+ VITREEFD +K KFD QVEALKA CEKK+ SFDDG LGE  FT DILE
Subjt:  DYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILE

Query:  APIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKE
        API PKFKTP +KPYDGSK+PKDYV+VFEGLM+FQAATDAIKCRAFQIA TGSA LWYRRL ARSISTYSQLRKEFISQFSSR+YDRKTATHLATIRQK+
Subjt:  APIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKE

Query:  GETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLADLRSRSIR--KKLARRRGGLMSG---P
        GETLREYVTRFQEEQLKVAHCSDDS MCYFLTGLAD+TLTVKLGEEAPATFAEVLQKAKK +  +         LR+++ R  KK+ +RR G   G    
Subjt:  GETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLADLRSRSIR--KKLARRRGGLMSG---P

Query:  KIRDHPP
        K RD  P
Subjt:  KIRDHPP

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] (e value: 1.0e-110; % identity: 71.34)
Query:  PGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKGAPSIQPG+RE IPND GVDYSLRDNDLRKHLT+KKK+AS E EDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRK
        ARCEKK+  FDD  LGES FT+DI+EAPIPPKFKTP +KPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSA LW RRL ARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLAD
        EFI QFS RHYDRKTATHLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKK +  +     +   
Subjt:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLAD

Query:  LRSRSIRKKLARRR
           +  +K+L++++
Subjt:  LRSRSIRKKLARRR

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia] (e value: 3.0e-102; % identity: 66.14)
Query:  LDDMRHRLRTMEEMY---AEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKA
        ++ MR ++RTMEEMY    +   A   +   +      E+G     P D E +             DLR HL  K+  + R    S  + +   NS+ +A
Subjt:  LDDMRHRLRTMEEMY---AEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKA

Query:  QSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAI
        +S Y P+APE VITREEF+ +K KFD QVEALK RCEKK+ +FDDG LGES FT+DILEA IPPKFKTP +K YDGSKDPKDYVEVFEGLMDFQAATDAI
Subjt:  QSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAI

Query:  KCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTV
        KCRAFQIALTGSA LWYRRL ARSISTYSQLRKEFISQF SRHYDRKT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADET TV
Subjt:  KCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTV

Query:  KLGEEAPATFAEVLQKAKK
        KLGEEA ATFAEVLQ  KK
Subjt:  KLGEEAPATFAEVLQKAKK

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] (e value: 2.4e-112; % identity: 57.63)
Query:  MVHPANSANTTEQR------------GARIVGDQVRARQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANQEADPEVLSTLQRELDDMR
        MV P +S NT ++R            GA +V  Q+      +   RSAR    +L PAHPKP KANRGRGG SR+T+  A      E    LQ+E++ MR
Subjt:  MVHPANSANTTEQR------------GARIVGDQVRARQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANQEADPEVLSTLQRELDDMR

Query:  HRLRTMEEMYAEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLA
         ++ TMEEMY E  +A    S S    A  E+G                         DLR HL+ K+  + R+   SPS S +  NS+ +A+S Y P+ 
Subjt:  HRLRTMEEMYAEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLA

Query:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        PE VITREEFD +K KFD QVE LKARCE K  +FDDG LGES FT+DILEA IP KFKTP +KPYDGSKDPKDYVEVFEGLM FQAATDAIK RAFQIA
Subjt:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPA
        LT SA LWYRRL ARSISTYSQLRKEF SQFSSRHY+RKTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDS +CYFLT L DETLTVKLGEEAPA
Subjt:  LTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPA

Query:  TFAEVLQKAKKSLMDRSYFEPRLADLRSRSIRKKLARRR
        TFAEVLQKAKK +  +  F  +      +  +KK ++ +
Subjt:  TFAEVLQKAKKSLMDRSYFEPRLADLRSRSIRKKLARRR

TrEMBL top hits (e value, % identity, alignment):
A0A6J1DDW5 uncharacterized protein LOC111019634 (e value: 1.2e-104; % identity: 68.99)
Query:  MRHRLRTMEEMYAEATRANRTASPS---IAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSK
        MR ++ TME+MY+E  +A    S S   +A     E+    + P  +         +Y+ +  DLR+HL  K+  + R+   SPS S    NS+ +A+S 
Subjt:  MRHRLRTMEEMYAEATRANRTASPS---IAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSK

Query:  YKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
        Y P+ PE VITREEFD +K KFD QVEALKA+CEKK+ SFDDG LGES FT+DILEA IP KFKTP +KPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
Subjt:  YKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR

Query:  AFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLG
         FQIALTGSA LWYRRL ARSISTYSQLRKEFI QFSSRHYDRKTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD S MCYFLT LADETLTVKL 
Subjt:  AFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLG

Query:  EEAPATFAEVLQKAKK
        EEAPATF EVLQKAKK
Subjt:  EEAPATFAEVLQKAKK

A0A6J1DM55 uncharacterized protein LOC111022267 (e value: 4.4e-104; % identity: 70.36)
Query:  DYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILE
        +Y+ +  DLR+HL  K+  + R+   SPS S    NS+ +A+S Y P+ P+ VITREEFD +K KFD QVEALKA CEKK+ SFDDG LGE  FT DILE
Subjt:  DYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILE

Query:  APIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKE
        API PKFKTP +KPYDGSK+PKDYV+VFEGLM+FQAATDAIKCRAFQIA TGSA LWYRRL ARSISTYSQLRKEFISQFSSR+YDRKTATHLATIRQK+
Subjt:  APIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKE

Query:  GETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLADLRSRSIR--KKLARRRGGLMSG---P
        GETLREYVTRFQEEQLKVAHCSDDS MCYFLTGLAD+TLTVKLGEEAPATFAEVLQKAKK +  +         LR+++ R  KK+ +RR G   G    
Subjt:  GETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLADLRSRSIR--KKLARRRGGLMSG---P

Query:  KIRDHPP
        K RD  P
Subjt:  KIRDHPP

A0A6J1DPC9 uncharacterized protein LOC111022280 (e value: 4.9e-111; % identity: 71.34)
Query:  PGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKGAPSIQPG+RE IPND GVDYSLRDNDLRKHLT+KKK+AS E EDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRK
        ARCEKK+  FDD  LGES FT+DI+EAPIPPKFKTP +KPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSA LW RRL ARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLAD
        EFI QFS RHYDRKTATHLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKK +  +     +   
Subjt:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLAD

Query:  LRSRSIRKKLARRR
           +  +K+L++++
Subjt:  LRSRSIRKKLARRR

A0A6J1DPN4 uncharacterized protein LOC111023060 (e value: 1.4e-102; % identity: 66.14)
Query:  LDDMRHRLRTMEEMY---AEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKA
        ++ MR ++RTMEEMY    +   A   +   +      E+G     P D E +             DLR HL  K+  + R    S  + +   NS+ +A
Subjt:  LDDMRHRLRTMEEMY---AEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKA

Query:  QSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAI
        +S Y P+APE VITREEF+ +K KFD QVEALK RCEKK+ +FDDG LGES FT+DILEA IPPKFKTP +K YDGSKDPKDYVEVFEGLMDFQAATDAI
Subjt:  QSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAI

Query:  KCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTV
        KCRAFQIALTGSA LWYRRL ARSISTYSQLRKEFISQF SRHYDRKT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADET TV
Subjt:  KCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTV

Query:  KLGEEAPATFAEVLQKAKK
        KLGEEA ATFAEVLQ  KK
Subjt:  KLGEEAPATFAEVLQKAKK

A0A6J1DZJ1 uncharacterized protein LOC111025738 (e value: 1.2e-112; % identity: 57.63)
Query:  MVHPANSANTTEQR------------GARIVGDQVRARQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANQEADPEVLSTLQRELDDMR
        MV P +S NT ++R            GA +V  Q+      +   RSAR    +L PAHPKP KANRGRGG SR+T+  A      E    LQ+E++ MR
Subjt:  MVHPANSANTTEQR------------GARIVGDQVRARQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANQEADPEVLSTLQRELDDMR

Query:  HRLRTMEEMYAEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLA
         ++ TMEEMY E  +A    S S    A  E+G                         DLR HL+ K+  + R+   SPS S +  NS+ +A+S Y P+ 
Subjt:  HRLRTMEEMYAEATRANRTASPSIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLA

Query:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        PE VITREEFD +K KFD QVE LKARCE K  +FDDG LGES FT+DILEA IP KFKTP +KPYDGSKDPKDYVEVFEGLM FQAATDAIK RAFQIA
Subjt:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPA
        LT SA LWYRRL ARSISTYSQLRKEF SQFSSRHY+RKTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDS +CYFLT L DETLTVKLGEEAPA
Subjt:  LTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPA

Query:  TFAEVLQKAKKSLMDRSYFEPRLADLRSRSIRKKLARRR
        TFAEVLQKAKK +  +  F  +      +  +KK ++ +
Subjt:  TFAEVLQKAKKSLMDRSYFEPRLADLRSRSIRKKLARRR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTGCATCCAGCAAACTCTGCCAATACGACAGAGCAGAGGGGTGCAAGAATAGTCGGGGACCAGGTCCGAGCAAGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCG
CCATGCGAATCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGTCGAGGTGGGACGTCGAGAAAGACCTCCCAAAGGGCCAATCAGGAAGCGGACC
CCGAAGTTCTGTCTACTCTCCAGCGCGAATTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCATCTCCC
TCTATAGCTCCGGGCGCACCCGGAGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCTCATTCCCAACGATAGAGGGGTGGATTACAGCTTGCGGGATAACGA
TCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCTAGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGGACCTAAAGGCTCAATCAAAAT
ACAAGCCTCTAGCACCAGAAGCTGTGATCACTAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAAGCGCTTAAGGCCAGGTGTGAGAAGAAAGAC
TGTTCGTTCGACGATGGCCACTTGGGAGAATCGTCATTCACCGCGGACATCTTGGAGGCTCCAATCCCTCCGAAATTCAAGACTCCCGCCATAAAGCCCTATGACGGGTC
TAAGGACCCTAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGACTTTCAAGCAGCGACAGATGCGATCAAGTGCCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGT
GCCTTTGGTACCGAAGACTGTCGGCCAGGTCGATCTCGACCTACTCCCAGCTAAGGAAGGAGTTCATCAGTCAGTTCTCCTCTAGGCATTATGACAGAAAGACAGCGACT
CACCTTGCCACCATCAGGCAAAAAGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAGGAGCAGCTTAAGGTGGCGCACTGCTCCGATGATTCGACCATGTG
CTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACGGTAAAGCTTGGAGAAGAGGCTCCAGCCACCTTCGCTGAAGTATTGCAGAAAGCGAAAAAGTCATTGATGGACA
GGAGCTACTTCGAACCAAGACTGGCCGACCTGAGAAGCAGATCGATCAGAAAAAAATTAGCCAGGAGAAGAGGAGGATTGATGTCAGGTCCAAAGATAAGGGATCATCCT
CCTCCAATAGCAGAACAGAGTACGTAG
mRNA sequence
ATGGTGCATCCAGCAAACTCTGCCAATACGACAGAGCAGAGGGGTGCAAGAATAGTCGGGGACCAGGTCCGAGCAAGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCG
CCATGCGAATCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGTCGAGGTGGGACGTCGAGAAAGACCTCCCAAAGGGCCAATCAGGAAGCGGACC
CCGAAGTTCTGTCTACTCTCCAGCGCGAATTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCATCTCCC
TCTATAGCTCCGGGCGCACCCGGAGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCTCATTCCCAACGATAGAGGGGTGGATTACAGCTTGCGGGATAACGA
TCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCTAGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGGACCTAAAGGCTCAATCAAAAT
ACAAGCCTCTAGCACCAGAAGCTGTGATCACTAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAAGCGCTTAAGGCCAGGTGTGAGAAGAAAGAC
TGTTCGTTCGACGATGGCCACTTGGGAGAATCGTCATTCACCGCGGACATCTTGGAGGCTCCAATCCCTCCGAAATTCAAGACTCCCGCCATAAAGCCCTATGACGGGTC
TAAGGACCCTAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGACTTTCAAGCAGCGACAGATGCGATCAAGTGCCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGT
GCCTTTGGTACCGAAGACTGTCGGCCAGGTCGATCTCGACCTACTCCCAGCTAAGGAAGGAGTTCATCAGTCAGTTCTCCTCTAGGCATTATGACAGAAAGACAGCGACT
CACCTTGCCACCATCAGGCAAAAAGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAGGAGCAGCTTAAGGTGGCGCACTGCTCCGATGATTCGACCATGTG
CTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACGGTAAAGCTTGGAGAAGAGGCTCCAGCCACCTTCGCTGAAGTATTGCAGAAAGCGAAAAAGTCATTGATGGACA
GGAGCTACTTCGAACCAAGACTGGCCGACCTGAGAAGCAGATCGATCAGAAAAAAATTAGCCAGGAGAAGAGGAGGATTGATGTCAGGTCCAAAGATAAGGGATCATCCT
CCTCCAATAGCAGAACAGAGTACGTAG
Protein sequence
MVHPANSANTTEQRGARIVGDQVRARQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANQEADPEVLSTLQRELDDMRHRLRTMEEMYAEATRANRTASP
SIAPGAPGEKGAPSIQPGDRELIPNDRGVDYSLRDNDLRKHLTEKKKRASRELEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKD
CSFDDGHLGESSFTADILEAPIPPKFKTPAIKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSACLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTAT
HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSTMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKSLMDRSYFEPRLADLRSRSIRKKLARRRGGLMSGPKIRDHP
PPIAEQST
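The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation is given below, assuming the standard genetic code (NCBI translation table 1); the `translate` helper and the choice to stop at the first stop codon are illustrative and are not taken from the database. The two inputs are the first 30 and last 27 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

if __name__ == "__main__":
    cds_start = "ATGGTGCATCCAGCAAACTCTGCCAATACG"  # first 30 nt of the CDS
    cds_end = "CCTCCAATAGCAGAACAGAGTACGTAG"       # last 27 nt of the CDS
    print(translate(cds_start))
    print(translate(cds_end))
```

The two fragments translate to the first ten and last eight residues of the protein sequence, with the terminal TAG read as the stop codon.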