; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:13785527..13786943
RNA-Seq ExpressionMoc08g18220
SyntenyMoc08g18220
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]3.4e-8960Show/hide
Query:  DDMRHRRTRREGRWVPSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDE
        +DM  +R    G  V   HP         E  +Y+ +  DLR+HL ++K+ +S     SPS S    NS+ +A+S Y   TPE VITREEFD +K +FD 
Subjt:  DDMRHRRTRREGRWVPSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDE

Query:  QVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIST
        QVEALKA+                                            PKDYVEVFEGLMDFQAATDAIKCR FQIALTGSARLWYRRLPARSIST
Subjt:  QVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIST

Query:  YSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELL
        YSQLRKEFI Q SS HYDRKTATHL TIRQKEGETLREYVTRFQEEQLKV HCSD+SAMCYFL  LADETL VKL  EAPATF EVLQK KK+IDGQELL
Subjt:  YSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELL

Query:  RTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSE
        RTKT RPEK+IDQ + S++K K+DSK++DKG SS  SR  YRRS+
Subjt:  RTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSE

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]4.3e-9263.12Show/hide
Query:  EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKA----------------------
        E  +Y+ +  DLR+HL ++K+ +S     SPS S    NS+ +A+S Y   TP+ VITREEFD +K +FD QVEALKA                      
Subjt:  EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKA----------------------

Query:  ----------------------RPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIR
                               PKDYV+VFEGLM+FQAATDAIKCRAFQIA TGSARLWYRRLPARSISTYSQLRKEFISQ SS +YDRKTATHLATIR
Subjt:  ----------------------RPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIR

Query:  QKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKD
        QK+GETLREYVTRFQEEQLKV HCSD+SAMCYFL GLAD+TL VKLG EAPATFAEVLQK KKVIDGQELLRTKTGRPEK+IDQ+++ ++K K+ SKS+D
Subjt:  QKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKD

Query:  KGSSSSGSRTEYRRSEIDHN
        KG SSS SR +Y+RS+ +HN
Subjt:  KGSSSSGSRTEYRRSEIDHN

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]5.0e-10167.56Show/hide
Query:  PSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKAR-------
        PS+ PG+REPIPN+EGVDYSLRDNDLRKHLTDKKK+AS EPEDS SYSREFSNS+LKAQSKYK   PEAVI REEFDLMKHRFDEQVEALKAR       
Subjt:  PSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKAR-------

Query:  -------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSH
                                             PKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEFI Q S  
Subjt:  -------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSH

Query:  HYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKK
        HYDRKTATHLATIRQKE                                   DETL VKLG EAPATFAEVLQ  KKVIDGQELLRTKT RPEKQIDQK+
Subjt:  HYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKK

Query:  LSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS
        LSQ+KRK DSKSKDKGSSSSGSRTEYRRSE   +RS
Subjt:  LSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]5.2e-9058.77Show/hide
Query:  EALSTLQRELDDMRHRRTRREGRWVPSLHPGDREPIPNN-----EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAV
        EA+ T  R +++M ++  +  G    S      E +        + VD      DLR HL ++K+ +S   E + +Y  +  NS+ +A+S Y    PE V
Subjt:  EALSTLQRELDDMRHRRTRREGRWVPSLHPGDREPIPNN-----EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAV

Query:  ITREEFDLMKHRFDEQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGS
        ITREEF+ +K +FD QVEALK R                                            PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGS
Subjt:  ITREEFDLMKHRFDEQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGS

Query:  ARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAE
        ARLWYRRLPARSISTYSQLRKEFISQ  S HYDRKT THLATIRQKEG+TL+EY+TRFQEEQLKVVHCSD+S+MCYFL GLADET  VKLG EA ATFAE
Subjt:  ARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAE

Query:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRS
        VLQ  KK IDGQELLRTKT RPEKQIDQKK SQ+KRK+DSKSKDKGSSSS SRT+Y RS
Subjt:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRS

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]3.7e-10455.97Show/hide
Query:  MVQPANSANTIERRGLNADNGPQRDLDARMVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANPTVDPEALSTLQRELDDMR
        MVQP +S NT +RR L A++G QR++ A +VE Q+  G   +   RSAR    +L PAHPKP KANRGRGG SR+T+  A P    E    LQ+E++ MR
Subjt:  MVQPANSANTIERRGLNADNGPQRDLDARMVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANPTVDPEALSTLQRELDDMR

Query:  HRRTRRE---GRWVPSLHPGDREPIPNNEGVDYSLRD--NDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFD
         +    E      V ++  G R         D + RD   DLR HL+ K+  + R+   SPS S +  NS+ +A+S Y    PE VITREEFD +K +FD
Subjt:  HRRTRRE---GRWVPSLHPGDREPIPNNEGVDYSLRD--NDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFD

Query:  EQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIS
         QVE LKAR                                            PKDYVEVFEGLM FQAATDAIK RAFQIALT SARLWYRRLPARSIS
Subjt:  EQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIS

Query:  TYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQEL
        TYSQLRKEF SQ SS HY+RKTATHLATIRQKE ETLREYVT FQEEQLKV H SD+SA+CYFL  L DETL VKLG EAPATFAEVLQK KKVIDGQEL
Subjt:  TYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQEL

Query:  LRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS
         RTKTGR EKQIDQKK SQEKRK++SKSKDK         EYRRS+   +RS
Subjt:  LRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS

TrEMBL top hitse value%identityAlignment
A0A6J1DDW5 uncharacterized protein LOC1110196341.6e-8960Show/hide
Query:  DDMRHRRTRREGRWVPSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDE
        +DM  +R    G  V   HP         E  +Y+ +  DLR+HL ++K+ +S     SPS S    NS+ +A+S Y   TPE VITREEFD +K +FD 
Subjt:  DDMRHRRTRREGRWVPSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDE

Query:  QVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIST
        QVEALKA+                                            PKDYVEVFEGLMDFQAATDAIKCR FQIALTGSARLWYRRLPARSIST
Subjt:  QVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIST

Query:  YSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELL
        YSQLRKEFI Q SS HYDRKTATHL TIRQKEGETLREYVTRFQEEQLKV HCSD+SAMCYFL  LADETL VKL  EAPATF EVLQK KK+IDGQELL
Subjt:  YSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELL

Query:  RTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSE
        RTKT RPEK+IDQ + S++K K+DSK++DKG SS  SR  YRRS+
Subjt:  RTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSE

A0A6J1DM55 uncharacterized protein LOC1110222672.1e-9263.12Show/hide
Query:  EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKA----------------------
        E  +Y+ +  DLR+HL ++K+ +S     SPS S    NS+ +A+S Y   TP+ VITREEFD +K +FD QVEALKA                      
Subjt:  EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKA----------------------

Query:  ----------------------RPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIR
                               PKDYV+VFEGLM+FQAATDAIKCRAFQIA TGSARLWYRRLPARSISTYSQLRKEFISQ SS +YDRKTATHLATIR
Subjt:  ----------------------RPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIR

Query:  QKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKD
        QK+GETLREYVTRFQEEQLKV HCSD+SAMCYFL GLAD+TL VKLG EAPATFAEVLQK KKVIDGQELLRTKTGRPEK+IDQ+++ ++K K+ SKS+D
Subjt:  QKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKD

Query:  KGSSSSGSRTEYRRSEIDHN
        KG SSS SR +Y+RS+ +HN
Subjt:  KGSSSSGSRTEYRRSEIDHN

A0A6J1DPC9 uncharacterized protein LOC1110222802.4e-10167.56Show/hide
Query:  PSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKAR-------
        PS+ PG+REPIPN+EGVDYSLRDNDLRKHLTDKKK+AS EPEDS SYSREFSNS+LKAQSKYK   PEAVI REEFDLMKHRFDEQVEALKAR       
Subjt:  PSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKAR-------

Query:  -------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSH
                                             PKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEFI Q S  
Subjt:  -------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSH

Query:  HYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKK
        HYDRKTATHLATIRQKE                                   DETL VKLG EAPATFAEVLQ  KKVIDGQELLRTKT RPEKQIDQK+
Subjt:  HYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKK

Query:  LSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS
        LSQ+KRK DSKSKDKGSSSSGSRTEYRRSE   +RS
Subjt:  LSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS

A0A6J1DPN4 uncharacterized protein LOC1110230602.5e-9058.77Show/hide
Query:  EALSTLQRELDDMRHRRTRREGRWVPSLHPGDREPIPNN-----EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAV
        EA+ T  R +++M ++  +  G    S      E +        + VD      DLR HL ++K+ +S   E + +Y  +  NS+ +A+S Y    PE V
Subjt:  EALSTLQRELDDMRHRRTRREGRWVPSLHPGDREPIPNN-----EGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAV

Query:  ITREEFDLMKHRFDEQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGS
        ITREEF+ +K +FD QVEALK R                                            PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGS
Subjt:  ITREEFDLMKHRFDEQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGS

Query:  ARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAE
        ARLWYRRLPARSISTYSQLRKEFISQ  S HYDRKT THLATIRQKEG+TL+EY+TRFQEEQLKVVHCSD+S+MCYFL GLADET  VKLG EA ATFAE
Subjt:  ARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAE

Query:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRS
        VLQ  KK IDGQELLRTKT RPEKQIDQKK SQ+KRK+DSKSKDKGSSSS SRT+Y RS
Subjt:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRS

A0A6J1DZJ1 uncharacterized protein LOC1110257381.8e-10455.97Show/hide
Query:  MVQPANSANTIERRGLNADNGPQRDLDARMVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANPTVDPEALSTLQRELDDMR
        MVQP +S NT +RR L A++G QR++ A +VE Q+  G   +   RSAR    +L PAHPKP KANRGRGG SR+T+  A P    E    LQ+E++ MR
Subjt:  MVQPANSANTIERRGLNADNGPQRDLDARMVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANPTVDPEALSTLQRELDDMR

Query:  HRRTRRE---GRWVPSLHPGDREPIPNNEGVDYSLRD--NDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFD
         +    E      V ++  G R         D + RD   DLR HL+ K+  + R+   SPS S +  NS+ +A+S Y    PE VITREEFD +K +FD
Subjt:  HRRTRRE---GRWVPSLHPGDREPIPNNEGVDYSLRD--NDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFD

Query:  EQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIS
         QVE LKAR                                            PKDYVEVFEGLM FQAATDAIK RAFQIALT SARLWYRRLPARSIS
Subjt:  EQVEALKAR--------------------------------------------PKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSIS

Query:  TYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQEL
        TYSQLRKEF SQ SS HY+RKTATHLATIRQKE ETLREYVT FQEEQLKV H SD+SA+CYFL  L DETL VKLG EAPATFAEVLQK KKVIDGQEL
Subjt:  TYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNEAPATFAEVLQKVKKVIDGQEL

Query:  LRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS
         RTKTGR EKQIDQKK SQEKRK++SKSKDK         EYRRS+   +RS
Subjt:  LRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCAGCCAGCAAACTCTGCCAATACGATAGAACGGAGGGGTTTGAACGCTGATAACGGCCCTCAGCGAGACCTTGATGCTAGAATGGTCGAGGATCAGGTTCGAGC
AGGGCAAGAGGGAGATCTTCCACGGAGATCGGCCCGCCATGCGAACCAGGAGTTACCCCCTGCTCACCCTAAACCCTCAAAAGCCAACCGAGGCCGAGGAGGGACCTCCA
GAAAGACCTCCCAAAGGGCCAACCCGACGGTAGACCCTGAGGCTTTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGGCGCACCCGGAGAGAAGGAAGGTGG
GTTCCATCTCTCCACCCTGGCGACCGCGAGCCCATTCCCAACAATGAGGGGGTGGATTATAGCTTGCGGGACAATGATCTGAGAAAGCACCTCACTGATAAGAAGAAGAG
AGCATCTCGGGAGCCGGAAGACTCTCCGTCCTACTCCCGAGAGTTCTCCAATTCTGACCTCAAGGCTCAATCAAAGTATAAGTCTCCAACACCAGAAGCTGTGATCACCA
GGGAAGAGTTCGACCTGATGAAGCACAGGTTTGACGAGCAGGTCGAGGCGCTCAAAGCCAGACCAAAGGACTATGTGGAGGTCTTCGAGGGCCTCATGGACTTTCAGGCG
GCAACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGCCGGCTAGGTCGATCTCGACCTACTCTCAGCTGAG
AAAAGAGTTCATTAGTCAATCCTCTTCTCATCATTACGATAGAAAGACAGCGACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGAGAGAGTATGTCACGA
GGTTCCAGGAGGAACAGCTGAAGGTCGTGCACTGCTCCGACAATTCGGCCATGTGCTACTTCCTCATCGGCCTGGCCGATGAGACCCTTAACGTGAAGCTCGGAAATGAG
GCTCCAGCAACCTTCGCCGAAGTCCTGCAAAAGGTGAAGAAAGTCATCGATGGACAAGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAGCAAATCGACCAGAAGAA
GCTAAGCCAAGAGAAGAGGAAGTCCGATTCTAAGTCTAAGGACAAGGGATCATCCTCTTCTGGTAGTAGAACCGAGTATCGTCGGTCGGAGATCGACCATAATCGAAGCT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCAGCCAGCAAACTCTGCCAATACGATAGAACGGAGGGGTTTGAACGCTGATAACGGCCCTCAGCGAGACCTTGATGCTAGAATGGTCGAGGATCAGGTTCGAGC
AGGGCAAGAGGGAGATCTTCCACGGAGATCGGCCCGCCATGCGAACCAGGAGTTACCCCCTGCTCACCCTAAACCCTCAAAAGCCAACCGAGGCCGAGGAGGGACCTCCA
GAAAGACCTCCCAAAGGGCCAACCCGACGGTAGACCCTGAGGCTTTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGGCGCACCCGGAGAGAAGGAAGGTGG
GTTCCATCTCTCCACCCTGGCGACCGCGAGCCCATTCCCAACAATGAGGGGGTGGATTATAGCTTGCGGGACAATGATCTGAGAAAGCACCTCACTGATAAGAAGAAGAG
AGCATCTCGGGAGCCGGAAGACTCTCCGTCCTACTCCCGAGAGTTCTCCAATTCTGACCTCAAGGCTCAATCAAAGTATAAGTCTCCAACACCAGAAGCTGTGATCACCA
GGGAAGAGTTCGACCTGATGAAGCACAGGTTTGACGAGCAGGTCGAGGCGCTCAAAGCCAGACCAAAGGACTATGTGGAGGTCTTCGAGGGCCTCATGGACTTTCAGGCG
GCAACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGCCGGCTAGGTCGATCTCGACCTACTCTCAGCTGAG
AAAAGAGTTCATTAGTCAATCCTCTTCTCATCATTACGATAGAAAGACAGCGACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGAGAGAGTATGTCACGA
GGTTCCAGGAGGAACAGCTGAAGGTCGTGCACTGCTCCGACAATTCGGCCATGTGCTACTTCCTCATCGGCCTGGCCGATGAGACCCTTAACGTGAAGCTCGGAAATGAG
GCTCCAGCAACCTTCGCCGAAGTCCTGCAAAAGGTGAAGAAAGTCATCGATGGACAAGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAGCAAATCGACCAGAAGAA
GCTAAGCCAAGAGAAGAGGAAGTCCGATTCTAAGTCTAAGGACAAGGGATCATCCTCTTCTGGTAGTAGAACCGAGTATCGTCGGTCGGAGATCGACCATAATCGAAGCT
GA
Protein sequenceShow/hide protein sequence
MVQPANSANTIERRGLNADNGPQRDLDARMVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTSRKTSQRANPTVDPEALSTLQRELDDMRHRRTRREGRW
VPSLHPGDREPIPNNEGVDYSLRDNDLRKHLTDKKKRASREPEDSPSYSREFSNSDLKAQSKYKSPTPEAVITREEFDLMKHRFDEQVEALKARPKDYVEVFEGLMDFQA
ATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQSSSHHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDNSAMCYFLIGLADETLNVKLGNE
APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKSDSKSKDKGSSSSGSRTEYRRSEIDHNRS