; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:733681..748061
RNA-Seq ExpressionMoc01g01060
SyntenyMoc01g01060
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.8e-6043.3Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+TSI+ +LA EKLNG+NY+ WK+NLNTILVVDDLRFVLTEECPQAP  NA R  R+AYDRW+KANDKA VYILAS++DVLAKKH+ I TA+ IMDSL++
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQARWREPGQLYL-----GISSEEFPTILDKKGRQTLSPQNNSIEVRPQETKFVLSS-------------------------SGSKTFKN-KKN
        MFGQPS   R      +Y      G S  E   +LD      ++  N        +  F+L S                         +  + F+N   +
Subjt:  MFGQPSIQARWREPGQLYL-----GISSEEFPTILDKKGRQTLSPQNNSIEVRPQETKFVLSS-------------------------SGSKTFKN-KKN

Query:  SGKGVKAN-------------------PTAAAATKKGKTKV---------VDKGKCFHYNVDGHCKRNCPKYLAEKKKAK--------------------
         GK V+AN                   P+ A   KKGK K           DKGKCFH N DGH KRNCPKYLAEKK  K                    
Subjt:  SGKGVKAN-------------------PTAAAATKKGKTKV---------VDKGKCFHYNVDGHCKRNCPKYLAEKKKAK--------------------

Query:  ------EGATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPLICIGESKFI
               GATNH+C+SFQ  SSW++L  G++TLKVG G+VVSA A   L    + +++
Subjt:  ------EGATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPLICIGESKFI

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.2e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

A0A5A7TWB9 Gag/pol protein1.2e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

A0A5A7UGV2 Gag/pol protein1.2e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

A0A5D3CPJ6 Gag/pol protein2.0e-5941.92Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+++ + MLA +KLNG NY  WK  +NT+L++DDLRFVL EECPQ P  NA R  R+ Y+RW KAN+KA  YILAS+S+VLAKKHE ++TAREIMDSLQ+
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP
        MFGQ S Q           AR  E   +   +                   + +   IL+      L  ++N++                      +++ 
Subjt:  MFGQPSIQ-----------ARWREPGQLYLGI------------------SSEEFPTILDKKGRQTLSPQNNSI----------------------EVRP

Query:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------
        Q+                   TK + SSSG+K +K KK  G+G KAN  AA  TKK K     KG CFH N +GH KRNCPKYLAEKKKAK+        
Subjt:  QE-------------------TKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKAKE--------

Query:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES
                        GATNHVC SFQGISSWRQL+ G+MT++VG G VVSA+A   L +C+ +S
Subjt:  ----------------GATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPL-ICIGES

E2GK51 Gag/pol protein (Fragment)1.8e-6043.3Show/hide
Query:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD
        M+TSI+ +LA EKLNG+NY+ WK+NLNTILVVDDLRFVLTEECPQAP  NA R  R+AYDRW+KANDKA VYILAS++DVLAKKH+ I TA+ IMDSL++
Subjt:  MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQD

Query:  MFGQPSIQARWREPGQLYL-----GISSEEFPTILDKKGRQTLSPQNNSIEVRPQETKFVLSS-------------------------SGSKTFKN-KKN
        MFGQPS   R      +Y      G S  E   +LD      ++  N        +  F+L S                         +  + F+N   +
Subjt:  MFGQPSIQARWREPGQLYL-----GISSEEFPTILDKKGRQTLSPQNNSIEVRPQETKFVLSS-------------------------SGSKTFKN-KKN

Query:  SGKGVKAN-------------------PTAAAATKKGKTKV---------VDKGKCFHYNVDGHCKRNCPKYLAEKKKAK--------------------
         GK V+AN                   P+ A   KKGK K           DKGKCFH N DGH KRNCPKYLAEKK  K                    
Subjt:  SGKGVKAN-------------------PTAAAATKKGKTKV---------VDKGKCFHYNVDGHCKRNCPKYLAEKKKAK--------------------

Query:  ------EGATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPLICIGESKFI
               GATNH+C+SFQ  SSW++L  G++TLKVG G+VVSA A   L    + +++
Subjt:  ------EGATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPLICIGESKFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACTTCTATTATTGCCATGCTTGCCGTCGAAAAACTTAACGGCGAAAATTACACACAATGGAAAACGAACCTTAACACGATACTCGTGGTAGATGATCTTAGGTT
CGTCTTAACTGAGGAGTGTCCTCAGGCTCCCATGCCTAATGCAGTCCGAGCCAGTCGGGATGCCTATGACAGATGGATCAAGGCCAATGACAAGGCCAACGTCTACATCT
TGGCAAGCATATCTGATGTGCTGGCCAAGAAGCATGAGAGGATAGTCACCGCAAGGGAGATCATGGACTCATTGCAGGACATGTTTGGACAACCGTCCATTCAAGCCCGA
TGGAGGGAGCCAGGTCAGCTTTATCTTGGAATCTCTTCCGAAGAGTTTCCTACAATTTTGGACAAGAAGGGGAGACAAACGTTATCACCTCAAAACAATTCCATCGAGGT
TCGACCTCAGGAAACCAAATTTGTACTTTCTTCTTCTGGAAGTAAGACTTTTAAAAATAAGAAGAATAGTGGTAAGGGGGTGAAAGCTAATCCTACTGCTGCTGCTGCTA
CCAAAAAGGGCAAGACCAAAGTTGTAGACAAAGGAAAATGTTTCCACTACAACGTGGATGGGCATTGTAAAAGAAACTGCCCGAAATACTTGGCCGAAAAGAAGAAAGCC
AAAGAAGGGGCCACTAACCATGTTTGTTATTCTTTTCAGGGAATTAGTTCTTGGAGGCAGCTTGATGCTGGGGATATGACTCTCAAAGTCGGAATGGGAGATGTCGTCTC
AGCAGTCGCAGATTCTCCCTTGATTTGCATAGGTGAGAGTAAGTTCATCAGCGCTGCTCAATATGCCTCCCATTTCAGGGATAAGACTAGGTTCCAATTCCCTGATCAAC
AAGTTAGGTGTGAGCGAGAGGGTGAACTGGAAGGTCGAAATCCTCACATCAATATTTGGCGCCGTCTGTGGGGACAAAAGAAAAACAAGCCAAACAACATGACACCTGAG
AGGAGTCCGTGGGGTTCTGACGATGATTGCCACGCGAGGAGGAGGTTAAACTTGGATGACCTCCTGATAAGAGGACCTGAGGGCGAAACAGGATTGAGTCGACAAAACCC
CGAGCACCAAGAAGGACTGCTCGAGGTGCCAACGACAATAGCCTCGGAACAGCTCCAAAGTCAGTTTGCGGCCTTAGAGAGAAAGGTGGAGGCGATGCTTCAACGCATGA
CCCAATTACTTCAACAACTAGAGCCACAAGAGGCCAGCGAGGAACCCCTCATCCAAGACCCCCGAAAGGGAAAAGTAGATCACCTGGATGCCTACCGAGAATGGATGGAC
ATCTATGGAGTATCAGAGGAGATTAGATGCTGGGTATTCTCGACGACTTTGAGTGGGTCGGCCAGGGCAAGAATTGAGTACTCCGAGGATGAGGTGACTCATCTCCTTCA
CCCAGACAACGATGCACTGGTCATCACTCTAAAAATAACTAATGCGAAGGTGCACCGGATCTTGGTAGATGGAGGCAACTCAGCAGATATCATCTCCCTCATGGCCTACA
AAGCCATTGGTCTAGGAGACAAAGGTCTTAAGAGTAGCCCGGCACCGCTTGTAGGGTTTCGAGGAGAGCGAGTCATTCCTGAAGGAAGGATAGAATTGCTAGTGATATTC
GGGAGTAGGCCAAAGAGCATCACTAAAAGGGTAGATTTCTTGGTCGTCGAGTACGCATCCTCCTATAATGCGATATTAGGTAGGCCGACAATGCACATGCTCAGGACGAT
GCCATCTACATATCACCAGTCTATGAAATTCTCAACAACCAGTGGAATTGGTGAAATCAAAGGAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTACTTCTATTATTGCCATGCTTGCCGTCGAAAAACTTAACGGCGAAAATTACACACAATGGAAAACGAACCTTAACACGATACTCGTGGTAGATGATCTTAGGTT
CGTCTTAACTGAGGAGTGTCCTCAGGCTCCCATGCCTAATGCAGTCCGAGCCAGTCGGGATGCCTATGACAGATGGATCAAGGCCAATGACAAGGCCAACGTCTACATCT
TGGCAAGCATATCTGATGTGCTGGCCAAGAAGCATGAGAGGATAGTCACCGCAAGGGAGATCATGGACTCATTGCAGGACATGTTTGGACAACCGTCCATTCAAGCCCGA
TGGAGGGAGCCAGGTCAGCTTTATCTTGGAATCTCTTCCGAAGAGTTTCCTACAATTTTGGACAAGAAGGGGAGACAAACGTTATCACCTCAAAACAATTCCATCGAGGT
TCGACCTCAGGAAACCAAATTTGTACTTTCTTCTTCTGGAAGTAAGACTTTTAAAAATAAGAAGAATAGTGGTAAGGGGGTGAAAGCTAATCCTACTGCTGCTGCTGCTA
CCAAAAAGGGCAAGACCAAAGTTGTAGACAAAGGAAAATGTTTCCACTACAACGTGGATGGGCATTGTAAAAGAAACTGCCCGAAATACTTGGCCGAAAAGAAGAAAGCC
AAAGAAGGGGCCACTAACCATGTTTGTTATTCTTTTCAGGGAATTAGTTCTTGGAGGCAGCTTGATGCTGGGGATATGACTCTCAAAGTCGGAATGGGAGATGTCGTCTC
AGCAGTCGCAGATTCTCCCTTGATTTGCATAGGTGAGAGTAAGTTCATCAGCGCTGCTCAATATGCCTCCCATTTCAGGGATAAGACTAGGTTCCAATTCCCTGATCAAC
AAGTTAGGTGTGAGCGAGAGGGTGAACTGGAAGGTCGAAATCCTCACATCAATATTTGGCGCCGTCTGTGGGGACAAAAGAAAAACAAGCCAAACAACATGACACCTGAG
AGGAGTCCGTGGGGTTCTGACGATGATTGCCACGCGAGGAGGAGGTTAAACTTGGATGACCTCCTGATAAGAGGACCTGAGGGCGAAACAGGATTGAGTCGACAAAACCC
CGAGCACCAAGAAGGACTGCTCGAGGTGCCAACGACAATAGCCTCGGAACAGCTCCAAAGTCAGTTTGCGGCCTTAGAGAGAAAGGTGGAGGCGATGCTTCAACGCATGA
CCCAATTACTTCAACAACTAGAGCCACAAGAGGCCAGCGAGGAACCCCTCATCCAAGACCCCCGAAAGGGAAAAGTAGATCACCTGGATGCCTACCGAGAATGGATGGAC
ATCTATGGAGTATCAGAGGAGATTAGATGCTGGGTATTCTCGACGACTTTGAGTGGGTCGGCCAGGGCAAGAATTGAGTACTCCGAGGATGAGGTGACTCATCTCCTTCA
CCCAGACAACGATGCACTGGTCATCACTCTAAAAATAACTAATGCGAAGGTGCACCGGATCTTGGTAGATGGAGGCAACTCAGCAGATATCATCTCCCTCATGGCCTACA
AAGCCATTGGTCTAGGAGACAAAGGTCTTAAGAGTAGCCCGGCACCGCTTGTAGGGTTTCGAGGAGAGCGAGTCATTCCTGAAGGAAGGATAGAATTGCTAGTGATATTC
GGGAGTAGGCCAAAGAGCATCACTAAAAGGGTAGATTTCTTGGTCGTCGAGTACGCATCCTCCTATAATGCGATATTAGGTAGGCCGACAATGCACATGCTCAGGACGAT
GCCATCTACATATCACCAGTCTATGAAATTCTCAACAACCAGTGGAATTGGTGAAATCAAAGGAAAGTAG
Protein sequenceShow/hide protein sequence
MSTSIIAMLAVEKLNGENYTQWKTNLNTILVVDDLRFVLTEECPQAPMPNAVRASRDAYDRWIKANDKANVYILASISDVLAKKHERIVTAREIMDSLQDMFGQPSIQAR
WREPGQLYLGISSEEFPTILDKKGRQTLSPQNNSIEVRPQETKFVLSSSGSKTFKNKKNSGKGVKANPTAAAATKKGKTKVVDKGKCFHYNVDGHCKRNCPKYLAEKKKA
KEGATNHVCYSFQGISSWRQLDAGDMTLKVGMGDVVSAVADSPLICIGESKFISAAQYASHFRDKTRFQFPDQQVRCEREGELEGRNPHINIWRRLWGQKKNKPNNMTPE
RSPWGSDDDCHARRRLNLDDLLIRGPEGETGLSRQNPEHQEGLLEVPTTIASEQLQSQFAALERKVEAMLQRMTQLLQQLEPQEASEEPLIQDPRKGKVDHLDAYREWMD
IYGVSEEIRCWVFSTTLSGSARARIEYSEDEVTHLLHPDNDALVITLKITNAKVHRILVDGGNSADIISLMAYKAIGLGDKGLKSSPAPLVGFRGERVIPEGRIELLVIF
GSRPKSITKRVDFLVVEYASSYNAILGRPTMHMLRTMPSTYHQSMKFSTTSGIGEIKGK