CuGenDBv2

Moc01g04430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g04430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr1:2926525..2935487
RNA-Seq Expression: Moc01g04430
Synteny: Moc01g04430
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value 1.0e-58; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value 1.0e-58; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value 1.0e-58; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value 1.0e-58; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

TYK26319.1 gag/pol protein [Cucumis melo var. makuwa]; e value 4.7e-59; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRAESG----QLYSRI------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +      E+      LY  +            
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRAESG----QLYSRI------------

Query:  --------------------------------------SSEEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
                                                + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  --------------------------------------SSEEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 5.0e-59; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

A0A5A7TWB9 Gag/pol protein; e value 5.0e-59; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

A0A5A7V4M1 Gag/pol protein; e value 5.0e-59; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

A0A5D3CPJ6 Gag/pol protein; e value 5.0e-59; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +                               
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRA-------------------------

Query:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
         E+ Q+   + S                            + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  -ESGQLYSRISS----------------------------EEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

A0A5D3DS88 Gag/pol protein; e value 2.3e-59; identity 47.85%
Query:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRAESG----QLYSRI------------
        NAT  +R  Y+RW +AN+KA+ YILA++S+VLAKKHE  +TA++IMDSLQ MFGQ S Q +H+ALK  +      E+      LY  +            
Subjt:  NATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRAESG----QLYSRI------------

Query:  --------------------------------------SSEEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST
                                                + F ++ +  G + EANVATS + FHRG +SGTKS P+SSG+K +KKKK    +    + 
Subjt:  --------------------------------------SSEEFPAISQQCGPEREANVATS-KTFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDST

Query:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV
         AAAK  K   A KG CFHCN  GHWKRN PK L EKKKA +GKYDLLVLETCLVEND SA I+D GAT HVCSSF+GISSWRQLE GEMT++V  G VV
Subjt:  AAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVCSSFKGISSWRQLEAGEMTLKVEMGEVV

Query:  SIV
        S +
Subjt:  SIV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCTAACGCCACTGTGCCGATGCGAAACGTCTATGACCGATGGATCAGGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAACATATCTGATGTGCTTGCTAAGAA
GCACGAAGACACGGTCACCGCTAAGAAGATCATGGACTCGCTGCAAAGCATGTTTGGACAACCGTCTTCACAGGCTCGACATGAAGCTCTTAAAGTTGAATGGGGCCGTC
ATAGACGAGCAGAGTCAGGTCAGCTTTATTCTAGAATCTCTTCCGAAGAGTTTCCTGCAATTTCGCAGCAGTGCGGACCGGAAAGGGAGGCTAACGTTGCCACCTCAAAG
ACGTTCCACCGAGGTTTGTCCTCTGGAACCAAGTCTTCGCCCACTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAAGGCTGCCAATAAGCATCCTAAACCTGACTCTAC
TGCTGCCGCTGCCAAGAAAGGCAAGACCAAGGTTGCAGACAAAGGAAAATGTTTTCACTGCAACATGAACGGGCATTGGAAGCGCAACTGGCCGAAATCCCTGATCGAGA
AGAAGAAAGCTAACGAAGGTAAATATGATTTACTTGTTTTGGAAACTTGTTTAGTAGAGAATGATTACTCTGCTCAGATACTTGATTTAGGAGCCACTAAACATGTTTGT
TCTTCATTTAAGGGAATTAGTTCCTGGAGGCAGCTTGAAGCCGGAGAGATGACTCTCAAGGTCGAAATGGGAGAAGTCGTCTCAATTGTGGGCTCGTCCTTTGATTTGTA
TGGGTGCAAGTGGCTCGAGTTGCCGACACAATATTCCTACAATTTTGGGGACGAGACCAAGTGGGGAGCTGGGAACATAACTACACAAGATGGCATTAACTTCTTCCCAA
GTTTAGGATCTGCATGGGTGAGAGCAGCTCAACAGCGCTGGCTCAATAAGCCTCCCATTTCAGGGATAAGACCAGAGGATCAGTGGGACTTAAGGAACAAGAGACTACGC
AAAGGGACAAGTTCGAGCTCTTCCATTGCTACGGCAGGATCTGAATGGTGCGTGGTGGAGGGGTCTACTTCATCGAGTTGGATACTTAATAGCTTGATGGGCAATAGACG
GGATGGCCTGGAGCATATGGATGATGGGTCAATCGGCCTATACCACCATGAACTCAGCCATCCTGGTGACTTGGGTGTCTCCTTGGCCAATAGTGATGGGGAGATCAATA
CATCCTTCTATAGTGACTATTTCCCCTGCGAATCCAACCAAGGGTGTTGGGCTCTTTTTAAGCTGTGCCTTAGTCCATTCAAAGGCAGCATAGGTGGGCGTCATTGTAGG
GTGAATGGACCTCTTCAAGGTCGGAGTCGAAGAACGGAACAGGATTGGCCCCCACTTGGGCCTCTGAAGATAGTGTTGATGTCAGCTCGAGCTCAGCTTGCCAACAAATT
TTTGGAAGTAGCCATATTGAATCAGATCTTCAATTTCGCGCTTCAGCTCGAAACAATTGGAAGTGTCGTGGTCGTGATCTCTGTGAAAACAGAAGTATATTTCCTTGTTT
CGTTTGTCAGGATCTCCTCTGAGCTTTCCAGGACGTTTGAGGAGTTTCTCCATCCCAGTCTCTTCAATTTCGGAGCAGCTCCTATCGTCAATGAATATCTTGGCCTTTTG
CAAGACCTCAGTGATGCAGGATGGTGCTTCTTCTCCCAATTTTACCGTCAATGTCTCATCGACCAGGCCGATGATGAAGTAGCACATTGTGGAGTCGTCCGAACAGTGCG
CGACCTTCAATTGTCCCTCCTGGAACCTGGTCACGTACTCTTTGAGTGTCTCGCCCTCCTTCTATCGAATGGTGGTGAGATGGGTGGCTGTCTTTCGGTCATAATGCCGA
GAAGAGAATTGACTTGTAATCTCTTTCCTCAACTGACTGTCAAGCTCTTCCCTTGTGATTACTTCTTCAGCTTTGATGATGCTACATCATCGGATCCAGAAACAGGTAGA
AAGCGACGAGCTAGCAGTGAAAAGATTTCTCAACGGGAATTACCACAGTAA
mRNA sequence
ATGCCTAACGCCACTGTGCCGATGCGAAACGTCTATGACCGATGGATCAGGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAACATATCTGATGTGCTTGCTAAGAA
GCACGAAGACACGGTCACCGCTAAGAAGATCATGGACTCGCTGCAAAGCATGTTTGGACAACCGTCTTCACAGGCTCGACATGAAGCTCTTAAAGTTGAATGGGGCCGTC
ATAGACGAGCAGAGTCAGGTCAGCTTTATTCTAGAATCTCTTCCGAAGAGTTTCCTGCAATTTCGCAGCAGTGCGGACCGGAAAGGGAGGCTAACGTTGCCACCTCAAAG
ACGTTCCACCGAGGTTTGTCCTCTGGAACCAAGTCTTCGCCCACTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAAGGCTGCCAATAAGCATCCTAAACCTGACTCTAC
TGCTGCCGCTGCCAAGAAAGGCAAGACCAAGGTTGCAGACAAAGGAAAATGTTTTCACTGCAACATGAACGGGCATTGGAAGCGCAACTGGCCGAAATCCCTGATCGAGA
AGAAGAAAGCTAACGAAGGTAAATATGATTTACTTGTTTTGGAAACTTGTTTAGTAGAGAATGATTACTCTGCTCAGATACTTGATTTAGGAGCCACTAAACATGTTTGT
TCTTCATTTAAGGGAATTAGTTCCTGGAGGCAGCTTGAAGCCGGAGAGATGACTCTCAAGGTCGAAATGGGAGAAGTCGTCTCAATTGTGGGCTCGTCCTTTGATTTGTA
TGGGTGCAAGTGGCTCGAGTTGCCGACACAATATTCCTACAATTTTGGGGACGAGACCAAGTGGGGAGCTGGGAACATAACTACACAAGATGGCATTAACTTCTTCCCAA
GTTTAGGATCTGCATGGGTGAGAGCAGCTCAACAGCGCTGGCTCAATAAGCCTCCCATTTCAGGGATAAGACCAGAGGATCAGTGGGACTTAAGGAACAAGAGACTACGC
AAAGGGACAAGTTCGAGCTCTTCCATTGCTACGGCAGGATCTGAATGGTGCGTGGTGGAGGGGTCTACTTCATCGAGTTGGATACTTAATAGCTTGATGGGCAATAGACG
GGATGGCCTGGAGCATATGGATGATGGGTCAATCGGCCTATACCACCATGAACTCAGCCATCCTGGTGACTTGGGTGTCTCCTTGGCCAATAGTGATGGGGAGATCAATA
CATCCTTCTATAGTGACTATTTCCCCTGCGAATCCAACCAAGGGTGTTGGGCTCTTTTTAAGCTGTGCCTTAGTCCATTCAAAGGCAGCATAGGTGGGCGTCATTGTAGG
GTGAATGGACCTCTTCAAGGTCGGAGTCGAAGAACGGAACAGGATTGGCCCCCACTTGGGCCTCTGAAGATAGTGTTGATGTCAGCTCGAGCTCAGCTTGCCAACAAATT
TTTGGAAGTAGCCATATTGAATCAGATCTTCAATTTCGCGCTTCAGCTCGAAACAATTGGAAGTGTCGTGGTCGTGATCTCTGTGAAAACAGAAGTATATTTCCTTGTTT
CGTTTGTCAGGATCTCCTCTGAGCTTTCCAGGACGTTTGAGGAGTTTCTCCATCCCAGTCTCTTCAATTTCGGAGCAGCTCCTATCGTCAATGAATATCTTGGCCTTTTG
CAAGACCTCAGTGATGCAGGATGGTGCTTCTTCTCCCAATTTTACCGTCAATGTCTCATCGACCAGGCCGATGATGAAGTAGCACATTGTGGAGTCGTCCGAACAGTGCG
CGACCTTCAATTGTCCCTCCTGGAACCTGGTCACGTACTCTTTGAGTGTCTCGCCCTCCTTCTATCGAATGGTGGTGAGATGGGTGGCTGTCTTTCGGTCATAATGCCGA
GAAGAGAATTGACTTGTAATCTCTTTCCTCAACTGACTGTCAAGCTCTTCCCTTGTGATTACTTCTTCAGCTTTGATGATGCTACATCATCGGATCCAGAAACAGGTAGA
AAGCGACGAGCTAGCAGTGAAAAGATTTCTCAACGGGAATTACCACAGTAA
Protein sequence
MPNATVPMRNVYDRWIRANDKAKVYILANISDVLAKKHEDTVTAKKIMDSLQSMFGQPSSQARHEALKVEWGRHRRAESGQLYSRISSEEFPAISQQCGPEREANVATSK
TFHRGLSSGTKSSPTSSGSKTFKKKKAANKHPKPDSTAAAAKKGKTKVADKGKCFHCNMNGHWKRNWPKSLIEKKKANEGKYDLLVLETCLVENDYSAQILDLGATKHVC
SSFKGISSWRQLEAGEMTLKVEMGEVVSIVGSSFDLYGCKWLELPTQYSYNFGDETKWGAGNITTQDGINFFPSLGSAWVRAAQQRWLNKPPISGIRPEDQWDLRNKRLR
KGTSSSSSIATAGSEWCVVEGSTSSSWILNSLMGNRRDGLEHMDDGSIGLYHHELSHPGDLGVSLANSDGEINTSFYSDYFPCESNQGCWALFKLCLSPFKGSIGGRHCR
VNGPLQGRSRRTEQDWPPLGPLKIVLMSARAQLANKFLEVAILNQIFNFALQLETIGSVVVVISVKTEVYFLVSFVRISSELSRTFEEFLHPSLFNFGAAPIVNEYLGLL
QDLSDAGWCFFSQFYRQCLIDQADDEVAHCGVVRTVRDLQLSLLEPGHVLFECLALLLSNGGEMGGCLSVIMPRRELTCNLFPQLTVKLFPCDYFFSFDDATSSDPETGR
KRRASSEKISQRELPQ