; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04000
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:2589761..2594738
RNA-Seq ExpressionMoc01g04000
SyntenyMoc01g04000
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-6261.03Show/hide
Query:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF
        +LAA+KLNG NY  WK  +N +L+++DL FVL EEC Q P  NA R  R+ Y+RW KAN K   YIL S+S VLAKKHESM+TAREI+DSLQ+MFGQ S+
Subjt:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF

Query:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV
        Q  +DALK+IYN+ M EG SVREHVLN+               + SQVSFILESL ++FLQF +NAVMNKI Y LTTLLNELQ F+SL+K KGQ+ E NV
Subjt:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV

Query:  ATS-KRFHRGATN
        ATS ++FHRG+T+
Subjt:  ATS-KRFHRGATN

KAA0055183.1 gag/pol protein [Cucumis melo var. makuwa]4.0e-7048.22Show/hide
Query:  PGVTTHTEGLFI---------------ALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISN
        PGVT H+EG  I               +++A++KLNG+NY  WK+NLN ILVV+DL FVL EEC Q+   NA RA+R AYDRWIK N K  VYIL ++S+
Subjt:  PGVTTHTEGLFI---------------ALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISN

Query:  VLAKKHESMVTAREIIDSLQDMFGQPSFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIE
        +LAKKHES+ T +EI+DSL+ MFGQP +   ++ +K+IY  HMKEGTSVREHVL++               + +QVSFIL+SL K+F+ F  NA +NKIE
Subjt:  VLAKKHESMVTAREIIDSLQDMFGQPSFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIE

Query:  YNLTTLLNELQIFQSLIKNKGQERETNVATSK-RFHRGATNHVCSSFQGISPRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQ
        +NLT  LNELQ FQ+L K KG+E E NVAT+K +F RG+     SS   I P   L + +   K    +   +     T   D+ S+ST+VVD      Q
Subjt:  YNLTTLLNELQIFQSLIKNKGQERETNVATSK-RFHRGATNHVCSSFQGISPRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQ

Query:  SHPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV
        +HPSQEL   R S R+V QPDRY+ L E Q+ IPD+ +
Subjt:  SHPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-6539.36Show/hide
Query:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF
        +LAA+KLNG NY  WK  +N +L+++DL FVL EEC Q P  NA +  R+ Y+RW KAN K   YIL S+S VLAKKHESM+TAREI+DSLQ+MFGQ S+
Subjt:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF

Query:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV
        Q  +DALK+IYN+ M EG SVREHVLN+               + SQVSFILESL ++FLQF +NAVMNKI Y LTTLLNELQ F+SL+K KGQ+ E NV
Subjt:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV

Query:  ATS-KRFHRGATN-----------------------------------------------------------------------HVCSSFQGIS------
        ATS ++FHRG+T+                                                                       H+ +SF G +      
Subjt:  ATS-KRFHRGATN-----------------------------------------------------------------------HVCSSFQGIS------

Query:  -----PRRQLDAGKMTL---------------------------------------------KVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQS
             P + +    + L                                             K+VLNE+S E T   TRVV++ S  TRVV   S++R +
Subjt:  -----PRRQLDAGKMTL---------------------------------------------KVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQS

Query:  HPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV
        H  Q LR PR SGR+ + P RYM L E    I D D+
Subjt:  HPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV

TYJ97035.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-6348.46Show/hide
Query:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP
        + LLA EKLNG+NY  WK+NLN ILVV+DL FVLTEEC Q P+ NA + SR AYDRWIKAN K  VYIL S+S+VLAKKHES+ TA+EI+DSL+ MFGQP
Subjt:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP

Query:  SFQAWYDALKFIYNSHMKEGTSVREHVL------NLYKV--------SQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQE---
         +   ++ +K+IY   MKEGTS++EHVL      N+++V        +QVSFILESL K+F+ F  NA +NKIE+N TTLLNELQ FQ+L K    +   
Subjt:  SFQAWYDALKFIYNSHMKEGTSVREHVL------NLYKV--------SQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQE---

Query:  --------RETNVATSKRFHRGATNHVCSSFQGIS--PRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQSHPSQELRVPRHSG
                R  NV  S+ +      +  S     S      L   K        E   E  N      D+ S+ST+VVD      Q+HPSQEL  PR SG
Subjt:  --------RETNVATSKRFHRGATNHVCSSFQGIS--PRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQSHPSQELRVPRHSG

Query:  RIVSQPDRYMDLIEIQVFIPDDDV
        R+V Q D Y+ L E Q+ IPDD +
Subjt:  RIVSQPDRYMDLIEIQVFIPDDDV

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]4.7e-6360.56Show/hide
Query:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP
        I LLA++KLNG+NY  WK+NLN ILV++DL FVLTEEC  AP PNA R  RDAYDRW+KAN K  VYIL SIS VL+KKHE + T REI+DSLQ +FGQP
Subjt:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP

Query:  SFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERET
        S    +DA+K++YN  MKEG+SVREHVLN+               ++SQV FI++SL K++ QF  NA+MNKIEY+LTTLLNELQ+++SL+KNKG E E 
Subjt:  SFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERET

Query:  NVATS--KRFHRG
        NVAT+  ++FH+G
Subjt:  NVATS--KRFHRG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.1e-6361.03Show/hide
Query:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF
        +LAA+KLNG NY  WK  +N +L+++DL FVL EEC Q P  NA R  R+ Y+RW KAN K   YIL S+S VLAKKHESM+TAREI+DSLQ+MFGQ S+
Subjt:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF

Query:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV
        Q  +DALK+IYN+ M EG SVREHVLN+               + SQVSFILESL ++FLQF +NAVMNKI Y LTTLLNELQ F+SL+K KGQ+ E NV
Subjt:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV

Query:  ATS-KRFHRGATN
        ATS ++FHRG+T+
Subjt:  ATS-KRFHRGATN

A0A5A7V6N0 Gag/pol protein4.2e-6539.36Show/hide
Query:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF
        +LAA+KLNG NY  WK  +N +L+++DL FVL EEC Q P  NA +  R+ Y+RW KAN K   YIL S+S VLAKKHESM+TAREI+DSLQ+MFGQ S+
Subjt:  LLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSF

Query:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV
        Q  +DALK+IYN+ M EG SVREHVLN+               + SQVSFILESL ++FLQF +NAVMNKI Y LTTLLNELQ F+SL+K KGQ+ E NV
Subjt:  QAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERETNV

Query:  ATS-KRFHRGATN-----------------------------------------------------------------------HVCSSFQGIS------
        ATS ++FHRG+T+                                                                       H+ +SF G +      
Subjt:  ATS-KRFHRGATN-----------------------------------------------------------------------HVCSSFQGIS------

Query:  -----PRRQLDAGKMTL---------------------------------------------KVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQS
             P + +    + L                                             K+VLNE+S E T   TRVV++ S  TRVV   S++R +
Subjt:  -----PRRQLDAGKMTL---------------------------------------------KVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQS

Query:  HPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV
        H  Q LR PR SGR+ + P RYM L E    I D D+
Subjt:  HPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV

A0A5D3BF11 Gag/pol protein2.3e-6348.46Show/hide
Query:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP
        + LLA EKLNG+NY  WK+NLN ILVV+DL FVLTEEC Q P+ NA + SR AYDRWIKAN K  VYIL S+S+VLAKKHES+ TA+EI+DSL+ MFGQP
Subjt:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP

Query:  SFQAWYDALKFIYNSHMKEGTSVREHVL------NLYKV--------SQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQE---
         +   ++ +K+IY   MKEGTS++EHVL      N+++V        +QVSFILESL K+F+ F  NA +NKIE+N TTLLNELQ FQ+L K    +   
Subjt:  SFQAWYDALKFIYNSHMKEGTSVREHVL------NLYKV--------SQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQE---

Query:  --------RETNVATSKRFHRGATNHVCSSFQGIS--PRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQSHPSQELRVPRHSG
                R  NV  S+ +      +  S     S      L   K        E   E  N      D+ S+ST+VVD      Q+HPSQEL  PR SG
Subjt:  --------RETNVATSKRFHRGATNHVCSSFQGIS--PRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQSHPSQELRVPRHSG

Query:  RIVSQPDRYMDLIEIQVFIPDDDV
        R+V Q D Y+ L E Q+ IPDD +
Subjt:  RIVSQPDRYMDLIEIQVFIPDDDV

A0A5D3DQ34 Gag/pol protein1.9e-7048.22Show/hide
Query:  PGVTTHTEGLFI---------------ALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISN
        PGVT H+EG  I               +++A++KLNG+NY  WK+NLN ILVV+DL FVL EEC Q+   NA RA+R AYDRWIK N K  VYIL ++S+
Subjt:  PGVTTHTEGLFI---------------ALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISN

Query:  VLAKKHESMVTAREIIDSLQDMFGQPSFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIE
        +LAKKHES+ T +EI+DSL+ MFGQP +   ++ +K+IY  HMKEGTSVREHVL++               + +QVSFIL+SL K+F+ F  NA +NKIE
Subjt:  VLAKKHESMVTAREIIDSLQDMFGQPSFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIE

Query:  YNLTTLLNELQIFQSLIKNKGQERETNVATSK-RFHRGATNHVCSSFQGISPRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQ
        +NLT  LNELQ FQ+L K KG+E E NVAT+K +F RG+     SS   I P   L + +   K    +   +     T   D+ S+ST+VVD      Q
Subjt:  YNLTTLLNELQIFQSLIKNKGQERETNVATSK-RFHRGATNHVCSSFQGISPRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQ

Query:  SHPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV
        +HPSQEL   R S R+V QPDRY+ L E Q+ IPD+ +
Subjt:  SHPSQELRVPRHSGRIVSQPDRYMDLIEIQVFIPDDDV

A0A6J1DWG6 uncharacterized protein LOC1110250212.3e-6360.56Show/hide
Query:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP
        I LLA++KLNG+NY  WK+NLN ILV++DL FVLTEEC  AP PNA R  RDAYDRW+KAN K  VYIL SIS VL+KKHE + T REI+DSLQ +FGQP
Subjt:  IALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAYDRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQP

Query:  SFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERET
        S    +DA+K++YN  MKEG+SVREHVLN+               ++SQV FI++SL K++ QF  NA+MNKIEY+LTTLLNELQ+++SL+KNKG E E 
Subjt:  SFQAWYDALKFIYNSHMKEGTSVREHVLNLY--------------KVSQVSFILESLLKNFLQFCNNAVMNKIEYNLTTLLNELQIFQSLIKNKGQERET

Query:  NVATS--KRFHRG
        NVAT+  ++FH+G
Subjt:  NVATS--KRFHRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTACATGGCTCAATAGCAAGGTGAATGGAGAAAGTATTTATAGTGAGTTGGAGAAGGACATGTGGCAACACATCTTGCGGTCTCCTCCATTGGTTTACAACTTGAG
ATTTCATACACGGCCTGCGTGTCGTCCTGGAGTGACCACCCATACGGAGGGTCTGTTTATTGCTCTGCTTGCCGCCGAAAAACTTAACGGAGAAAATTACACACAGTGGA
AAACGAACCTTAACATGATACTCGTGGTAAATGATCTTATGTTCGTCTTAACTGAAGAGTGTCTTCAGGCTCCCACACCTAATGCAATCCGAGCCAGTCGGGATGCCTAT
GACAGATGGATCAAGGCCAATGGCAAGGTCAATGTCTACATCTTGGGAAGCATATCTAATGTGCTAGCCAAGAAGCATGAAAGCATGGTTACCGCAAGGGAGATCATTGA
CTCATTGCAGGACATGTTTGGACAACCGTCCTTTCAAGCCTGGTACGATGCCCTCAAGTTCATTTACAATTCCCACATGAAAGAGGGAACATCAGTGCGAGAACATGTTC
TCAATCTGTACAAGGTGAGCCAGGTCAGCTTTATCTTGGAATCTCTTCTGAAGAATTTTCTACAATTCTGTAACAATGCTGTGATGAACAAGATAGAGTACAACCTTACC
ACGCTCTTGAATGAGCTTCAGATCTTTCAGTCTCTTATAAAAAATAAGGGACAGGAAAGGGAGACAAATGTTGCCACCTCAAAACGGTTCCATCGAGGGGCCACTAATCA
TGTTTGTTCTTCTTTTCAGGGAATTAGTCCCCGGAGGCAACTTGATGCTGGAAAGATGACTCTCAAGGTTGTGTTAAATGAGATTTCCGATGAAGCTACAAATACATTAA
CGAGAGTTGTTGATAAAGCTAGCACTTCAACAAGAGTTGTTGATGGCGCTAGTACTTCACGTCAGTCACATCCATCTCAAGAGTTGAGAGTACCTCGACATAGTGGGAGG
ATTGTGTCACAACCTGATCGTTACATGGATTTAATAGAAATCCAGGTCTTCATACCTGATGATGACGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTACATGGCTCAATAGCAAGGTGAATGGAGAAAGTATTTATAGTGAGTTGGAGAAGGACATGTGGCAACACATCTTGCGGTCTCCTCCATTGGTTTACAACTTGAG
ATTTCATACACGGCCTGCGTGTCGTCCTGGAGTGACCACCCATACGGAGGGTCTGTTTATTGCTCTGCTTGCCGCCGAAAAACTTAACGGAGAAAATTACACACAGTGGA
AAACGAACCTTAACATGATACTCGTGGTAAATGATCTTATGTTCGTCTTAACTGAAGAGTGTCTTCAGGCTCCCACACCTAATGCAATCCGAGCCAGTCGGGATGCCTAT
GACAGATGGATCAAGGCCAATGGCAAGGTCAATGTCTACATCTTGGGAAGCATATCTAATGTGCTAGCCAAGAAGCATGAAAGCATGGTTACCGCAAGGGAGATCATTGA
CTCATTGCAGGACATGTTTGGACAACCGTCCTTTCAAGCCTGGTACGATGCCCTCAAGTTCATTTACAATTCCCACATGAAAGAGGGAACATCAGTGCGAGAACATGTTC
TCAATCTGTACAAGGTGAGCCAGGTCAGCTTTATCTTGGAATCTCTTCTGAAGAATTTTCTACAATTCTGTAACAATGCTGTGATGAACAAGATAGAGTACAACCTTACC
ACGCTCTTGAATGAGCTTCAGATCTTTCAGTCTCTTATAAAAAATAAGGGACAGGAAAGGGAGACAAATGTTGCCACCTCAAAACGGTTCCATCGAGGGGCCACTAATCA
TGTTTGTTCTTCTTTTCAGGGAATTAGTCCCCGGAGGCAACTTGATGCTGGAAAGATGACTCTCAAGGTTGTGTTAAATGAGATTTCCGATGAAGCTACAAATACATTAA
CGAGAGTTGTTGATAAAGCTAGCACTTCAACAAGAGTTGTTGATGGCGCTAGTACTTCACGTCAGTCACATCCATCTCAAGAGTTGAGAGTACCTCGACATAGTGGGAGG
ATTGTGTCACAACCTGATCGTTACATGGATTTAATAGAAATCCAGGTCTTCATACCTGATGATGACGTTTAG
Protein sequenceShow/hide protein sequence
MFTWLNSKVNGESIYSELEKDMWQHILRSPPLVYNLRFHTRPACRPGVTTHTEGLFIALLAAEKLNGENYTQWKTNLNMILVVNDLMFVLTEECLQAPTPNAIRASRDAY
DRWIKANGKVNVYILGSISNVLAKKHESMVTAREIIDSLQDMFGQPSFQAWYDALKFIYNSHMKEGTSVREHVLNLYKVSQVSFILESLLKNFLQFCNNAVMNKIEYNLT
TLLNELQIFQSLIKNKGQERETNVATSKRFHRGATNHVCSSFQGISPRRQLDAGKMTLKVVLNEISDEATNTLTRVVDKASTSTRVVDGASTSRQSHPSQELRVPRHSGR
IVSQPDRYMDLIEIQVFIPDDDV