; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g16630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g16630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:11000879..11007748
RNA-Seq ExpressionMoc03g16630
SyntenyMoc03g16630
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05765.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-7157.82Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECP  PA NANR  R AYDRW+KANEKARVYILAS+S+VL+KKHE LAT +EIMDSL+ +FGQP   I + AIKY+Y  RMKEG SVREHVL+M
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQ
        M+HFN+A+VNG V+ + SQV FI++SLPKS+  F+TN  +NKIE+NLTTLLNEL+ +++L K KG E EANVATT K KF  GSSS SK  PS   + I+
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQ

Query:  KKKKDKGRGKAPAAVKGKEKI----AMKMGTGKEIVPNTS---PRKELRRKSKETSSWRQLGEDEATLRVGSGEL
        K    KG+GK P   KGK+        + G     + N      +K+  +K KETSSW++L E E TL+VG+GE+
Subjt:  KKKKDKGRGKAPAAVKGKEKI----AMKMGTGKEIVPNTS---PRKELRRKSKETSSWRQLGEDEATLRVGSGEL

TYK06658.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]4.2e-7250.46Show/hide
Query:  MASLRSDKVLPDLSKLESLDGSNYRCWSQKLLIFFEQLEVDYVLT-------------------------TTVADVA-----------------------
        M+   S+K+LPDLSKLE LDG+NYR WSQKLLIFFEQLEVDYVLT                         TTVAD                         
Subjt:  MASLRSDKVLPDLSKLESLDGSNYRCWSQKLLIFFEQLEVDYVLT-------------------------TTVADVA-----------------------

Query:  TAAGVPAGATAAA--------GAVAAGVAATAAYNHMRTEEANRQKDKFSYQFVNSVNANLIESSVANKDRFKGKRNLVAKERIQKKKGFQFKSFSGRIE
        T++  PAG  + +              +      +HM TEEANR KDK + Q +NSVNA L+ESS+ N+DR K ++    K   ++    QFK+  G+I+
Subjt:  TAAGVPAGATAAA--------GAVAAGVAATAAYNHMRTEEANRQKDKFSYQFVNSVNANLIESSVANKDRFKGKRNLVAKERIQKKKGFQFKSFSGRIE

Query:  KSKIVCFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLG
        K K+VC+VCGK GHKSYQCNQRKG+P QK  PQA++AE D  I A +VEANL+ENK+DWIL+TGAS HFC+N +L H+++D+   ECVFMGNSA+AGV+G
Subjt:  KSKIVCFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLG

Query:  KGQILLKLTSSKTLSLSDVLYIPSLRRNL
        KG+++LKLTS KTLSLS+VLY+PSLRRNL
Subjt:  KGQILLKLTSSKTLSLSDVLYIPSLRRNL

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]2.4e-8890.96Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLAT REIMDSLQALFGQPSTT+M+DA+KYVYNCRMKEG SVREHVLNM
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGS
        MVHFNVAEVN  VM EISQVGFI+QSLPKSYFQFK N MMNKIEY+LTTLLNEL+LYESLLKNKGFEAEANVATTSKRKFH+G SS S
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGS

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia]5.6e-6988.96Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECPPA APN+N+TVRDA+DRW KANEKARVYILASIS+VLSKKHE LATAREIMDSLQALFGQPST+I++DAIKYVYNCRMKEG SVREHVLNM
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNEL
        MVHFNVAEVN AVM EISQVGFI+QSLPKSYFQFKTN MMNKIEY+LTTLLNEL
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNEL

XP_038876370.1 uncharacterized protein LOC120068812, partial [Benincasa hispida]6.4e-7373.44Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECPP PA NANRTV   +DRW KA EKA+VYIL SIS++LSKKHE++ TA+EIM+SLQALFGQPS++ M+DAIK+VYNCRMKEGP+VREHVL+M
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRP
        MVHFN+ EVN AVM E SQVGFI++SLPKS+FQF+ N MMNKI+YNLTT+LNEL++Y+ LLKNKG EAEANVATTSKR+F +  +SG+KS P
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRP

TrEMBL top hitse value%identityAlignment
A0A5A7US12 Gag/pol protein1.8e-6856.09Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +F+LTEECP  PA N N+  R AYDRW+KANEKARVYILAS+S+VL+KKHE LATA+EIMDSL+ +F QP   + ++AIKY+Y  RMKEG SVREHVL+M
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQ
        M+HFN+AEVNG  + E +QV FI++SLPKS+  F+ N  +NKIE+NLTTLLNEL+ +++L K KG E EANVATT K KF  GSSS SKSRP    + I+
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQ

Query:  KKKKDKGRGKAPAAVKGKEKIAMKMGTGKEIVPNTSPRKELRRKSK--ETSSWRQLGEDEATLRVGSGELI
        KK+K     K P   KGK+               T   KE   K    ETSSW++L E + TL+VG+GE++
Subjt:  KKKKDKGRGKAPAAVKGKEKIAMKMGTGKEIVPNTSPRKELRRKSK--ETSSWRQLGEDEATLRVGSGELI

A0A5D3C306 Gag/pol protein5.9e-7257.82Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECP  PA NANR  R AYDRW+KANEKARVYILAS+S+VL+KKHE LAT +EIMDSL+ +FGQP   I + AIKY+Y  RMKEG SVREHVL+M
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQ
        M+HFN+A+VNG V+ + SQV FI++SLPKS+  F+TN  +NKIE+NLTTLLNEL+ +++L K KG E EANVATT K KF  GSSS SK  PS   + I+
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQ

Query:  KKKKDKGRGKAPAAVKGKEKI----AMKMGTGKEIVPNTS---PRKELRRKSKETSSWRQLGEDEATLRVGSGEL
        K    KG+GK P   KGK+        + G     + N      +K+  +K KETSSW++L E E TL+VG+GE+
Subjt:  KKKKDKGRGKAPAAVKGKEKI----AMKMGTGKEIVPNTS---PRKELRRKSKETSSWRQLGEDEATLRVGSGEL

A0A5D3C674 Ty1-copia retrotransposon protein2.0e-7250.46Show/hide
Query:  MASLRSDKVLPDLSKLESLDGSNYRCWSQKLLIFFEQLEVDYVLT-------------------------TTVADVA-----------------------
        M+   S+K+LPDLSKLE LDG+NYR WSQKLLIFFEQLEVDYVLT                         TTVAD                         
Subjt:  MASLRSDKVLPDLSKLESLDGSNYRCWSQKLLIFFEQLEVDYVLT-------------------------TTVADVA-----------------------

Query:  TAAGVPAGATAAA--------GAVAAGVAATAAYNHMRTEEANRQKDKFSYQFVNSVNANLIESSVANKDRFKGKRNLVAKERIQKKKGFQFKSFSGRIE
        T++  PAG  + +              +      +HM TEEANR KDK + Q +NSVNA L+ESS+ N+DR K ++    K   ++    QFK+  G+I+
Subjt:  TAAGVPAGATAAA--------GAVAAGVAATAAYNHMRTEEANRQKDKFSYQFVNSVNANLIESSVANKDRFKGKRNLVAKERIQKKKGFQFKSFSGRIE

Query:  KSKIVCFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLG
        K K+VC+VCGK GHKSYQCNQRKG+P QK  PQA++AE D  I A +VEANL+ENK+DWIL+TGAS HFC+N +L H+++D+   ECVFMGNSA+AGV+G
Subjt:  KSKIVCFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLG

Query:  KGQILLKLTSSKTLSLSDVLYIPSLRRNL
        KG+++LKLTS KTLSLS+VLY+PSLRRNL
Subjt:  KGQILLKLTSSKTLSLSDVLYIPSLRRNL

A0A6J1DWG6 uncharacterized protein LOC1110250211.2e-8890.96Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLAT REIMDSLQALFGQPSTT+M+DA+KYVYNCRMKEG SVREHVLNM
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGS
        MVHFNVAEVN  VM EISQVGFI+QSLPKSYFQFK N MMNKIEY+LTTLLNEL+LYESLLKNKGFEAEANVATTSKRKFH+G SS S
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGS

A0A6J1E205 uncharacterized protein LOC1110252582.7e-6988.96Show/hide
Query:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM
        +FVLTEECPPA APN+N+TVRDA+DRW KANEKARVYILASIS+VLSKKHE LATAREIMDSLQALFGQPST+I++DAIKYVYNCRMKEG SVREHVLNM
Subjt:  KFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNM

Query:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNEL
        MVHFNVAEVN AVM EISQVGFI+QSLPKSYFQFKTN MMNKIEY+LTTLLNEL
Subjt:  MVHFNVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNEL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.0e-0936.09Show/hide
Query:  CFVCGKPGHKSYQC-NQRKGKPE---QKHVPQ--ASIAETDEVIAAVVVE---ANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMA
        C+ C +PGH    C N RKGK E   QK+     A +   D V+  +  E    +L   +S+W++DT AS H    RDLF  +  +     V MGN++ +
Subjt:  CFVCGKPGHKSYQC-NQRKGKPE---QKHVPQ--ASIAETDEVIAAVVVE---ANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMA

Query:  GVLGKGQILLKLTSSKTLSLSDVLYIPSLRRNL
         + G G I +K     TL L DV ++P LR NL
Subjt:  GVLGKGQILLKLTSSKTLSLSDVLYIPSLRRNL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-0428.23Show/hide
Query:  CFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLGKGQIL
        C +C   GH + +C Q        +  Q++   T     A +   N   N ++W+LD+GA+ H  S+ +     Q  TG + V + + +   +   G   
Subjt:  CFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGASGHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLGKGQIL

Query:  LKLTSSKTLSLSDVLYIPSLRRNL
        L  TSS++L L+ VLY+P++ +NL
Subjt:  LKLTSSKTLSLSDVLYIPSLRRNL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCACTAAGATCCGACAAAGTGCTTCCAGATCTATCCAAACTCGAATCATTAGATGGATCAAATTACCGATGCTGGTCTCAGAAGCTACTCATCTTTTTC
GAGCAATTGGAGGTAGACTACGTCCTGACAACCACCGTTGCCGATGTAGCCACCGCCGCAGGAGTTCCCGCCGGAGCCACTGCCGCCGCTGGAGCAGTCGCCGCT
GGAGTTGCCGCCACCGCCGCTTATAATCATATGCGCACAGAAGAAGCCAATAGACAAAAGGATAAGTTCTCTTATCAGTTTGTCAATTCAGTTAATGCTAACTTA
ATTGAATCTTCTGTTGCAAATAAAGACAGATTCAAAGGTAAAAGAAACCTGGTTGCCAAGGAAAGAATCCAAAAGAAGAAAGGATTTCAATTCAAGAGTTTTAGT
GGAAGAATTGAAAAATCAAAGATAGTATGTTTCGTGTGCGGGAAACCTGGACATAAATCCTACCAGTGTAACCAAAGGAAAGGAAAGCCAGAACAAAAGCATGTT
CCACAAGCCAGCATTGCAGAGACTGACGAAGTCATTGCTGCTGTGGTAGTGGAGGCAAATTTGGTGGAAAACAAGTCAGACTGGATTCTCGACACTGGAGCTTCC
GGACACTTTTGTTCAAACCGAGACCTCTTCCATGAGTTTCAAGACTCCACTGGCTGTGAATGCGTCTTTATGGGTAACTCAGCTATGGCTGGAGTCCTTGGTAAA
GGACAGATTCTTCTAAAACTTACATCTAGTAAAACTTTATCCTTAAGTGATGTTTTGTACATTCCTTCTCTACGTAGGAACTTGGCAATGACACCTATCTTTTAC
ATGCTAGTGGAGTTTGAAGTGGAAGACATGCAAGGCATGTTTTTGCATGCATTTGGTCTTTATATAGGTGATGAAGATCACTTGGATAAGGCTAATGCTTACTTG
CATAAGTTTGTTTTAACGGAGGAGTGTCCTCCAGCCCCTGCCCCTAATGCAAACCGAACAGTTCGGGATGCCTATGATAGATGGGTTAAAGCTAATGAAAAAGCT
CGTGTCTACATTTTAGCCAGTATATCAGAAGTATTGTCCAAAAAGCACGAGAGATTAGCTACTGCAAGAGAGATCATGGACTCTCTACAAGCCTTGTTTGGACAA
CCATCAACAACTATCATGTATGATGCGATTAAGTATGTTTACAATTGCAGAATGAAGGAAGGACCTTCCGTAAGGGAGCATGTTTTGAACATGATGGTTCACTTC
AATGTTGCAGAAGTGAACGGTGCAGTCATGAAAGAAATAAGTCAAGTTGGATTCATCGTGCAATCTCTTCCGAAGAGTTATTTTCAATTCAAAACGAATGTTATG
ATGAATAAGATTGAATACAACCTGACAACCCTTCTGAATGAACTCAAACTTTACGAATCTTTATTGAAAAACAAAGGTTTTGAGGCTGAGGCAAATGTTGCTACT
ACCTCAAAGAGGAAATTCCACGAGGGATCTTCCTCTGGGAGTAAATCTAGACCTTCTTATCAGAAGAAAGGAATTCAGAAGAAGAAGAAGGACAAAGGGAGGGGG
AAGGCTCCGGCTGCGGTAAAAGGCAAGGAAAAAATTGCAATGAAAATGGGCACTGGAAAAGAAATTGTCCCAAATACCTCGCCGAGAAAAGAGCTGAGAAGGAAA
AGCAAGGAAACTAGTTCCTGGAGACAGCTTGGAGAAGATGAGGCCACTCTTCGGGTTGGATCAGGGGAGCTCATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCACTAAGATCCGACAAAGTGCTTCCAGATCTATCCAAACTCGAATCATTAGATGGATCAAATTACCGATGCTGGTCTCAGAAGCTACTCATCTTTTTC
GAGCAATTGGAGGTAGACTACGTCCTGACAACCACCGTTGCCGATGTAGCCACCGCCGCAGGAGTTCCCGCCGGAGCCACTGCCGCCGCTGGAGCAGTCGCCGCT
GGAGTTGCCGCCACCGCCGCTTATAATCATATGCGCACAGAAGAAGCCAATAGACAAAAGGATAAGTTCTCTTATCAGTTTGTCAATTCAGTTAATGCTAACTTA
ATTGAATCTTCTGTTGCAAATAAAGACAGATTCAAAGGTAAAAGAAACCTGGTTGCCAAGGAAAGAATCCAAAAGAAGAAAGGATTTCAATTCAAGAGTTTTAGT
GGAAGAATTGAAAAATCAAAGATAGTATGTTTCGTGTGCGGGAAACCTGGACATAAATCCTACCAGTGTAACCAAAGGAAAGGAAAGCCAGAACAAAAGCATGTT
CCACAAGCCAGCATTGCAGAGACTGACGAAGTCATTGCTGCTGTGGTAGTGGAGGCAAATTTGGTGGAAAACAAGTCAGACTGGATTCTCGACACTGGAGCTTCC
GGACACTTTTGTTCAAACCGAGACCTCTTCCATGAGTTTCAAGACTCCACTGGCTGTGAATGCGTCTTTATGGGTAACTCAGCTATGGCTGGAGTCCTTGGTAAA
GGACAGATTCTTCTAAAACTTACATCTAGTAAAACTTTATCCTTAAGTGATGTTTTGTACATTCCTTCTCTACGTAGGAACTTGGCAATGACACCTATCTTTTAC
ATGCTAGTGGAGTTTGAAGTGGAAGACATGCAAGGCATGTTTTTGCATGCATTTGGTCTTTATATAGGTGATGAAGATCACTTGGATAAGGCTAATGCTTACTTG
CATAAGTTTGTTTTAACGGAGGAGTGTCCTCCAGCCCCTGCCCCTAATGCAAACCGAACAGTTCGGGATGCCTATGATAGATGGGTTAAAGCTAATGAAAAAGCT
CGTGTCTACATTTTAGCCAGTATATCAGAAGTATTGTCCAAAAAGCACGAGAGATTAGCTACTGCAAGAGAGATCATGGACTCTCTACAAGCCTTGTTTGGACAA
CCATCAACAACTATCATGTATGATGCGATTAAGTATGTTTACAATTGCAGAATGAAGGAAGGACCTTCCGTAAGGGAGCATGTTTTGAACATGATGGTTCACTTC
AATGTTGCAGAAGTGAACGGTGCAGTCATGAAAGAAATAAGTCAAGTTGGATTCATCGTGCAATCTCTTCCGAAGAGTTATTTTCAATTCAAAACGAATGTTATG
ATGAATAAGATTGAATACAACCTGACAACCCTTCTGAATGAACTCAAACTTTACGAATCTTTATTGAAAAACAAAGGTTTTGAGGCTGAGGCAAATGTTGCTACT
ACCTCAAAGAGGAAATTCCACGAGGGATCTTCCTCTGGGAGTAAATCTAGACCTTCTTATCAGAAGAAAGGAATTCAGAAGAAGAAGAAGGACAAAGGGAGGGGG
AAGGCTCCGGCTGCGGTAAAAGGCAAGGAAAAAATTGCAATGAAAATGGGCACTGGAAAAGAAATTGTCCCAAATACCTCGCCGAGAAAAGAGCTGAGAAGGAAA
AGCAAGGAAACTAGTTCCTGGAGACAGCTTGGAGAAGATGAGGCCACTCTTCGGGTTGGATCAGGGGAGCTCATCTAA
Protein sequenceShow/hide protein sequence
MASLRSDKVLPDLSKLESLDGSNYRCWSQKLLIFFEQLEVDYVLTTTVADVATAAGVPAGATAAAGAVAAGVAATAAYNHMRTEEANRQKDKFSYQFVNSVNANL
IESSVANKDRFKGKRNLVAKERIQKKKGFQFKSFSGRIEKSKIVCFVCGKPGHKSYQCNQRKGKPEQKHVPQASIAETDEVIAAVVVEANLVENKSDWILDTGAS
GHFCSNRDLFHEFQDSTGCECVFMGNSAMAGVLGKGQILLKLTSSKTLSLSDVLYIPSLRRNLAMTPIFYMLVEFEVEDMQGMFLHAFGLYIGDEDHLDKANAYL
HKFVLTEECPPAPAPNANRTVRDAYDRWVKANEKARVYILASISEVLSKKHERLATAREIMDSLQALFGQPSTTIMYDAIKYVYNCRMKEGPSVREHVLNMMVHF
NVAEVNGAVMKEISQVGFIVQSLPKSYFQFKTNVMMNKIEYNLTTLLNELKLYESLLKNKGFEAEANVATTSKRKFHEGSSSGSKSRPSYQKKGIQKKKKDKGRG
KAPAAVKGKEKIAMKMGTGKEIVPNTSPRKELRRKSKETSSWRQLGEDEATLRVGSGELI