; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEnzymatic polyprotein
Genome locationchr8:12857481..12858444
RNA-Seq ExpressionMoc08g16740
SyntenyMoc08g16740
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001995 - Peptidase A2A, retrovirus, catalytic
IPR018061 - Retropepsins


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]2.0e-8862.46Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        RS+++ +S ++SS++ D I+++ +E  S+EE F+SQSDSS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVKTLK+EV ++KQRL  LE A +  Q S+A++ E++S  E     + LL      IN ISK+QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.6e-8862.12Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        RS+++ +S ++SS++ D I+++ +E  S+EE F+SQSDSS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVKTLK+EV ++KQRL  LE A +  Q S+A++ E++S  E     + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

KAA0068171.1 Enzymatic polyprotein [Cucumis melo var. makuwa]5.0e-8761.77Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        R+++E SS ++SS++ D I+++ +E+S+ EE F+SQS+SS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVK LKKEV ++KQRL  LE A +  Q+S+A + E++S        + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.6e-8862.12Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        RS+++ +S ++SS++ D I+++ +E  S+EE F+SQSDSS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVKTLK+EV ++KQRL  LE A +  Q S+A++ E++S  E     + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

TYK21091.1 Enzymatic polyprotein [Cucumis melo var. makuwa]5.0e-8761.77Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        R+++E SS ++SS++ D I+++ +E+S+ EE F+SQS+SS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVK LKKEV ++KQRL  LE A +  Q+S+A + E++S        + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

TrEMBL top hitse value%identityAlignment
A0A5A7UR29 Enzymatic polyprotein9.9e-8962.46Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        RS+++ +S ++SS++ D I+++ +E  S+EE F+SQSDSS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVKTLK+EV ++KQRL  LE A +  Q S+A++ E++S  E     + LL      IN ISK+QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

A0A5A7UX67 Enzymatic polyprotein2.2e-8862.12Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        RS+++ +S ++SS++ D I+++ +E  S+EE F+SQSDSS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVKTLK+EV ++KQRL  LE A +  Q S+A++ E++S  E     + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

A0A5A7VRE0 Reverse transcriptase2.4e-8761.77Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        R+++E SS ++SS++ D I+++ +E+S+ EE F+SQS+SS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVK LKKEV ++KQRL  LE A +  Q+S+A + E++S        + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

A0A5D3BEY3 Enzymatic polyprotein2.2e-8862.12Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        RS+++ +S ++SS++ D I+++ +E  S+EE F+SQSDSS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDE-DSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVKTLK+EV ++KQRL  LE A +  Q S+A++ E++S  E     + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPE-----QTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

A0A5D3DBS1 Reverse transcriptase2.4e-8761.77Show/hide
Query:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ
        R+++E SS ++SS++ D I+++ +E+S+ EE F+SQS+SS+++  IPCTG CA KC GHINVI+K+QE LFDLIEQ+PDE +KR CL+KL++SLE +A Q
Subjt:  RSEEEYSSYSKSSTDNDDIDLINDEDSN-EETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQ

Query:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ
        +  + N I YS+QDIL +VKGEAK  IQ++DLH+EVK LKKEV ++KQRL  LE A +  Q+S+A + E++S        + LL      IN IS++QNQ
Subjt:  RKPELNLIEYSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSS-----KPEQTLLTGSPSGINYISKVQNQ

Query:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF
        KWMSKI+FK++DFQLE  ALIDSGADQNVIQEGLVP +YFEKTKE LS A GNPLNI++KLSKVHI   DVCL+NTFILVKNLNEG+ILGTPF
Subjt:  KWMSKIIFKIRDFQLETYALIDSGADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGAAGCGAAGAAGAATACTCTTCCTATTCCAAGTCTTCAACAGATAATGATGATATTGATCTCATAAACGATGAAGACTCAAATGAGGAAACCTTTTTCTCTCA
AAGTGATTCCTCTGAAGAAGATGAAGTCATTCCTTGCACTGGCCATTGCGCGGAAAAATGCCACGGCCACATCAATGTCATTAGCAAAAATCAAGAAGCACTCTTTGATC
TAATTGAGCAATTACCCGATGAAATCTCTAAGCGAATGTGTCTCATCAAACTAAGGGAAAGTCTCGAAGCTGAAGCTCTTCAAAGGAAGCCAGAATTGAACCTAATAGAA
TATTCTTTCCAAGATATTCTAAAAAAGGTCAAAGGAGAGGCTAAGAAATCTATCCAAATTAAAGATCTCCACAATGAAGTGAAGACTCTAAAAAAGGAAGTTGTTGATAG
TAAGCAACGTCTTTCTACTCTTGAATATGCCTTAAAAAAATCTCAAGAGTCAGAAGCCACGGAAGGAGAAACTTCCTCAAAACCTGAACAAACTTTACTGACTGGTTCAC
CAAGCGGAATCAATTACATTAGTAAAGTTCAAAACCAGAAGTGGATGTCCAAGATTATCTTCAAAATCAGAGATTTCCAGTTGGAGACTTACGCCCTTATCGACTCTGGA
GCCGATCAGAATGTCATCCAAGAAGGTTTAGTACCTTGTAAATACTTCGAGAAAACCAAAGAAGTTCTTAGCGAAGCCGAGGGAAATCCGTTGAACATCAAGTATAAATT
ATCTAAGGTCCACATCTACAACGACGACGTGTGCCTTATCAACACCTTCATCCTGGTCAAAAACCTTAATGAAGGAGTCATACTAGGTACCCCTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGAAGCGAAGAAGAATACTCTTCCTATTCCAAGTCTTCAACAGATAATGATGATATTGATCTCATAAACGATGAAGACTCAAATGAGGAAACCTTTTTCTCTCA
AAGTGATTCCTCTGAAGAAGATGAAGTCATTCCTTGCACTGGCCATTGCGCGGAAAAATGCCACGGCCACATCAATGTCATTAGCAAAAATCAAGAAGCACTCTTTGATC
TAATTGAGCAATTACCCGATGAAATCTCTAAGCGAATGTGTCTCATCAAACTAAGGGAAAGTCTCGAAGCTGAAGCTCTTCAAAGGAAGCCAGAATTGAACCTAATAGAA
TATTCTTTCCAAGATATTCTAAAAAAGGTCAAAGGAGAGGCTAAGAAATCTATCCAAATTAAAGATCTCCACAATGAAGTGAAGACTCTAAAAAAGGAAGTTGTTGATAG
TAAGCAACGTCTTTCTACTCTTGAATATGCCTTAAAAAAATCTCAAGAGTCAGAAGCCACGGAAGGAGAAACTTCCTCAAAACCTGAACAAACTTTACTGACTGGTTCAC
CAAGCGGAATCAATTACATTAGTAAAGTTCAAAACCAGAAGTGGATGTCCAAGATTATCTTCAAAATCAGAGATTTCCAGTTGGAGACTTACGCCCTTATCGACTCTGGA
GCCGATCAGAATGTCATCCAAGAAGGTTTAGTACCTTGTAAATACTTCGAGAAAACCAAAGAAGTTCTTAGCGAAGCCGAGGGAAATCCGTTGAACATCAAGTATAAATT
ATCTAAGGTCCACATCTACAACGACGACGTGTGCCTTATCAACACCTTCATCCTGGTCAAAAACCTTAATGAAGGAGTCATACTAGGTACCCCTTTTTAA
Protein sequenceShow/hide protein sequence
MDRSEEEYSSYSKSSTDNDDIDLINDEDSNEETFFSQSDSSEEDEVIPCTGHCAEKCHGHINVISKNQEALFDLIEQLPDEISKRMCLIKLRESLEAEALQRKPELNLIE
YSFQDILKKVKGEAKKSIQIKDLHNEVKTLKKEVVDSKQRLSTLEYALKKSQESEATEGETSSKPEQTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETYALIDSG
ADQNVIQEGLVPCKYFEKTKEVLSEAEGNPLNIKYKLSKVHIYNDDVCLINTFILVKNLNEGVILGTPF