; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g22440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g22440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:16261281..16275123
RNA-Seq ExpressionMoc04g22440
SyntenyMoc04g22440
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031954.1 zf-CCHC domain-containing protein [Cucumis melo var. makuwa]1.6e-3940.97Show/hide
Query:  VTMEEQQANKIKFQYQRRNTSEGQS--TSSRKGTSFFDK-SAIPGTSAGSKGK------TNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECP
        ++  EQ+  +     + +N S   +  TSS K  ++F+K S  P TS   KGK       N + +       K+ N Y+ P+LGKCFRCGQ GHLSN CP
Subjt:  VTMEEQQANKIKFQYQRRNTSEGQS--TSSRKGTSFFDK-SAIPGTSAGSKGK------TNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECP

Query:  QRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHP
        Q+K+I + D+ +++     ++++EEV  +E D G++V+C+L+R+LL P+   +PQRHS F+TRCT+NGKVC++IIDSGS+EN ++ KLV  LN K  PHP
Subjt:  QRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHP

Query:  TPYKADTKRRAQDFKEGDLVMIHLSKG
         PYK    ++  +    ++  + LS G
Subjt:  TPYKADTKRRAQDFKEGDLVMIHLSKG

KAA0047078.1 reverse transcriptase [Cucumis melo var. makuwa]9.6e-4047.83Show/hide
Query:  PGTSAGSKGKTNEDSKAMDQPIK------KTTNTYSRPTLGKCFRCGQVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLER
        P TS   KGK  E     D   +      K  N Y+RP+LGKCFRCGQ GH SN CPQRK+I + D  +++     +E +EE   +E D+G +V+CV++R
Subjt:  PGTSAGSKGKTNEDSKAMDQPIK------KTTNTYSRPTLGKCFRCGQVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLER

Query:  ILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG
        +LL PK  +NPQ HSLF+TRCT+NGKVC++IID+GS+EN V+ KLVTALN K  PHP PYK    ++  +    ++  + LS G
Subjt:  ILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG

KAA0059834.1 uncharacterized protein E6C27_scaffold108G001170 [Cucumis melo var. makuwa]3.7e-3942.62Show/hide
Query:  EEHCRLAIATAVTMEEQQANKIKFQYQRR----NTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGK---TNEDSKAMDQPIK-KTTNTYSRPTLGKCFRCG
        E+H ++A     T+EE     +K   ++     N S+ QS SSR           P TS G K K   T + +K  D   K K+ NTY+RP+L KCFRCG
Subjt:  EEHCRLAIATAVTMEEQQANKIKFQYQRR----NTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGK---TNEDSKAMDQPIK-KTTNTYSRPTLGKCFRCG

Query:  QVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVT
        Q GHLSN CPQR++I++ D    +    D+E +EE  ++E D+G++++ V++R+L+ PK  +NPQRHSLF+TRCT+N +VC++IIDSGS+EN V+ KLVT
Subjt:  QVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVT

Query:  ALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG
         LN K +P+P PYK    R+  +    ++  + LS G
Subjt:  ALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG

TYK16316.1 zf-CCHC domain-containing protein [Cucumis melo var. makuwa]6.3e-3942.22Show/hide
Query:  TMEEQQANKIKFQYQR--------RNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECPQR
        T+EE    ++K   +R        +  S G+ T  +  TS  DK       A    +TN+  K++ +   KT N Y+RP+LGKCFRCG+  HLSN CPQR
Subjt:  TMEEQQANKIKFQYQR--------RNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECPQR

Query:  KSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTP
        K+I + +D        D+E +EE   +E D+ ++++C+++R+L+TPK  +NPQRH+LF+TRCT+NGKVC++II+SGS+EN V+ KLVTALN K  PHP P
Subjt:  KSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTP

Query:  YKADTKRRAQDFKEGDLVMIHLSKG
        YK    ++  +    ++  I LS G
Subjt:  YKADTKRRAQDFKEGDLVMIHLSKG

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]3.7e-3944.59Show/hide
Query:  AIATAVTMEEQQANKIKFQYQRRNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTT------NTYSRPTLGKCFRCGQVGHLSNE
        AI+ A T+EE  A + K    RR+  E  ST S+           P TS  +KGK  ++ +   +  K+ T      N+YSRP+LGKCFRCGQ GHLS+ 
Subjt:  AIATAVTMEEQQANKIKFQYQRRNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTT------NTYSRPTLGKCFRCGQVGHLSNE

Query:  CPQRKSITIVDDAQENDPLCDQ--ESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKL
        CPQRK+I I   A+E   + +   E++EE   +E D+GE+V+CV++R+L+TPK   N QRH LF+TRCT+NG+VC++IIDSGS+EN V+ KLVT LN K 
Subjt:  CPQRKSITIVDDAQENDPLCDQ--ESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKL

Query:  SPHPTPYKADTKRRAQDFKEGDLVMIHLSKG
          HP PYK    R+  +    ++  + LS G
Subjt:  SPHPTPYKADTKRRAQDFKEGDLVMIHLSKG

TrEMBL top hitse value%identityAlignment
A0A5A7SLA2 Zf-CCHC domain-containing protein8.0e-4040.97Show/hide
Query:  VTMEEQQANKIKFQYQRRNTSEGQS--TSSRKGTSFFDK-SAIPGTSAGSKGK------TNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECP
        ++  EQ+  +     + +N S   +  TSS K  ++F+K S  P TS   KGK       N + +       K+ N Y+ P+LGKCFRCGQ GHLSN CP
Subjt:  VTMEEQQANKIKFQYQRRNTSEGQS--TSSRKGTSFFDK-SAIPGTSAGSKGK------TNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECP

Query:  QRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHP
        Q+K+I + D+ +++     ++++EEV  +E D G++V+C+L+R+LL P+   +PQRHS F+TRCT+NGKVC++IIDSGS+EN ++ KLV  LN K  PHP
Subjt:  QRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHP

Query:  TPYKADTKRRAQDFKEGDLVMIHLSKG
         PYK    ++  +    ++  + LS G
Subjt:  TPYKADTKRRAQDFKEGDLVMIHLSKG

A0A5A7UXS4 CCHC-type domain-containing protein1.8e-3942.62Show/hide
Query:  EEHCRLAIATAVTMEEQQANKIKFQYQRR----NTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGK---TNEDSKAMDQPIK-KTTNTYSRPTLGKCFRCG
        E+H ++A     T+EE     +K   ++     N S+ QS SSR           P TS G K K   T + +K  D   K K+ NTY+RP+L KCFRCG
Subjt:  EEHCRLAIATAVTMEEQQANKIKFQYQRR----NTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGK---TNEDSKAMDQPIK-KTTNTYSRPTLGKCFRCG

Query:  QVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVT
        Q GHLSN CPQR++I++ D    +    D+E +EE  ++E D+G++++ V++R+L+ PK  +NPQRHSLF+TRCT+N +VC++IIDSGS+EN V+ KLVT
Subjt:  QVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVT

Query:  ALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG
         LN K +P+P PYK    R+  +    ++  + LS G
Subjt:  ALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG

A0A5D3C3X9 Reverse transcriptase4.7e-4047.83Show/hide
Query:  PGTSAGSKGKTNEDSKAMDQPIK------KTTNTYSRPTLGKCFRCGQVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLER
        P TS   KGK  E     D   +      K  N Y+RP+LGKCFRCGQ GH SN CPQRK+I + D  +++     +E +EE   +E D+G +V+CV++R
Subjt:  PGTSAGSKGKTNEDSKAMDQPIK------KTTNTYSRPTLGKCFRCGQVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLER

Query:  ILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG
        +LL PK  +NPQ HSLF+TRCT+NGKVC++IID+GS+EN V+ KLVTALN K  PHP PYK    ++  +    ++  + LS G
Subjt:  ILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKG

A0A5D3D0X3 Zf-CCHC domain-containing protein3.0e-3942.22Show/hide
Query:  TMEEQQANKIKFQYQR--------RNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECPQR
        T+EE    ++K   +R        +  S G+ T  +  TS  DK       A    +TN+  K++ +   KT N Y+RP+LGKCFRCG+  HLSN CPQR
Subjt:  TMEEQQANKIKFQYQR--------RNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECPQR

Query:  KSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTP
        K+I + +D        D+E +EE   +E D+ ++++C+++R+L+TPK  +NPQRH+LF+TRCT+NGKVC++II+SGS+EN V+ KLVTALN K  PHP P
Subjt:  KSITIVDDAQENDPLCDQESDEEVAYLEPDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTP

Query:  YKADTKRRAQDFKEGDLVMIHLSKG
        YK    ++  +    ++  I LS G
Subjt:  YKADTKRRAQDFKEGDLVMIHLSKG

A0A6J1BU87 uncharacterized protein LOC1110048164.0e-3945.59Show/hide
Query:  LSIKQEGTFAEYRQLYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQVVRKAELIDDINTASKEAEGIMTVEKSQSGPSAFKTATKTLEV
        ++IKQEG+  EY++ +EAL+A  P L +EVLES +LNGL+P++RAEV A +P GLDQ++R+A+LI+D  T S+E   +     S+ G    K A KT E 
Subjt:  LSIKQEGTFAEYRQLYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQVVRKAELIDDINTASKEAEGIMTVEKSQSGPSAFKTATKTLEV

Query:  IPTRTVTLFSKQPNQTQGTGASQAAKKKPPFKKLSEAEFQKRRERGLCFRCEEKYIVGHKCKNRELRVFVLHDDE--MFELEDEMQQEDASGEKGNVLSA
          TR+VTL SK              KK+  +K+L+E E++KR E GLCFRCE+KY VGH+C+N++LRVF++HD+E  M E E++ ++ +   EKG  +  
Subjt:  IPTRTVTLFSKQPNQTQGTGASQAAKKKPPFKKLSEAEFQKRRERGLCFRCEEKYIVGHKCKNRELRVFVLHDDE--MFELEDEMQQEDASGEKGNVLSA

Query:  PTVL
         TV+
Subjt:  PTVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.8e-0439.06Show/hide
Query:  IKQEGTFAEYRQLYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQV-VRKAELI
        I+QEG+  +YR+ +EAL      L  +  E  FL GL P ++  VR L+P G++    R+AEL+
Subjt:  IKQEGTFAEYRQLYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQV-VRKAELI

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding2.6e-0632.29Show/hide
Query:  IKQEGTFAEYRQLYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQVVRKAELIDDINTASKEAEGIMTVEKSQSGPSAFKTATKTL
        I+QEG+  EYR+ +EAL      L  + LE+ FL GL P ++  VR L+P G+ Q++  A+ +++ N+      G+      Q+ P  + T    L
Subjt:  IKQEGTFAEYRQLYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQVVRKAELIDDINTASKEAEGIMTVEKSQSGPSAFKTATKTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTCAAAGAGCTGGAAACCAGGTGTGAGGTGGTGGAGAAGGATGTTGCAGACATGAAGATCCAGATCACATCGCTGCACCAAGACTTCGCTGAGCAAAAG
AAGGTGCTCCAGGAGGTCAACGTGATGCTGAAAGCATTCATGGCTCAAGATATTGGGGGTACCTCGGCGTCTATGTCGATGGGGACTGGGGAGACTTCGGTGACA
AAAGGCAAAACGAGCGGAGAGGTACTTCATCCGGTGAAAGAAGGGAAGCAGTGCGCGCGGCTGTTATCCATCAAGCAAGAGGGGACGTTCGCAGAGTACCGGCAG
TTGTATGAAGCGTTAGCCGCCGCCCTCCCTCAACTCTCCGACGAGGTGTTGGAAAGTGCCTTCCTAAATGGACTGGATCCAGTGATGCGAGCTGAAGTTCGGGCC
CTAGAGCCCAAAGGCCTGGACCAGGTAGTCCGCAAGGCCGAACTTATTGATGATATCAATACGGCCAGTAAGGAAGCGGAGGGAATTATGACCGTTGAGAAAAGC
CAAAGTGGCCCAAGCGCGTTCAAAACGGCCACCAAAACCCTAGAGGTAATCCCTACTCGCACTGTTACTCTTTTTAGCAAACAACCTAACCAAACTCAGGGAACG
GGGGCGTCACAAGCAGCCAAGAAGAAACCACCCTTCAAAAAGCTATCAGAAGCAGAGTTTCAGAAAAGAAGGGAGCGTGGCCTGTGTTTCCGCTGTGAGGAGAAG
TATATTGTAGGCCACAAGTGCAAGAACAGAGAGCTCAGAGTTTTTGTTCTGCACGACGACGAGATGTTCGAACTGGAGGATGAGATGCAACAAGAGGACGCATCG
GGGGAGAAAGGAAATGTTCTATCAGCACCCACAGTATTAGATGTAGAAGTTGTCAAACGAAAAATTGCTGAGGATCCACACTTCTCGAAAGTGTTAGCGGATCTG
GATGATGACCCGGAGGCCACGACAGCTCAGTTTATTCCTCCTCAACTGATCGAGGAGTGGGAATGGATGGTGGAACCTGATTCAGTGTTTGCTTATCAGACCAAT
CCCACCACGGGAGAAGTCGAAGCCTTAATAATTGGAAAGGTTTATCGGAAGAAGAAGCAACCTGGGAGTCTGTGCGCGACATCAGCAGACAATTTCCTGAGTTTC
ACCTTGAGAACAAGTGCTACTGTTTTCCGATTGCTGTTCGTGGGTTTCTTCTCGCCGTCAATCGCCGTCACCATCGCCGTGTGCTTTCGTTACCGCCAGTTGTCA
CCCGTCGCCTTCGCCCCTCTGTTCCAACAGTATCAAGAGTCATTGACGGCATCGTCCTCTAATCCGATTACTGCCATCGCTGAGTCAGATGGCACCACTTCTCCT
ATTCTTGGCTCTGACACAGATCTTACGACGAAGAAGACTATTGGTAAAGGGCGTGAATCCAATGGCCTCTACACGTTTAATACACAAATACCTACAGCTACCATT
TGCACTCGAGTACCCTCTCATTTCGAAGAACATTGTCGTTTAGCTATAGCAACTGCTGTCACTATGGAGGAACAGCAAGCCAATAAAATCAAATTTCAGTACCAA
AGGCGCAACACAAGTGAGGGGCAGTCAACTAGTTCAAGAAAAGGTACATCTTTCTTTGATAAATCTGCTATTCCAGGTACCTCTGCGGGTTCTAAAGGGAAGACC
AACGAGGATAGTAAAGCAATGGATCAACCTATCAAGAAAACAACTAACACTTACAGTAGACCCACCTTGGGTAAGTGCTTCCGTTGTGGGCAAGTAGGACATTTG
TCCAATGAATGCCCCCAACGGAAGTCGATCACCATTGTGGATGATGCTCAAGAAAATGATCCATTATGTGATCAAGAATCCGATGAAGAAGTTGCTTATCTTGAA
CCCGATGAGGGTGAACAAGTTACATGTGTTTTGGAGCGCATTCTTCTAACTCCAAAAACTATATCCAATCCCCAAAGGCATTCGTTGTTCCGAACAAGATGTACT
GTCAATGGCAAAGTTTGCAACATAATAATAGATAGTGGGAGTACTGAAAATGTGGTATCTAGCAAGCTAGTGACAGCTTTAAATCCAAAACTCTCTCCTCATCCT
ACTCCTTATAAGGCTGATACTAAGAGACGTGCCCAAGACTTCAAGGAAGGAGACCTTGTTATGATCCACCTCTCCAAAGGTCGTCTTCTAGCTGGTTCTGCTCAT
AAATTGCAGAAAAAGAAATTTGGTCCATTCGAGGTCCTCAAGCAATACGGCCCAAATGCTTATAAAATTCAACTTCCTCCAGATTTCAACATAAGTCCAATATTC
AATGTTGCAGACATTCATCCATATGCTGCTCCGGATTCCTTTGTGTTAGCCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAGTCAAAGAGCTGGAAACCAGGTGTGAGGTGGTGGAGAAGGATGTTGCAGACATGAAGATCCAGATCACATCGCTGCACCAAGACTTCGCTGAGCAAAAG
AAGGTGCTCCAGGAGGTCAACGTGATGCTGAAAGCATTCATGGCTCAAGATATTGGGGGTACCTCGGCGTCTATGTCGATGGGGACTGGGGAGACTTCGGTGACA
AAAGGCAAAACGAGCGGAGAGGTACTTCATCCGGTGAAAGAAGGGAAGCAGTGCGCGCGGCTGTTATCCATCAAGCAAGAGGGGACGTTCGCAGAGTACCGGCAG
TTGTATGAAGCGTTAGCCGCCGCCCTCCCTCAACTCTCCGACGAGGTGTTGGAAAGTGCCTTCCTAAATGGACTGGATCCAGTGATGCGAGCTGAAGTTCGGGCC
CTAGAGCCCAAAGGCCTGGACCAGGTAGTCCGCAAGGCCGAACTTATTGATGATATCAATACGGCCAGTAAGGAAGCGGAGGGAATTATGACCGTTGAGAAAAGC
CAAAGTGGCCCAAGCGCGTTCAAAACGGCCACCAAAACCCTAGAGGTAATCCCTACTCGCACTGTTACTCTTTTTAGCAAACAACCTAACCAAACTCAGGGAACG
GGGGCGTCACAAGCAGCCAAGAAGAAACCACCCTTCAAAAAGCTATCAGAAGCAGAGTTTCAGAAAAGAAGGGAGCGTGGCCTGTGTTTCCGCTGTGAGGAGAAG
TATATTGTAGGCCACAAGTGCAAGAACAGAGAGCTCAGAGTTTTTGTTCTGCACGACGACGAGATGTTCGAACTGGAGGATGAGATGCAACAAGAGGACGCATCG
GGGGAGAAAGGAAATGTTCTATCAGCACCCACAGTATTAGATGTAGAAGTTGTCAAACGAAAAATTGCTGAGGATCCACACTTCTCGAAAGTGTTAGCGGATCTG
GATGATGACCCGGAGGCCACGACAGCTCAGTTTATTCCTCCTCAACTGATCGAGGAGTGGGAATGGATGGTGGAACCTGATTCAGTGTTTGCTTATCAGACCAAT
CCCACCACGGGAGAAGTCGAAGCCTTAATAATTGGAAAGGTTTATCGGAAGAAGAAGCAACCTGGGAGTCTGTGCGCGACATCAGCAGACAATTTCCTGAGTTTC
ACCTTGAGAACAAGTGCTACTGTTTTCCGATTGCTGTTCGTGGGTTTCTTCTCGCCGTCAATCGCCGTCACCATCGCCGTGTGCTTTCGTTACCGCCAGTTGTCA
CCCGTCGCCTTCGCCCCTCTGTTCCAACAGTATCAAGAGTCATTGACGGCATCGTCCTCTAATCCGATTACTGCCATCGCTGAGTCAGATGGCACCACTTCTCCT
ATTCTTGGCTCTGACACAGATCTTACGACGAAGAAGACTATTGGTAAAGGGCGTGAATCCAATGGCCTCTACACGTTTAATACACAAATACCTACAGCTACCATT
TGCACTCGAGTACCCTCTCATTTCGAAGAACATTGTCGTTTAGCTATAGCAACTGCTGTCACTATGGAGGAACAGCAAGCCAATAAAATCAAATTTCAGTACCAA
AGGCGCAACACAAGTGAGGGGCAGTCAACTAGTTCAAGAAAAGGTACATCTTTCTTTGATAAATCTGCTATTCCAGGTACCTCTGCGGGTTCTAAAGGGAAGACC
AACGAGGATAGTAAAGCAATGGATCAACCTATCAAGAAAACAACTAACACTTACAGTAGACCCACCTTGGGTAAGTGCTTCCGTTGTGGGCAAGTAGGACATTTG
TCCAATGAATGCCCCCAACGGAAGTCGATCACCATTGTGGATGATGCTCAAGAAAATGATCCATTATGTGATCAAGAATCCGATGAAGAAGTTGCTTATCTTGAA
CCCGATGAGGGTGAACAAGTTACATGTGTTTTGGAGCGCATTCTTCTAACTCCAAAAACTATATCCAATCCCCAAAGGCATTCGTTGTTCCGAACAAGATGTACT
GTCAATGGCAAAGTTTGCAACATAATAATAGATAGTGGGAGTACTGAAAATGTGGTATCTAGCAAGCTAGTGACAGCTTTAAATCCAAAACTCTCTCCTCATCCT
ACTCCTTATAAGGCTGATACTAAGAGACGTGCCCAAGACTTCAAGGAAGGAGACCTTGTTATGATCCACCTCTCCAAAGGTCGTCTTCTAGCTGGTTCTGCTCAT
AAATTGCAGAAAAAGAAATTTGGTCCATTCGAGGTCCTCAAGCAATACGGCCCAAATGCTTATAAAATTCAACTTCCTCCAGATTTCAACATAAGTCCAATATTC
AATGTTGCAGACATTCATCCATATGCTGCTCCGGATTCCTTTGTGTTAGCCTCTTAA
Protein sequenceShow/hide protein sequence
MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNVMLKAFMAQDIGGTSASMSMGTGETSVTKGKTSGEVLHPVKEGKQCARLLSIKQEGTFAEYRQ
LYEALAAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQVVRKAELIDDINTASKEAEGIMTVEKSQSGPSAFKTATKTLEVIPTRTVTLFSKQPNQTQGT
GASQAAKKKPPFKKLSEAEFQKRRERGLCFRCEEKYIVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNVLSAPTVLDVEVVKRKIAEDPHFSKVLADL
DDDPEATTAQFIPPQLIEEWEWMVEPDSVFAYQTNPTTGEVEALIIGKVYRKKKQPGSLCATSADNFLSFTLRTSATVFRLLFVGFFSPSIAVTIAVCFRYRQLS
PVAFAPLFQQYQESLTASSSNPITAIAESDGTTSPILGSDTDLTTKKTIGKGRESNGLYTFNTQIPTATICTRVPSHFEEHCRLAIATAVTMEEQQANKIKFQYQ
RRNTSEGQSTSSRKGTSFFDKSAIPGTSAGSKGKTNEDSKAMDQPIKKTTNTYSRPTLGKCFRCGQVGHLSNECPQRKSITIVDDAQENDPLCDQESDEEVAYLE
PDEGEQVTCVLERILLTPKTISNPQRHSLFRTRCTVNGKVCNIIIDSGSTENVVSSKLVTALNPKLSPHPTPYKADTKRRAQDFKEGDLVMIHLSKGRLLAGSAH
KLQKKKFGPFEVLKQYGPNAYKIQLPPDFNISPIFNVADIHPYAAPDSFVLAS