CuGenDBv2

Lag0005942 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005942
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: chr6:34125601..34135003
RNA-Seq Expression: Lag0005942
Synteny: Lag0005942
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR005162 - Retrotransposon gag domain
IPR038765 - Papain-like cysteine peptidase superfamily
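The genome location in this record is written as `chr:start..end`. A minimal sketch of how such a string could be split and the gene span computed is given here. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr6:34125601..34135003")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 9,403 bp on chr6, which is longer than the 2,997 nt CDS listed further down the record.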


Homology
GenBank top hits (e value, % identity, alignment)
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus] (e value: 5.6e-93; % identity: 57.53)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +     +PDV SVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        +K+NLLMK VDER  EIA LK Q+Q RE AESSQTP    +DKGK VV E+Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQH+ HFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++++SWEQ+E+EFLNRFYSTRR VSM ELT+TKQRKG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        EPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

XP_031737053.1 uncharacterized protein LOC116402138 [Cucumis sativus] (e value: 5.6e-93; % identity: 57.53)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +     +PDV SVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        +K+NLLMK VDER  EIA LK Q+Q RE AESSQTP    +DKGK VV E+Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQH+ HFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++++SWEQ+E+EFLNRFYSTRR VSM ELT+TKQRKG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        EPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus] (e value: 5.6e-93; % identity: 57.53)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +     +PDV SVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        +K+NLLMK VDER  EIA LK Q+Q RE AESSQTP    +DKGK VV E+Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQH+ HFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++++SWEQ+E+EFLNRFYSTRR VSM ELT+TKQRKG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        EPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus] (e value: 5.6e-93; % identity: 57.53)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +     +PDV SVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        +K+NLLMK VDER  EIA LK Q+Q RE AESSQTP    +DKGK VV E+Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQH+ HFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++++SWEQ+E+EFLNRFYSTRR VSM ELT+TKQRKG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        EPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus] (e value: 5.6e-93; % identity: 57.53)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +     +PDV SVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        +K+NLLMK VDER  EIA LK Q+Q RE AESSQTP    +DKGK VV E+Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQH+ HFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++++SWEQ+E+EFLNRFYSTRR VSM ELT+TKQRKG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        EPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SI89 Ty3-gypsy retrotransposon protein (e value: 6.0e-85; % identity: 61.94)
Query:  PLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTL-NPDVTSVMMADVD-HDERMAEMEKKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQ
        P  VA  I + I +  K  + +K+NP  +     S++  + +  P++ SVM+ DVD  ++RMAE+EKK+N+LMK V+ER  EIA LKN +++R+ AESS 
Subjt:  PLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTL-NPDVTSVMMADVD-HDERMAEMEKKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQ

Query:  TPTAGKNDKGKAVVHEDQSQHSA---------------LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPD
        T T    +KGKA++ E Q Q+S                +P GYQPPKFQQFDGKGNPKQH+ HF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEP+
Subjt:  TPTAGKNDKGKAVVHEDQSQHSA---------------LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPD

Query:  TMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        ++DSWEQ+ER+FLNRFYSTRRIVSM ELT+TKQRKGEPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ
Subjt:  TMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

A0A5A7TZU9 Ribonuclease H (e value: 1.9e-86; % identity: 55.62)
Query:  SDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTL-NPDVTSVMMADVD-HDERMAEMEKKLNLLMKAVD
        SD    P TRSRS+ I+  ED  P  VA  I + I +  K  + +K+NP  +     S++  + +  P++ SVM+ DVD  ++RMAE+EKK+N+LMKAV+
Subjt:  SDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTL-NPDVTSVMMADVD-HDERMAEMEKKLNLLMKAVD

Query:  ERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA--------------------------------------------LPTGYQPPKF
        ER  EIA LKN +++R+ AESS T T    +KGKA++ E Q Q+S                                             +P GYQPPKF
Subjt:  ERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA--------------------------------------------LPTGYQPPKF

Query:  QQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRA
        QQFDGKGNPKQH+ HF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEP+++DSWEQ+ER+FLNRFYSTRRIVSM ELT+TKQRKGEPVIDYINRWRA
Subjt:  QQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRA

Query:  LSLDCKDRLSEVSSVEMCTQ
        LSLDCKDRL+E+S+VEMCTQ
Subjt:  LSLDCKDRLSEVSSVEMCTQ

A0A5D3BDK1 Ty3-gypsy retrotransposon protein (e value: 1.3e-84; % identity: 58.19)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KKA +  + +SD+YTGP+TR+RS+GI I+E      +A  I K + ES K  + +K+NPL+++    S + K   +PDV SV+M D+  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSALPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQ
        +K+N LMK V+ER  EI  L+ Q++ R+ AES+QTP     DKGK  V E+Q Q  ++          QFDGKGNPKQHI  FVETCENA +RGD LV+Q
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSALPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        FVR+LKGNAF+WYT+LEP+ +DSWEQ+E+EFLNRFYSTRR VSM ELT+TKQ+KGEPVIDYINRWRALSL+CKDRL+E+S+VEMCTQ
Subjt:  FVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ

A0A5D3CD35 Ty3-gypsy retrotransposon protein (e value: 9.6e-83; % identity: 48.8)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KK  +  + +SD+YTGP+T S S+GI   +D     VA  I K + ES K  + +K+NPL+++    S + K   +PDV  VMMAD+  +  MA+ME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        KK+N LMKA++E   EI  L+ Q++ RE AESSQTP     DKGK VV ++Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQHI HFVETC NAG+RGD LV+QFVR+LKGNAF+WYT+LE + +D+WEQ+E+EFL+RFYS RR VSM ELT+TKQ+KG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQAT-IVFLKLFSAAACTTLPRRYSHAAAARTPLPSRCSHDSPTP
        EPVIDYINRWRALSLDCKDRL+E+SSVEMCTQ      L +       T     + A      + SR + D P P
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQAT-IVFLKLFSAAACTTLPRRYSHAAAARTPLPSRCSHDSPTP

A0A5D3DIN4 Ty3-gypsy retrotransposon protein (e value: 7.4e-83; % identity: 52.71)
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME
        MA KK  +K + +SDSY G VT+S  +   ++E      +  +  + + ES K R+ ++DNPLF +  P S       + +V SVMM DV  +  M EME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEME

Query:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------
        KK+N LMK V+ER  EIA LK+Q++  E AESS+TP     DKGK VV E+Q Q  +                                           
Subjt:  KKLNLLMKAVDERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSA-------------------------------------------

Query:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG
          +P GYQPPKFQQFDGKGNPKQHI HFVE CENAG+RGD LV+QFVR+LKGNAF+WYTDLEP+ +DSWEQ+E EFLN FYSTRR+VSM ELT+TKQRKG
Subjt:  --LPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
        EPVIDYINRWRALSLDCKD+L+E+S+VEMCTQ
Subjt:  EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ
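The alignments above follow the usual BLAST pairwise layout, in which the unlabelled middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of counting identities and positives from one block is given here, using the final block of the last TrEMBL hit as input. The function name `midline_stats` is hypothetical. The % identity values reported on the page are computed over each whole alignment, so a single block will not reproduce them.

```python
def midline_stats(query, midline):
    """Return (identical, positive, length) for one alignment block.

    A letter on the midline marks an identity, '+' a conservative
    substitution, and a space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, len(query)


QUERY = "EPVIDYINRWRALSLDCKDRLSEVSSVEMCTQ"
MIDLINE = "EPVIDYINRWRALSLDCKDRL+E+S+VEMCTQ"

identical, positive, length = midline_stats(QUERY, MIDLINE)
print(f"{identical}/{length} identical, {positive}/{length} positive")
```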

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCACCAAAGAAAGCTCCCGCGAAGGTTACCACATCAAGCGACTCTTACACTGGTCCTGTCACTCGTAGTCGCTCCCAAGGAATTGAGATCAGGGAGGATCATACTCC
TCTTGCTGTTGCAAGCAGGATCTCAAAGTTGATTGAAGAATCCTCTAAGGATAGGGTTGCAGTCAAAGACAACCCACTGTTCGAATCTGTCGTTCCAACATCTAAGCAGC
CAAAGGACACACTAAATCCTGATGTGACGTCCGTCATGATGGCTGATGTAGACCATGATGAAAGAATGGCAGAGATGGAAAAGAAACTCAATCTCTTAATGAAGGCAGTT
GATGAGAGATATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAGAACCGAGAAGTGGCTGAGTCTAGCCAGACCCCTACTGCAGGAAAGAATGATAAAGGGAAAGCTGT
CGTGCATGAGGACCAATCGCAACACTCCGCACTGCCCACTGGGTATCAACCTCCCAAGTTCCAACAGTTTGATGGAAAGGGCAACCCAAAGCAGCACATCACTCACTTTG
TTGAAACTTGTGAGAACGCTGGTACTCGGGGAGATTTGTTGGTTAAGCAATTTGTTCGAACACTGAAAGGAAACGCATTCGATTGGTACACTGACCTGGAGCCTGATACA
ATGGACAGTTGGGAACAGATGGAGAGAGAGTTTCTAAATCGCTTCTACAGTACGAGGCGAATAGTTAGTATGACAGAACTCACGAGCACAAAGCAGCGAAAGGGTGAGCC
AGTCATTGATTACATCAACCGTTGGAGAGCCTTGAGTCTCGACTGTAAAGATAGACTCTCCGAGGTGTCTTCTGTTGAGATGTGCACCCAAGCCACCATTGTTTTCTTGA
AGCTCTTCTCAGCCGCCGCTTGCACGACTCTCCCTCGCCGCTACTCGCACGCCGCCGCCGCTCGCACGCCTCTCCCCAGCCGCTGCTCGCACGACTCTCCCACGCCGCCG
CCACTCGCACATAGCCGCTCGCACGCCTCTCCCCAGCCACCACTCGCTCCGTTCAACCGCCGTCTCTCCCCAGCCGCACCTCGCACTGTTCAACCGCGCCTCGCACAGTT
CAGCCGCGCCTCTCCCCAGCCGCACCTCTCACCGTTCAGCCGCGCCTCTCCCCAATCGCCGCTCGCACGCCTCTCCCCAGCCGCCGCTCGCACCATTCAGCCGCCGTTAA
AGCGCCATTCAGCAGCCCAACGCTGCCGTCTGTTGTTTAGCTCGCACAAGCACTGCCGTTTAGCCCACCATTTGACTTTTGCCCATCGGTTTGGGAACTTGGTTGTTGGG
GTTAATTCGAACACAAACCAGCAGAAATTTGGAGTTATTCGATATCATGGGATGTTGTTGGTGATATCATTAAAAACATTCACAATTTATTCAATTGACTCTCTGAAGCA
TGGCTTTCGTGATGATGTAAAGAACATGGTTAATACGGCTATAAGAATGTTCTATTCTCAAACTAATATAGGAAGTCTCCCCTTCAAATGGGTATATGTTAAGTGTGCTC
AACAATTGGGATCTACGGAGTGTGGATACTACACTCTTAAATTTATACGAGATATAGTATCCCATAGGAGTAGAGTGATTACAGATGTGCCTCAAGTTCGGTGTTTCACT
CGCCCTAAGTTCGTTGTTCCTGCTTCTTCAAGTTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCCTTCCTC
CAAGTTTGAAGGTTCTCACGTCGCTTTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTCACGCGC
TTCGTTGCAGTTCCTTCCTCAGAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTAGCAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCTTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCT
GCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTGCTACTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGTGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTC
TCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTGTTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTCCTC
ATGCTACGCTCGGCTGCGCTACTTCCTAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAAGACGTGGCGGCGGAAAAAGTCCAAGGAGCACGTCATGTCCT
TGAACTCATGTTGAAAGATGTGGCAGCGGAAAAATTTCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAATACGTGGCGGCGACAAAAGTCCAAGGAGCACGTCATG
TCCGTGAACTCATGTTGAAAGATGTGGCGGCGGCAAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAAGATGTAGCGACAGCAAAAGTCCAAGGAGCACGT
CATGTCCTTGAACTCATGCTAAAAGACGTGGCGGCGGCAAAAGTCCAAGGAACGTGTCCCAAAGCGTGGCGGCGACACAAGTCCAAGGTACATGTCCCAAAGTCAGGGAC
ATGTCCCTGCACTCGTACTGAAAGCGCCGAGCGGAGCTCCCTCTCGTCTCTCTCTCTAACTCTCTTTCCCCTCGCCATTGCCACCGCAAAGAAGAACGTCGTCGTGTTTC
GGCAAGGAGCGATTCGCCAAAAAAAGAGACTGAATCTCTCGCTCCCTCAGTCTCTCTCTCATCTCCCTCTCGTCGCCGTCGCCGCCTGGAGATGTGATCGTCGTCGCCGC
CTGAAGATTGGCTTGTGCCGCGAGTGA
mRNA sequence
ATGGCACCAAAGAAAGCTCCCGCGAAGGTTACCACATCAAGCGACTCTTACACTGGTCCTGTCACTCGTAGTCGCTCCCAAGGAATTGAGATCAGGGAGGATCATACTCC
TCTTGCTGTTGCAAGCAGGATCTCAAAGTTGATTGAAGAATCCTCTAAGGATAGGGTTGCAGTCAAAGACAACCCACTGTTCGAATCTGTCGTTCCAACATCTAAGCAGC
CAAAGGACACACTAAATCCTGATGTGACGTCCGTCATGATGGCTGATGTAGACCATGATGAAAGAATGGCAGAGATGGAAAAGAAACTCAATCTCTTAATGAAGGCAGTT
GATGAGAGATATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAGAACCGAGAAGTGGCTGAGTCTAGCCAGACCCCTACTGCAGGAAAGAATGATAAAGGGAAAGCTGT
CGTGCATGAGGACCAATCGCAACACTCCGCACTGCCCACTGGGTATCAACCTCCCAAGTTCCAACAGTTTGATGGAAAGGGCAACCCAAAGCAGCACATCACTCACTTTG
TTGAAACTTGTGAGAACGCTGGTACTCGGGGAGATTTGTTGGTTAAGCAATTTGTTCGAACACTGAAAGGAAACGCATTCGATTGGTACACTGACCTGGAGCCTGATACA
ATGGACAGTTGGGAACAGATGGAGAGAGAGTTTCTAAATCGCTTCTACAGTACGAGGCGAATAGTTAGTATGACAGAACTCACGAGCACAAAGCAGCGAAAGGGTGAGCC
AGTCATTGATTACATCAACCGTTGGAGAGCCTTGAGTCTCGACTGTAAAGATAGACTCTCCGAGGTGTCTTCTGTTGAGATGTGCACCCAAGCCACCATTGTTTTCTTGA
AGCTCTTCTCAGCCGCCGCTTGCACGACTCTCCCTCGCCGCTACTCGCACGCCGCCGCCGCTCGCACGCCTCTCCCCAGCCGCTGCTCGCACGACTCTCCCACGCCGCCG
CCACTCGCACATAGCCGCTCGCACGCCTCTCCCCAGCCACCACTCGCTCCGTTCAACCGCCGTCTCTCCCCAGCCGCACCTCGCACTGTTCAACCGCGCCTCGCACAGTT
CAGCCGCGCCTCTCCCCAGCCGCACCTCTCACCGTTCAGCCGCGCCTCTCCCCAATCGCCGCTCGCACGCCTCTCCCCAGCCGCCGCTCGCACCATTCAGCCGCCGTTAA
AGCGCCATTCAGCAGCCCAACGCTGCCGTCTGTTGTTTAGCTCGCACAAGCACTGCCGTTTAGCCCACCATTTGACTTTTGCCCATCGGTTTGGGAACTTGGTTGTTGGG
GTTAATTCGAACACAAACCAGCAGAAATTTGGAGTTATTCGATATCATGGGATGTTGTTGGTGATATCATTAAAAACATTCACAATTTATTCAATTGACTCTCTGAAGCA
TGGCTTTCGTGATGATGTAAAGAACATGGTTAATACGGCTATAAGAATGTTCTATTCTCAAACTAATATAGGAAGTCTCCCCTTCAAATGGGTATATGTTAAGTGTGCTC
AACAATTGGGATCTACGGAGTGTGGATACTACACTCTTAAATTTATACGAGATATAGTATCCCATAGGAGTAGAGTGATTACAGATGTGCCTCAAGTTCGGTGTTTCACT
CGCCCTAAGTTCGTTGTTCCTGCTTCTTCAAGTTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCCTTCCTC
CAAGTTTGAAGGTTCTCACGTCGCTTTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTCACGCGC
TTCGTTGCAGTTCCTTCCTCAGAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTAGCAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCTTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCT
GCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTGCTACTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGTGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTC
TCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTGTTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTCCTC
ATGCTACGCTCGGCTGCGCTACTTCCTAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAAGACGTGGCGGCGGAAAAAGTCCAAGGAGCACGTCATGTCCT
TGAACTCATGTTGAAAGATGTGGCAGCGGAAAAATTTCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAATACGTGGCGGCGACAAAAGTCCAAGGAGCACGTCATG
TCCGTGAACTCATGTTGAAAGATGTGGCGGCGGCAAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAAGATGTAGCGACAGCAAAAGTCCAAGGAGCACGT
CATGTCCTTGAACTCATGCTAAAAGACGTGGCGGCGGCAAAAGTCCAAGGAACGTGTCCCAAAGCGTGGCGGCGACACAAGTCCAAGGTACATGTCCCAAAGTCAGGGAC
ATGTCCCTGCACTCGTACTGAAAGCGCCGAGCGGAGCTCCCTCTCGTCTCTCTCTCTAACTCTCTTTCCCCTCGCCATTGCCACCGCAAAGAAGAACGTCGTCGTGTTTC
GGCAAGGAGCGATTCGCCAAAAAAAGAGACTGAATCTCTCGCTCCCTCAGTCTCTCTCTCATCTCCCTCTCGTCGCCGTCGCCGCCTGGAGATGTGATCGTCGTCGCCGC
CTGAAGATTGGCTTGTGCCGCGAGTGA
Protein sequence
MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFESVVPTSKQPKDTLNPDVTSVMMADVDHDERMAEMEKKLNLLMKAV
DERYLEIAYLKNQLQNREVAESSQTPTAGKNDKGKAVVHEDQSQHSALPTGYQPPKFQQFDGKGNPKQHITHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDT
MDSWEQMEREFLNRFYSTRRIVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSEVSSVEMCTQATIVFLKLFSAAACTTLPRRYSHAAAARTPLPSRCSHDSPTPP
PLAHSRSHASPQPPLAPFNRRLSPAAPRTVQPRLAQFSRASPQPHLSPFSRASPQSPLARLSPAAARTIQPPLKRHSAAQRCRLLFSSHKHCRLAHHLTFAHRFGNLVVG
VNSNTNQQKFGVIRYHGMLLVISLKTFTIYSIDSLKHGFRDDVKNMVNTAIRMFYSQTNIGSLPFKWVYVKCAQQLGSTECGYYTLKFIRDIVSHRSRVITDVPQVRCFT
RPKFVVPASSSSKVLTRCVAVLSPQVRRFSAAAVPSSKFEGSHVALLQFLPPSSKVLTCFAAVPSSKFEGSHALRCSSFLRVRRFSRASLQFLPPSSKVLTRFVAVPSSK
FEGSHALRCSSFSPSSKVHALRFSSFSQIRRFSRASLQFLPPSSKVLCYCSSFLQVRRFSRAPLQFLPPSLKVLTSLRCSSFLQVRRFSRASLQLFPPSSKFLPPSSKVL
MLRSAALLPKVQGARHVLELMLKDVAAEKVQGARHVLELMLKDVAAEKFQGARHVLELMLKYVAATKVQGARHVRELMLKDVAAAKVQGARHVLELMLKDVATAKVQGAR
HVLELMLKDVAAAKVQGTCPKAWRRHKSKVHVPKSGTCPCTRTESAERSSLSSLSLTLFPLAIATAKKNVVVFRQGAIRQKKRLNLSLPQSLSHLPLVAVAAWRCDRRRR
LKIGLCRE
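The protein sequence should be the translation of the CDS listed earlier in this record. A minimal sketch of that check with the standard genetic code is given here, applied only to the first 36 and last 27 nucleotides of the CDS rather than the full sequence. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard code for this nuclear gene is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        for i in range(0, usable, 3)
    )


CDS_START = "ATGGCACCAAAGAAAGCTCCCGCGAAGGTTACCACA"  # first 36 nt of the CDS
CDS_END = "CTGAAGATTGGCTTGTGCCGCGAGTGA"            # last 27 nt of the CDS

print(translate(CDS_START))  # MAPKKAPAKVTT
print(translate(CDS_END))    # LKIGLCRE*
```

Both fragments agree with the start (`MAPKKAPAKVTT`) and end (`LKIGLCRE`) of the protein sequence, with the final `TGA` read as the stop codon.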