; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018093 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018093
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCACTA en-spm transposon protein
Genome locationchr5:15992326..15995531
RNA-Seq ExpressionLag0018093
SyntenyLag0018093
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia]3.0e-5858.72Show/hide
Query:  VEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQK
        VEAR NPPER+TN EDWN LCDRWETPEWKE   KNK +R+ LPFNHRAG KSFLQLQHELKIKE  D+  VDLF ESH+ E+DG VND A+DAY  MQ 
Subjt:  VEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQK

Query:  IIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPS-----SSSSVTSSQ-YEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWS
        +I   TQEG   +   +  ++VLG R  H+KGLG+ P+P+     SSS+VTSS  YEKELEKKVE ME EM +MK         N  LK   S WE RW+
Subjt:  IIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPS-----SSSSVTSSQ-YEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWS

Query:  DIQNLLGRGQREDGPSNN
        +I   +  G++ DGPSNN
Subjt:  DIQNLLGRGQREDGPSNN

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]4.6e-9147.86Show/hide
Query:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD
        E+  L  + V G + V ES  +QD  L+   L   +  + +   ++Q    +   ++  FI    E     A GSR  SR   RG  RRTRGHSRN+ELD
Subjt:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD

Query:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------
        R+V +HGRIRIEI E++GK VC  AT+FS AI TI R+T+PL C  W  V K+VRD V    +                                     
Subjt:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------

Query:  --------------------------------EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDV
                                        EAR  PP+RIT+  DWN+LC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+KIKE RDV
Subjt:  --------------------------------EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDV

Query:  DQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQM
        DQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE  T +   +V KQVLGHRSG+IKGLG +PKPSSSSSVTS  Q +KELEKK+EKME EM QM
Subjt:  DQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQM

Query:  KASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSN
        KA+   M E+NVAL SQ SMWE RW++IQN+LGRGQ +DG SN
Subjt:  KASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]3.4e-9450.96Show/hide
Query:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD
        E+  L  + V G + V ES  +QD  L+   L   +  + +   ++Q    +   ++  FI    E     A GSR  SR   RG  RRTRGHSRN+ELD
Subjt:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD

Query:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------
        R+V +HGRIRIEI E++GK VC  AT+FS AI TI R+T+PL C  W  V K+VRD V    +                                     
Subjt:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------

Query:  -----EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYL
             EAR  PP+RIT+  DWN+LC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+KIKE RDVDQVDLF +SHFCE+DGWVN+ AKDAYL
Subjt:  -----EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYL

Query:  EMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSD
        EMQ++++ S QE  T +   +V KQVLGHRSG+IKGLG +PKPSSSSSVTS  Q +KELEKK+EKME EM QMKA+   M E+NVAL SQ SMWE RW++
Subjt:  EMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSD

Query:  IQNLLGRGQREDGPSN
        IQN+LGRGQ +DG SN
Subjt:  IQNLLGRGQREDGPSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida]3.9e-7442.76Show/hide
Query:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD
        E+  L  + V G + V ES  +QD  L+   L   +  + +   ++Q    +   ++  FI    E     A GSR  SR   RG  RRTRGHSRN+ELD
Subjt:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD

Query:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------
        R+V +HGRIRIEI E++GK VC  AT+FS AI TI R+T+PL C  W  V K+VRD V    +                                     
Subjt:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------

Query:  --------------------------------EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDV
                                        EAR  PP+RIT+  DWN+LC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+KIKE RDV
Subjt:  --------------------------------EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDV

Query:  DQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMK
        DQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE                           DP P S     S + +KELEKK+EKME EM QMK
Subjt:  DQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMK

Query:  ASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSN
        A+   M E+NVAL SQ SMWE RW++IQN+LGRGQ +DG SN
Subjt:  ASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSN

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]4.6e-9147.86Show/hide
Query:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD
        E+  L  + V G + V ES  +QD  L+   L   +  + +   ++Q    +   ++  FI    E     A GSR  SR   RG  RRTRGHSRN+ELD
Subjt:  ENDDLELLEVPGANEVDES--VQDVILSRDDLEPSV-VIQSQVSRQQLNKTVSSNNEDDFINDEPE---NAAFGSRDRSRN-QRGRGRRTRGHSRNIELD

Query:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------
        R+V +HGRIRIEI E++GK VC  AT+FS AI TI R+T+PL C  W  V K+VRD V    +                                     
Subjt:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFV-------------------------------------

Query:  --------------------------------EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDV
                                        EAR  PP+RIT+  DWN+LC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+KIKE RDV
Subjt:  --------------------------------EARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDV

Query:  DQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQM
        DQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE  T +   +V KQVLGHRSG+IKGLG +PKPSSSSSVTS  Q +KELEKK+EKME EM QM
Subjt:  DQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQM

Query:  KASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSN
        KA+   M E+NVAL SQ SMWE RW++IQN+LGRGQ +DG SN
Subjt:  KASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSN

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ3 Transposase8.9e-4838.58Show/hide
Query:  RDRSRNQRGRGRRTRGHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAV------------------
        R +SR ++ R R  RG+ RNIELD++V  HG+++IEI E+ GK V  +A + +  I T  R+T+ LSC  W+A+P  V++ +                  
Subjt:  RDRSRNQRGRGRRTRGHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAV------------------

Query:  ------------------------KAPFVEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQV
                                   F+EAR NPP++IT+ EDWN++CDRWET  WK+K + NK SRS + FNH  G KSFLQ++HEL+ K+  DVD+V
Subjt:  ------------------------KAPFVEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQV

Query:  DLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASD
        ++F E+HF E++GW+ND AKDAY    +II +ST+ G   I   K  K VLG  S  I  L      S  S+V+S++         EK + EM  +K   
Subjt:  DLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASD

Query:  ADMHESNVALKSQFSMWESRWSDIQNLLG-RGQREDG
            E N  L  + + WE RW+DI+  +G RG R  G
Subjt:  ADMHESNVALKSQFSMWESRWSDIQNLLG-RGQREDG

A0A5A7T3V0 CACTA en-spm transposon protein2.2e-5438.72Show/hide
Query:  EAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEP---ENAAFGSRDRSRNQRGRGRRTR--------
        E E+ E+D LELLE   + EVDES+ D+   R D+EP+VV   ++  Q       S  +DDFINDEP   E++     D+++       +T         
Subjt:  EAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEP---ENAAFGSRDRSRNQRGRGRRTR--------

Query:  --GHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFVEARQNPPERITNLEDWNILCDRWETP
          G+ RNIELD++V  HG+++IEI+E+ GK V  +A   +  I T  R+T+PLSC   +AVP  V++ V            +RIT+ EDWN++CDRWET 
Subjt:  --GHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFVEARQNPPERITNLEDWNILCDRWETP

Query:  EWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRS
         WK+K + NK SRS + FNH    KSFLQ++HELK K+  DVD+V++FHE+HF E++GW+ND AK+AY    +II +ST+ G   I   K  + VLG RS
Subjt:  EWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRS

Query:  GHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSDIQNLLGRG------QREDGPSN
                   P S  S+ S+     +    EK + EM  +K       E N  L  + + WE RW+D++  L  G      + ++ PSN
Subjt:  GHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSDIQNLLGRG------QREDGPSN

A0A5A7TRX4 DUF4216 domain-containing protein4.2e-5839.04Show/hide
Query:  LEEAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEPENAAFGSRDRSRNQRGRGRRTRGHSRNIELD
        + E+E+ E+D  ELLE   +  VDES+ D+   R D+EP+VV   +   Q       S  +DDFINDE E       D        GR  RG+ RNIELD
Subjt:  LEEAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEPENAAFGSRDRSRNQRGRGRRTRGHSRNIELD

Query:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAV------------------------------------------
        ++V  HG+I+IEI E+ GK V  +A + +  I T  R+T+PLSC  W+AVP  VR+ V                                          
Subjt:  RYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAV------------------------------------------

Query:  KAPFVEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYL
            VEAR NP  RIT+ EDWN++CDRWET  WK+K + NK S S + FNH  G KSFLQ++HELK K+  DVD++++FHE+HF E++GW ND AKDAYL
Subjt:  KAPFVEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYL

Query:  EMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRW
        EMQ+II +ST+ G   I   K  + VLG RS           P S  S+ S+     +    EK + EM  +K       E+N  L  + + WE  +
Subjt:  EMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRW

A0A5D3B974 DUF4216 domain-containing protein2.2e-5438.72Show/hide
Query:  EAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEP---ENAAFGSRDRSRNQRGRGRRTR--------
        E E+ E+D LELLE   + EVDES+ D+   R D+EP+VV   ++  Q       S  +DDFINDEP   E++     D+++       +T         
Subjt:  EAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEP---ENAAFGSRDRSRNQRGRGRRTR--------

Query:  --GHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFVEARQNPPERITNLEDWNILCDRWETP
          G+ RNIELD++V  HG+++IEI+E+ GK V  +A   +  I T  R+T+PLSC   +AVP  V++ V            +RIT+ EDWN++CDRWET 
Subjt:  --GHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFVEARQNPPERITNLEDWNILCDRWETP

Query:  EWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRS
         WK+K + NK SRS + FNH    KSFLQ++HELK K+  DVD+V++FHE+HF E++GW+ND AK+AY    +II +ST+ G   I   K  + VLG RS
Subjt:  EWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRS

Query:  GHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSDIQNLLGRG------QREDGPSN
                   P S  S+ S+     +    EK + EM  +K       E N  L  + + WE RW+D++  L  G      + ++ PSN
Subjt:  GHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSDIQNLLGRG------QREDGPSN

A0A6J1DUH3 uncharacterized protein LOC1110232121.5e-5858.72Show/hide
Query:  VEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQK
        VEAR NPPER+TN EDWN LCDRWETPEWKE   KNK +R+ LPFNHRAG KSFLQLQHELKIKE  D+  VDLF ESH+ E+DG VND A+DAY  MQ 
Subjt:  VEARQNPPERITNLEDWNILCDRWETPEWKEKADKNKNSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQK

Query:  IIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPS-----SSSSVTSSQ-YEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWS
        +I   TQEG   +   +  ++VLG R  H+KGLG+ P+P+     SSS+VTSS  YEKELEKKVE ME EM +MK         N  LK   S WE RW+
Subjt:  IIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPS-----SSSSVTSSQ-YEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWS

Query:  DIQNLLGRGQREDGPSNN
        +I   +  G++ DGPSNN
Subjt:  DIQNLLGRGQREDGPSNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACAACGAAGCTTGCGGCGGCGATTCCTACAGTGGAAGGAGGAGCGAAAAATCGTCATCTCGAAGAAGCTGAAGACCAAGAGAATGACGATTTAGAGTTACTAGA
AGTCCCTGGGGCAAATGAGGTTGATGAATCCGTCCAGGATGTCATATTGAGTAGGGACGACCTGGAACCCAGTGTCGTCATTCAAAGCCAAGTGAGCAGACAACAATTGA
ATAAAACTGTGTCCTCCAACAATGAAGACGACTTTATAAACGATGAGCCTGAAAATGCTGCGTTTGGATCTAGAGATCGTTCAAGAAATCAACGAGGAAGAGGTAGGCGG
ACTAGAGGACATAGTCGGAATATTGAACTAGATCGATATGTGGCTCTTCATGGGAGGATCAGAATTGAGATCACCGAGCAGATTGGAAAATCAGTATGCGGTTGGGCTAC
AAGGTTTAGTGGCGCTATTAGTACCATAACAAGGAGCACGGTTCCTTTGAGTTGTGCGACATGGAGGGCTGTACCAAAACAAGTACGGGACGCTGTGAAGGCCCCATTTG
TTGAAGCCCGTCAAAATCCACCCGAGAGGATTACAAACCTTGAAGATTGGAATATTCTATGTGATCGATGGGAGACACCTGAGTGGAAGGAAAAAGCAGATAAGAATAAA
AATAGTCGATCGAACCTTCCATTCAACCATCGAGCTGGGCCGAAGTCATTTCTCCAACTACAACATGAATTGAAAATCAAAGAGAATCGGGATGTTGACCAGGTAGATTT
GTTCCATGAAAGTCATTTTTGTGAAAGAGATGGATGGGTCAACGATGTTGCCAAAGATGCATATCTAGAGATGCAAAAAATCATAGATCAATCAACACAAGAAGGCGCAA
CATTGATTCCCCCAAACAAAGTATATAAGCAAGTGTTGGGTCATCGATCAGGCCACATCAAAGGCCTAGGTTGGGACCCAAAACCCAGCTCATCATCCAGTGTCACATCA
TCACAATATGAAAAAGAACTAGAAAAGAAGGTTGAGAAGATGGAAGGTGAAATGGAACAGATGAAGGCTTCTGACGCAGATATGCATGAATCAAATGTTGCCCTGAAGTC
ACAATTTTCGATGTGGGAAAGTAGATGGTCTGACATTCAAAACTTGTTGGGGCGAGGTCAGAGAGAAGATGGACCTTCAAACAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAACAACGAAGCTTGCGGCGGCGATTCCTACAGTGGAAGGAGGAGCGAAAAATCGTCATCTCGAAGAAGCTGAAGACCAAGAGAATGACGATTTAGAGTTACTAGA
AGTCCCTGGGGCAAATGAGGTTGATGAATCCGTCCAGGATGTCATATTGAGTAGGGACGACCTGGAACCCAGTGTCGTCATTCAAAGCCAAGTGAGCAGACAACAATTGA
ATAAAACTGTGTCCTCCAACAATGAAGACGACTTTATAAACGATGAGCCTGAAAATGCTGCGTTTGGATCTAGAGATCGTTCAAGAAATCAACGAGGAAGAGGTAGGCGG
ACTAGAGGACATAGTCGGAATATTGAACTAGATCGATATGTGGCTCTTCATGGGAGGATCAGAATTGAGATCACCGAGCAGATTGGAAAATCAGTATGCGGTTGGGCTAC
AAGGTTTAGTGGCGCTATTAGTACCATAACAAGGAGCACGGTTCCTTTGAGTTGTGCGACATGGAGGGCTGTACCAAAACAAGTACGGGACGCTGTGAAGGCCCCATTTG
TTGAAGCCCGTCAAAATCCACCCGAGAGGATTACAAACCTTGAAGATTGGAATATTCTATGTGATCGATGGGAGACACCTGAGTGGAAGGAAAAAGCAGATAAGAATAAA
AATAGTCGATCGAACCTTCCATTCAACCATCGAGCTGGGCCGAAGTCATTTCTCCAACTACAACATGAATTGAAAATCAAAGAGAATCGGGATGTTGACCAGGTAGATTT
GTTCCATGAAAGTCATTTTTGTGAAAGAGATGGATGGGTCAACGATGTTGCCAAAGATGCATATCTAGAGATGCAAAAAATCATAGATCAATCAACACAAGAAGGCGCAA
CATTGATTCCCCCAAACAAAGTATATAAGCAAGTGTTGGGTCATCGATCAGGCCACATCAAAGGCCTAGGTTGGGACCCAAAACCCAGCTCATCATCCAGTGTCACATCA
TCACAATATGAAAAAGAACTAGAAAAGAAGGTTGAGAAGATGGAAGGTGAAATGGAACAGATGAAGGCTTCTGACGCAGATATGCATGAATCAAATGTTGCCCTGAAGTC
ACAATTTTCGATGTGGGAAAGTAGATGGTCTGACATTCAAAACTTGTTGGGGCGAGGTCAGAGAGAAGATGGACCTTCAAACAATTAG
Protein sequenceShow/hide protein sequence
MATTKLAAAIPTVEGGAKNRHLEEAEDQENDDLELLEVPGANEVDESVQDVILSRDDLEPSVVIQSQVSRQQLNKTVSSNNEDDFINDEPENAAFGSRDRSRNQRGRGRR
TRGHSRNIELDRYVALHGRIRIEITEQIGKSVCGWATRFSGAISTITRSTVPLSCATWRAVPKQVRDAVKAPFVEARQNPPERITNLEDWNILCDRWETPEWKEKADKNK
NSRSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATLIPPNKVYKQVLGHRSGHIKGLGWDPKPSSSSSVTS
SQYEKELEKKVEKMEGEMEQMKASDADMHESNVALKSQFSMWESRWSDIQNLLGRGQREDGPSNN