CuGenDBv2

Lag0005275 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005275
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CACTA en-spm transposon protein
Genome location: chr6:13185798..13188446
RNA-Seq Expression: Lag0005275
Synteny: Lag0005275
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant
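
The genome location in the record above is written as chromosome:start..end. As a minimal sketch, the string can be split into its parts as follows. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location):
    """Length of the genomic span in base pairs.

    Assumption: coordinates are 1-based and inclusive, so both
    endpoints are counted.
    """
    _, start, end = parse_location(location)
    return end - start + 1


chrom, start, end = parse_location("chr6:13185798..13188446")
print(chrom, start, end, span_length("chr6:13185798..13188446"))
```

Under that assumption the span covers the whole gene model, so it can be longer than the CDS listed further down if the gene contains introns.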


Homology
GenBank top hits | e-value | % identity | alignment
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] | 5.7e-89 | 48.92
Query:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK
        S L    LN +  S   N  +    + +    L  A + P IGSST++    ASGSR  SR   RG  R TR HSRN+ELD +V +HGRIRIEI E++GK
Subjt:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK

Query:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA
        PVC  +T+FS AIGT  R+T+PL C  W  V ++VRD V  +LL                           +YFD D+ K+HV KY+ + + +TFKE+R+
Subjt:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA

Query:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN
        DLYKHY  F+DP EAR  PP+RIT+  DWNLL +RWETPEWK+K + NK S+S +P+ HR G KSF+Q+Q E+KIKE RDVDQVDLF +SHFCE+DGWVN
Subjt:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN

Query:  DVAKDAY----------------------------------------DPKPSSSSSVTS-SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLS
        + AKDAY                                        +PKPSSSSSVTS  Q +KELE K+EKME EM QMKA+Y  M+E+NVAL SQLS
Subjt:  DVAKDAY----------------------------------------DPKPSSSSSVTS-SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLS

Query:  MWESIWSDIQNLLGR
        MWE  W++IQN+LGR
Subjt:  MWESIWSDIQNLLGR

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] | 2.2e-93 | 52.32
Query:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK
        S L    LN +  S   N  +    + +    L  A + P IGSST++    ASGSR  SR   RG  R TR HSRN+ELD +V +HGRIRIEI E++GK
Subjt:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK

Query:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLE
        PVC  +T+FS AIGT  R+T+PL C  W  V ++VRD V  +LL+YFD D+ K+HV KY+ + + +TFKE+R+DLYKHY  F+DP EAR  PP+RIT+  
Subjt:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLE

Query:  DWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAY--------------------
        DWNLL +RWETPEWK+K + NK S+S +P+ HR G KSF+Q+Q E+KIKE RDVDQVDLF +SHFCE+DGWVN+ AKDAY                    
Subjt:  DWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAY--------------------

Query:  --------------------DPKPSSSSSVTS-SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLLGR
                            +PKPSSSSSVTS  Q +KELE K+EKME EM QMKA+Y  M+E+NVAL SQLSMWE  W++IQN+LGR
Subjt:  --------------------DPKPSSSSSVTS-SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLLGR

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] | 2.8e-88 | 50.65
Query:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK
        S L    LN +  S   N  +    + +    L  A + P IGSST++    ASGSR  SR   RG  R TR HSRN+ELD +V +HGRIRIEI E++GK
Subjt:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK

Query:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA
        PVC  +T+FS AIGT  R+T+PL C  W  V ++VRD V  +LL                           +YFD D+ K+HV KY+ + + +TFKE+R+
Subjt:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA

Query:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN
        DLYKHY  F+DP EAR  PP+RIT+  DWNLL +RWETPEWK+K + NK S+S +P+ HR G KSF+Q+Q E+KIKE RDVDQVDLF +SHFCE+DGWVN
Subjt:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN

Query:  DVAKDAY-------------DPKPSSSSSVTSSQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLLGR
        + AKDAY             DP P S     S + +KELE K+EKME EM QMKA+Y  M+E+NVAL SQLSMWE  W++IQN+LGR
Subjt:  DVAKDAY-------------DPKPSSSSSVTSSQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLLGR

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida] | 1.2e-70 | 50.32
Query:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK
        S L    LN +  S   N  +    + +    L  A + P IGSST++    ASGSR  SR   RG  R TR HSRN+ELD +V +HGRIRIEI E++GK
Subjt:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK

Query:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA
        PVC  +T+FS AIGT  R+T+PL C  W  V ++VRD V  +LL                           +YFD D+ K+HV KY+ + + +TFKE+R+
Subjt:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA

Query:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN
        DLYKHY  F+DP EAR  PP+RIT+  DWNLL +RWETPEWK+K + NK S+S +P+ HR G KSF+Q+Q E+KIKE RDVDQVDLF +SHFCE+DGWVN
Subjt:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN

Query:  DVAKDAYDPKPS
        + AKDAY  K S
Subjt:  DVAKDAYDPKPS

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] | 5.7e-89 | 48.92
Query:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK
        S L    LN +  S   N  +    + +    L  A + P IGSST++    ASGSR  SR   RG  R TR HSRN+ELD +V +HGRIRIEI E++GK
Subjt:  SQLSRQQLNKTVSSNNENDFINDEPENEARDALANA-NRPVIGSSTDQ---AASGSRGRSRN-QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGK

Query:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA
        PVC  +T+FS AIGT  R+T+PL C  W  V ++VRD V  +LL                           +YFD D+ K+HV KY+ + + +TFKE+R+
Subjt:  PVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRA

Query:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN
        DLYKHY  F+DP EAR  PP+RIT+  DWNLL +RWETPEWK+K + NK S+S +P+ HR G KSF+Q+Q E+KIKE RDVDQVDLF +SHFCE+DGWVN
Subjt:  DLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVN

Query:  DVAKDAY----------------------------------------DPKPSSSSSVTS-SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLS
        + AKDAY                                        +PKPSSSSSVTS  Q +KELE K+EKME EM QMKA+Y  M+E+NVAL SQLS
Subjt:  DVAKDAY----------------------------------------DPKPSSSSSVTS-SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLS

Query:  MWESIWSDIQNLLGR
        MWE  W++IQN+LGR
Subjt:  MWESIWSDIQNLLGR

TrEMBL top hits | e-value | % identity | alignment
A0A5A7SPZ3 Transposase | 1.2e-55 | 40.39
Query:  RGRSRNQRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVV
        R +SR ++ R RG R + RNIELD +V  HG+++IEI E+ GKPV  ++ + +  IGT  R+T+ LSC  W+ +P  V++ +  R  T+F+ D +   V 
Subjt:  RGRSRNQRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVV

Query:  KYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQV
        KY+ + M + F+EFRA L+K+YC+F+D  EAR NPP++IT+ EDWN++ DRWET  WK+K + NK S+S + FNH  G KSFLQ++HEL+ K+  DVD+V
Subjt:  KYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQV

Query:  DLFHESHFCERDGWVNDVAKDAY----DPKPSSSSSVTSSQYEKEL--------------ENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWS
        ++F E+HF E++GW+ND AKDAY    +   +   +++S++  K +              E+    +    E+ K     ++E N  L  +L+ WE  W+
Subjt:  DLFHESHFCERDGWVNDVAKDAY----DPKPSSSSSVTSSQYEKEL--------------ENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWS

Query:  DIQNLLG
        DI+  +G
Subjt:  DIQNLLG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class | 1.3e-51 | 36.54
Query:  EVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPEN-EARDA----------LANANRPVIGSSTDQ
        E E+ E+D  ELLE   +  VDES+ D+   R DVEP+VV   +   Q       S  ++DFINDE E  ++ D+          +   N P +  ST  
Subjt:  EVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPEN-EARDA----------LANANRPVIGSSTDQ

Query:  AASGSRGRSRNQRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMS
              G     + RGRG R + RNIELD +V  HG+I+IEI E+ GKPV  ++ + +  IGT  R+T+PLSC  W+ VP  VR+ V  RL T+F+ D +
Subjt:  AASGSRGRSRNQRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMS

Query:  KRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENR
           V KY+ + M + F+EFRADL+K+YC+F+D  EAR NPP RIT  EDWN++ DRWET  WK                                 K+  
Subjt:  KRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENR

Query:  DVDQVDLFHESHFCERDGWVNDVAKDAYDP-----KPSSSSSVTSSQYEKELE--------NKVEKMEGE---------MEQMKASYVDMRESNVALKSQ
        DVD++++FHE+HF E++GW+ND AKDAY         S+ + V +    K  E          V    GE          E+ K     ++E+N  L  +
Subjt:  DVDQVDLFHESHFCERDGWVNDVAKDAYDP-----KPSSSSSVTSSQYEKELE--------NKVEKMEGE---------MEQMKASYVDMRESNVALKSQ

Query:  LSMWE
        L+ WE
Subjt:  LSMWE

A0A5A7TRX4 DUF4216 domain-containing protein | 2.6e-63 | 40.61
Query:  EVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPENEARDALANANRPVIGSSTDQAASGSRGRSRN
        E E+ E+D  ELLE   +  VDES+ D+   R DVEP+VV   +   Q       S  ++DFINDE E             +  S +D            
Subjt:  EVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPENEARDALANANRPVIGSSTDQAASGSRGRSRN

Query:  QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVVKYIHKL
            GRG R + RNIELD +V  HG+I+IEI E+ GKPV  ++ + +  IGT  R+T+PLSC  W+ VP  VR+ V   L T+F+ D +   V KY+ + 
Subjt:  QRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVVKYIHKL

Query:  MSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHES
        M +TF+EFRADL+K+YC+F+D  EAR NP  RIT+ EDWN++ DRWET  WK+K + NK S S + FNH  G KSFLQ++HELK K+  DVD++++FHE+
Subjt:  MSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHES

Query:  HFCERDGWVNDVAKDAYDP-----KPSSSSSVTSSQYEKELE--------NKVEKMEGE---------MEQMKASYVDMRESNVALKSQLSMWE
        HF E++GW ND AKDAY       + S+ + V +    K  E          V    GE          E+ K     ++E+N  L  +L+ WE
Subjt:  HFCERDGWVNDVAKDAYDP-----KPSSSSSVTSSQYEKELE--------NKVEKMEGE---------MEQMKASYVDMRESNVALKSQLSMWE

A0A5A7US78 Uncharacterized protein | 8.7e-51 | 35.06
Query:  EVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPEN-EARDA----------LANANRPVIGSSTDQ
        E E+ E+D  +LLE   +  VDES+ D+   R DVEP+VV   +   Q       S  ++DFINDE E  ++ D+          +   N P I  ST  
Subjt:  EVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPEN-EARDA----------LANANRPVIGSSTDQ

Query:  AASGSRGRSRNQRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMS
              G     + RGRG R + RNIELD +V  HG+I+IEI E+ GKPV  +  + +  IGT  R+T+PLSC  W+ VP  VR+ V   L T+F+ D +
Subjt:  AASGSRGRSRNQRGRGRGTREHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMS

Query:  KRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENR
           V KY+ + M +TF+EFRA+L+K+YC+F+D  EAR NPP RIT+ EDWN++ DRWET  WK+K                               K+  
Subjt:  KRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENR

Query:  DVDQVDLFHESHFCERDGWVNDVAKDAY--------DPKPSSSSSVTSSQYEKEL--------------ENKVEKMEGEMEQMKASYVDMRESNVALKSQ
        DVD++++FHE+HF +++GW+ND AKDAY        +   +   ++++++  K +              E+    +    E+ K     ++E+N  L  +
Subjt:  DVDQVDLFHESHFCERDGWVNDVAKDAY--------DPKPSSSSSVTSSQYEKEL--------------ENKVEKMEGEMEQMKASYVDMRESNVALKSQ

Query:  LSMWE
        L+ WE
Subjt:  LSMWE

A0A6J1DUH3 uncharacterized protein LOC111023212 | 2.5e-50 | 49.19
Query:  YFDLDMSKRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHE
        +F +D+SKR V K+I + M  +FK++R+DL+++YCEFEDP EAR NPPER+TN EDWN L DRWETPEWKE   KNK +++ LPFNHRAG KSFLQLQHE
Subjt:  YFDLDMSKRHVVKYIHKLMSSTFKEFRADLYKHYCEFEDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHE

Query:  LKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYD----------------------------------------PKPS-----SSSSVTSSQ-YEKEL
        LKIKE  D+  VDLF ESH+ E+DG VND A+DAY+                                        P+P+     SSS+VTSS  YEKEL
Subjt:  LKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYD----------------------------------------PKPS-----SSSSVTSSQ-YEKEL

Query:  ENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLL
        E KVE ME EM +MK         N  LK  +S WE  W++I   +
Subjt:  ENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLL

SwissProt top hits | e-value | % identity | alignment
No hits found

Arabidopsis top hits | e-value | % identity | alignment
No hits found

Sequences
CDS sequence
ATGGCTTTGATCTTGGCTTTCCAAAGATCAAAGTCTCCCTTTCCATCGAACTTATCGATCTCAAATCTTGTCATGGCCATTCTTGCTTCAAGAAACAGATTCTTGACCGA
CTTTCTGGGCTCTGATACCACTGTGATACACACACAATCAGTCGAAGAAAAGCAAAGCAAGAAAACGATCCCTTCCTCCTCACACTCACGCAAGACTGGAATTAAACACC
AGAGAAAGAACGAAGAGAACCAGGAAGACAAGTTTAAAGTGGTTCGGCAACGATGGCCTACGTCCACTGTGAAAGGAGCTGAATTTTATTCAATTACTCAGAAGAATGCA
AAAGAAGTTGAAGACCAAGAGAATGACGATTTAGAGTTACTAGAAGTCCCTGGGGCAAATGAGGTTGATGAATCCGTCCAGGATGTCATATTGAGTAGGGACGATGTTGA
ACCCAGTGTCGTCATTCAAAGCCAATTAAGCAGACAACAATTGAATAAAACTGTGTCCTCCAACAATGAAAACGACTTTATAAACGATGAGCCTGAAAATGAGGCACGTG
ATGCACTTGCAAATGCTAACCGACCAGTAATTGGTTCATCCACTGATCAGGCTGCGTCAGGATCTAGAGGTCGTTCAAGAAATCAACGAGGAAGAGGAAGGGGGACTAGA
GAACATAGCCGGAATATTGAACTAGACGGATATGTGGCTCTTCATGGGAGGATCAGAATTGAGATCACCGAGCAGATTGGAAAACCAGTATGTGGTTGGTCTACGAGGTT
TAGTGGCGCTATTGGTACCAAAACAAGGAGCACGGTTCCTTTAAGTTGTGCGACATGGAGGGTTGTACCAGAACAAGTACGGGATGCTGTGAAGGCCCGTTTGTTGACAT
ATTTTGACCTAGATATGTCAAAGAGACATGTGGTGAAGTACATACACAAACTCATGTCATCAACTTTCAAGGAATTTCGAGCGGACTTATATAAACATTATTGTGAGTTT
GAGGATCCTACAGAAGCTCGTCAAAATCCACCCGAGAGGATTACAAACCTTGAAGATTGGAATCTTTTATATGATCGATGGGAGACACCTGAGTGGAAGGAAAAAGCGGA
TAAGAATAAAAATAGTCAATCGAACCTTCCATTCAACCATCGAGCTGGGCCGAAGTCATTTCTCCAACTACAACATGAATTGAAAATCAAAGAGAATCGGGATGTTGACC
AGGTAGATTTGTTCCATGAAAGTCATTTTTGTGAAAGAGATGGATGGGTCAACGATGTTGCCAAAGATGCATATGACCCAAAACCCAGCTCATCATCCAGTGTCACATCA
TCACAATATGAAAAAGAACTAGAAAATAAGGTTGAGAAGATGGAAGGTGAAATGGAACAGATGAAGGCCTCTTACGTAGATATGCGGGAATCAAATGTTGCCCTGAAGTC
ACAATTGTCGATGTGGGAAAGTATATGGTCTGACATTCAAAACTTGTTAGGGCGAGTTATGGTAGCCACATATATCTACTTGGGCATAGAGTACATATTTTATTGTTATA
CGTGTGGCTATCCTGCAGTTATGGTAGCCACATAA
mRNA sequence
ATGGCTTTGATCTTGGCTTTCCAAAGATCAAAGTCTCCCTTTCCATCGAACTTATCGATCTCAAATCTTGTCATGGCCATTCTTGCTTCAAGAAACAGATTCTTGACCGA
CTTTCTGGGCTCTGATACCACTGTGATACACACACAATCAGTCGAAGAAAAGCAAAGCAAGAAAACGATCCCTTCCTCCTCACACTCACGCAAGACTGGAATTAAACACC
AGAGAAAGAACGAAGAGAACCAGGAAGACAAGTTTAAAGTGGTTCGGCAACGATGGCCTACGTCCACTGTGAAAGGAGCTGAATTTTATTCAATTACTCAGAAGAATGCA
AAAGAAGTTGAAGACCAAGAGAATGACGATTTAGAGTTACTAGAAGTCCCTGGGGCAAATGAGGTTGATGAATCCGTCCAGGATGTCATATTGAGTAGGGACGATGTTGA
ACCCAGTGTCGTCATTCAAAGCCAATTAAGCAGACAACAATTGAATAAAACTGTGTCCTCCAACAATGAAAACGACTTTATAAACGATGAGCCTGAAAATGAGGCACGTG
ATGCACTTGCAAATGCTAACCGACCAGTAATTGGTTCATCCACTGATCAGGCTGCGTCAGGATCTAGAGGTCGTTCAAGAAATCAACGAGGAAGAGGAAGGGGGACTAGA
GAACATAGCCGGAATATTGAACTAGACGGATATGTGGCTCTTCATGGGAGGATCAGAATTGAGATCACCGAGCAGATTGGAAAACCAGTATGTGGTTGGTCTACGAGGTT
TAGTGGCGCTATTGGTACCAAAACAAGGAGCACGGTTCCTTTAAGTTGTGCGACATGGAGGGTTGTACCAGAACAAGTACGGGATGCTGTGAAGGCCCGTTTGTTGACAT
ATTTTGACCTAGATATGTCAAAGAGACATGTGGTGAAGTACATACACAAACTCATGTCATCAACTTTCAAGGAATTTCGAGCGGACTTATATAAACATTATTGTGAGTTT
GAGGATCCTACAGAAGCTCGTCAAAATCCACCCGAGAGGATTACAAACCTTGAAGATTGGAATCTTTTATATGATCGATGGGAGACACCTGAGTGGAAGGAAAAAGCGGA
TAAGAATAAAAATAGTCAATCGAACCTTCCATTCAACCATCGAGCTGGGCCGAAGTCATTTCTCCAACTACAACATGAATTGAAAATCAAAGAGAATCGGGATGTTGACC
AGGTAGATTTGTTCCATGAAAGTCATTTTTGTGAAAGAGATGGATGGGTCAACGATGTTGCCAAAGATGCATATGACCCAAAACCCAGCTCATCATCCAGTGTCACATCA
TCACAATATGAAAAAGAACTAGAAAATAAGGTTGAGAAGATGGAAGGTGAAATGGAACAGATGAAGGCCTCTTACGTAGATATGCGGGAATCAAATGTTGCCCTGAAGTC
ACAATTGTCGATGTGGGAAAGTATATGGTCTGACATTCAAAACTTGTTAGGGCGAGTTATGGTAGCCACATATATCTACTTGGGCATAGAGTACATATTTTATTGTTATA
CGTGTGGCTATCCTGCAGTTATGGTAGCCACATAA
Protein sequence
MALILAFQRSKSPFPSNLSISNLVMAILASRNRFLTDFLGSDTTVIHTQSVEEKQSKKTIPSSSHSRKTGIKHQRKNEENQEDKFKVVRQRWPTSTVKGAEFYSITQKNA
KEVEDQENDDLELLEVPGANEVDESVQDVILSRDDVEPSVVIQSQLSRQQLNKTVSSNNENDFINDEPENEARDALANANRPVIGSSTDQAASGSRGRSRNQRGRGRGTR
EHSRNIELDGYVALHGRIRIEITEQIGKPVCGWSTRFSGAIGTKTRSTVPLSCATWRVVPEQVRDAVKARLLTYFDLDMSKRHVVKYIHKLMSSTFKEFRADLYKHYCEF
EDPTEARQNPPERITNLEDWNLLYDRWETPEWKEKADKNKNSQSNLPFNHRAGPKSFLQLQHELKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYDPKPSSSSSVTS
SQYEKELENKVEKMEGEMEQMKASYVDMRESNVALKSQLSMWESIWSDIQNLLGRVMVATYIYLGIEYIFYCYTCGYPAVMVAT
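
One way to check that the CDS and protein sequences listed above agree is to translate the CDS and compare the result with the protein. The sketch below is illustrative only. It assumes the standard genetic code (NCBI translation table 1), which the record does not state, and `translate` is a hypothetical helper, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as wrapped lines. Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCTTTGATCTTGGCTTTCCAAAGATCA"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence, after removing line breaks from both, would show whether the two entries are consistent.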