; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015653 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015653
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCACTA en-spm transposon protein
Genome locationchr12:18677722..18679122
RNA-Seq ExpressionLag0015653
SyntenyLag0015653
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK16780.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]9.8e-5037.16Show/hide
Query:  FAEGEDNHVATVSSTGETSDTSARRT------RGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHL
        F E  DN     SS G+ + +S+++T      R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  + +
Subjt:  FAEGEDNHVATVSSTGETSDTSARRT------RGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHL

Query:  KQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKS
        K  L   F++D + + +++F++H+  T FKE+RA+  KH++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + SKS
Subjt:  KQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKS

Query:  FLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDATS
        FL  Q +   R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ L+  PT + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S SS +TS
Subjt:  FLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDATS

Query:  LRRGHELSAALSETEEQLHATKEQLDQRTKN
          +  +      E + +LH   E+++ + +N
Subjt:  LRRGHELSAALSETEEQLHATKEQLDQRTKN

TYK17077.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]2.8e-4937.65Show/hide
Query:  FAEGEDNHVATVSS-----TGETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH
        F +  DN++A  SS     TG +S   T   R R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  + 
Subjt:  FAEGEDNHVATVSS-----TGETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH

Query:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK
        +K  L   F++D + + +++F++H+  T FKE+RA+  KH++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + SK
Subjt:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK

Query:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT
        SFL  Q +   R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ L+  PT + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S SS +T
Subjt:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT

Query:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN
        S  +  +      E + +LH   E+++ + +N
Subjt:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]8.2e-5739.38Show/hide
Query:  SSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLK-----------------
        S + + S    RRTRG SRN ELDR+V IHG+I IEI +E+ KPVC N  KFS AIG I R +  +R   W  V K+ RD +                  
Subjt:  SSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLK-----------------

Query:  --QCLL--------NHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPS
          +CL+        ++F  D+ KK V K++    Q  FKEYR++L KHYR++   +E RA P  R+ D  DW+++CNRWE P+WK   E NK++R+K+P 
Subjt:  --QCLL--------NHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPS

Query:  NHYTWSKSFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCT
         H T SKSF+ +Q + K +EG ++  +DLF ++ + E  GWV+  A +A+ +M  L +A   +   P    ++C+ VLG RS  IKGLG  PKP S    
Subjt:  NHYTWSKSFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCT

Query:  SSSDATSLRRGHELSAALSETEEQL
        SSS  + L+   EL   + + E+++
Subjt:  SSSDATSLRRGHELSAALSETEEQL

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]3.2e-6142.95Show/hide
Query:  SSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLKQCLLNHFIIDLSKKVVH
        S + + S    RRTRG SRN ELDR+V IHG+I IEI +E+ KPVC N  KFS AIG I R +  +R   W  V K+ RD +   LL++F  D+ KK V 
Subjt:  SSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLKQCLLNHFIIDLSKKVVH

Query:  KFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKSFLHIQQKNKTREGENLGPI
        K++    Q  FKEYR++L KHYR++   +E RA P  R+ D  DW+++CNRWE P+WK   E NK++R+K+P  H T SKSF+ +Q + K +EG ++  +
Subjt:  KFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKSFLHIQQKNKTREGENLGPI

Query:  DLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTSSSDATSLRRGHELSAALSETEEQL
        DLF ++ + E  GWV+  A +A+ +M  L +A   +   P    ++C+ VLG RS  IKGLG  PKP S    SSS  + L+   EL   + + E+++
Subjt:  DLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTSSSDATSLRRGHELSAALSETEEQL

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]8.2e-5739.38Show/hide
Query:  SSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLK-----------------
        S + + S    RRTRG SRN ELDR+V IHG+I IEI +E+ KPVC N  KFS AIG I R +  +R   W  V K+ RD +                  
Subjt:  SSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLK-----------------

Query:  --QCLL--------NHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPS
          +CL+        ++F  D+ KK V K++    Q  FKEYR++L KHYR++   +E RA P  R+ D  DW+++CNRWE P+WK   E NK++R+K+P 
Subjt:  --QCLL--------NHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPS

Query:  NHYTWSKSFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCT
         H T SKSF+ +Q + K +EG ++  +DLF ++ + E  GWV+  A +A+ +M  L +A   +   P    ++C+ VLG RS  IKGLG  PKP S    
Subjt:  NHYTWSKSFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCT

Query:  SSSDATSLRRGHELSAALSETEEQL
        SSS  + L+   EL   + + E+++
Subjt:  SSSDATSLRRGHELSAALSETEEQL

TrEMBL top hitse value%identityAlignment
A0A5D3C6Z8 CACTA en-spm transposon protein2.3e-4936.75Show/hide
Query:  FAEGEDNHVA-TVSSTGETSDTSARRT------RGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH
        F +  DN++A   SS G+ +++S+++T      R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  + 
Subjt:  FAEGEDNHVA-TVSSTGETSDTSARRT------RGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH

Query:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK
        +K  L   F++D + + +++F++H+  T FKE+RA+  KH++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + SK
Subjt:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK

Query:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT
        SFL  Q +   R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ L+  P  + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S SS +T
Subjt:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT

Query:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN
        S  +  +      E + +LH   E+++ + +N
Subjt:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN

A0A5D3CIP7 CACTA en-spm transposon protein2.3e-4937.96Show/hide
Query:  FAEGEDNHVATVSST------GETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARD
        F +  DN++A  SS+      G +S   T   R R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  +
Subjt:  FAEGEDNHVATVSST------GETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARD

Query:  HLKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWS
         +K  L   F++D + + +++F++H+  T FKE+RA+  +H++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + S
Subjt:  HLKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWS

Query:  KSFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTSSSDAT
        KSFL  Q +   R+GE +  ++LF +T  + G  +VSQAA +AH+QM+ L+  PT + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S+S ++
Subjt:  KSFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTSSSDAT

Query:  -----SLRRGHELSAALSETEEQL
             S  +  EL A L E  E++
Subjt:  -----SLRRGHELSAALSETEEQL

A0A5D3CM32 CACTA en-spm transposon protein1.4e-4937.65Show/hide
Query:  FAEGEDNHVATVSS-----TGETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH
        F +  DN++A  SS     TG +S   T   R R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  + 
Subjt:  FAEGEDNHVATVSS-----TGETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH

Query:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK
        +K  L   F++D + + +++F++H+  T FKE+RA+  KH++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + SK
Subjt:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK

Query:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT
        SFL  Q +   R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ L+  PT + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S SS +T
Subjt:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT

Query:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN
        S  +  +      E + +LH   E+++ + +N
Subjt:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN

A0A5D3CZD0 CACTA en-spm transposon protein1.4e-4937.65Show/hide
Query:  FAEGEDNHVATVSS-----TGETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH
        F +  DN++A  SS     TG +S   T   R R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  + 
Subjt:  FAEGEDNHVATVSS-----TGETSD--TSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDH

Query:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK
        +K  L   F++D + + +++F++H+  T FKE+RA+  KH++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + SK
Subjt:  LKQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSK

Query:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT
        SFL  Q +   R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ L+  PT + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S SS +T
Subjt:  SFLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDAT

Query:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN
        S  +  +      E + +LH   E+++ + +N
Subjt:  SLRRGHELSAALSETEEQLHATKEQLDQRTKN

A0A5D3CZJ1 CACTA en-spm transposon protein4.7e-5037.16Show/hide
Query:  FAEGEDNHVATVSSTGETSDTSARRT------RGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHL
        F E  DN     SS G+ + +S+++T      R  SR  EL+R+V I+G+I + I     KP+  + V+FS AIG  VR++F VR   W  V ++  + +
Subjt:  FAEGEDNHVATVSSTGETSDTSARRT------RGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHL

Query:  KQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKS
        K  L   F++D + + +++F++H+  T FKE+RA+  KH++KYS  EE RA+P   +V    DWH +C+ +    ++     NK  R K P NH + SKS
Subjt:  KQCLLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVD-IADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKS

Query:  FLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDATS
        FL  Q +   R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ L+  PT + +QP   D+IC+ VLG R    KGLGWGPKPK+R+  S SS +TS
Subjt:  FLHIQQKNKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTS-SSDATS

Query:  LRRGHELSAALSETEEQLHATKEQLDQRTKN
          +  +      E + +LH   E+++ + +N
Subjt:  LRRGHELSAALSETEEQLHATKEQLDQRTKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGTCCGATTGAGGAACGCAATGCAGATCGATTTTTTGCTGAGGGTGAAGACAATCATGTCGCCACAGTATCTTCCACGGGCGAGACATCAGATACGTCTGCACG
TAGAACGAGAGGATTTTCTAGGAATTTTGAGCTGGACAGATACGTTCGAATCCATGGGAAGATAACTATCGAGATCACCGACGAATTGAGAAAGCCGGTGTGTGACAATA
CAGTAAAATTCAGCGGTGCCATTGGTAAAATTGTCAGAGAATCATTCTCTGTACGTTATGCAACGTGGAAAGTTGTACCGAAGGACGCAAGGGATCATCTTAAACAATGT
CTTTTGAATCATTTCATAATTGATTTATCTAAGAAGGTGGTACACAAATTCATTGACCACGAGAAGCAAACCATTTTTAAAGAGTACCGGGCAGAGTTGGTCAAACACTA
TCGAAAATATTCCACTCTAGAGGAGGGACGTGCACACCCACATCCTCGAGTGGTTGACATAGCTGATTGGCATATTATGTGCAATAGATGGGAGGACCCCAAGTGGAAGA
ACATGGGTGAGAAGAATAAACAGAATCGTGCCAAACTTCCTTCCAACCACTACACTTGGTCAAAATCATTTCTTCACATACAACAGAAGAATAAAACTAGAGAAGGTGAG
AATTTGGGTCCTATTGATTTATTTCATAAGACCAGATACAAGGAAGGAAAGGGGTGGGTTAGTCAAGCCGCCAATGAAGCTCATGATCAAATGGTGGCATTGAAACAGGC
ACCCACCCTAGATAGGACCCAACCGCCTAAACCAGATGACATATGTGAAGTTGTTCTGGGTAGTCGATCGCATCAAATTAAAGGGCTTGGTTGGGGACCAAAGCCGAAGT
CGAGAAAGTGTACCTCCTCATCCGATGCAACTTCATTAAGACGAGGGCATGAACTAAGTGCAGCCTTAAGTGAGACCGAAGAACAGTTGCATGCAACAAAAGAACAGTTG
GATCAGCGAACGAAAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGTCCGATTGAGGAACGCAATGCAGATCGATTTTTTGCTGAGGGTGAAGACAATCATGTCGCCACAGTATCTTCCACGGGCGAGACATCAGATACGTCTGCACG
TAGAACGAGAGGATTTTCTAGGAATTTTGAGCTGGACAGATACGTTCGAATCCATGGGAAGATAACTATCGAGATCACCGACGAATTGAGAAAGCCGGTGTGTGACAATA
CAGTAAAATTCAGCGGTGCCATTGGTAAAATTGTCAGAGAATCATTCTCTGTACGTTATGCAACGTGGAAAGTTGTACCGAAGGACGCAAGGGATCATCTTAAACAATGT
CTTTTGAATCATTTCATAATTGATTTATCTAAGAAGGTGGTACACAAATTCATTGACCACGAGAAGCAAACCATTTTTAAAGAGTACCGGGCAGAGTTGGTCAAACACTA
TCGAAAATATTCCACTCTAGAGGAGGGACGTGCACACCCACATCCTCGAGTGGTTGACATAGCTGATTGGCATATTATGTGCAATAGATGGGAGGACCCCAAGTGGAAGA
ACATGGGTGAGAAGAATAAACAGAATCGTGCCAAACTTCCTTCCAACCACTACACTTGGTCAAAATCATTTCTTCACATACAACAGAAGAATAAAACTAGAGAAGGTGAG
AATTTGGGTCCTATTGATTTATTTCATAAGACCAGATACAAGGAAGGAAAGGGGTGGGTTAGTCAAGCCGCCAATGAAGCTCATGATCAAATGGTGGCATTGAAACAGGC
ACCCACCCTAGATAGGACCCAACCGCCTAAACCAGATGACATATGTGAAGTTGTTCTGGGTAGTCGATCGCATCAAATTAAAGGGCTTGGTTGGGGACCAAAGCCGAAGT
CGAGAAAGTGTACCTCCTCATCCGATGCAACTTCATTAAGACGAGGGCATGAACTAAGTGCAGCCTTAAGTGAGACCGAAGAACAGTTGCATGCAACAAAAGAACAGTTG
GATCAGCGAACGAAAAATTAA
Protein sequenceShow/hide protein sequence
MVGPIEERNADRFFAEGEDNHVATVSSTGETSDTSARRTRGFSRNFELDRYVRIHGKITIEITDELRKPVCDNTVKFSGAIGKIVRESFSVRYATWKVVPKDARDHLKQC
LLNHFIIDLSKKVVHKFIDHEKQTIFKEYRAELVKHYRKYSTLEEGRAHPHPRVVDIADWHIMCNRWEDPKWKNMGEKNKQNRAKLPSNHYTWSKSFLHIQQKNKTREGE
NLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALKQAPTLDRTQPPKPDDICEVVLGSRSHQIKGLGWGPKPKSRKCTSSSDATSLRRGHELSAALSETEEQLHATKEQL
DQRTKN