CuGenDBv2

Lag0036214 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036214
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: MuDRA-like transposase
Genome location: chr3:41682720..41684055
RNA-Seq Expression: Lag0036214
Synteny: Lag0036214
Gene Ontology terms: NA
InterPro domains: IPR018289 - MULE transposase domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] (e value: 1.0e-65; % identity: 55.45)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ DIR+++GVN+SYDK W++SEEA    RG P  SY LLP +GEA+KI NP T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+ VD NNQ+YPV +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP +FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
          +F   A+AFR    NE W
Subjt:  MKVFDGVARAFRVVELNEYW

XP_022142677.1 uncharacterized protein LOC111012733 [Momordica charantia] (e value: 4.7e-63; % identity: 54.09)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ DIR+++GVN+SYDK W+ SEEA    RG P  SY LLP +GEA+KI N  T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+  D NNQ+YP+ +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP  FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
          +F   A+AFR    NE W
Subjt:  MKVFDGVARAFRVVELNEYW

XP_022151512.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] (e value: 1.1e-62; % identity: 54.55)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYR KDI+ DIR+++GVN+SYDK W +SEEA    RG P  SY LL  +GEA KI NP T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+ VD NNQ+YPV +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP +FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
         ++F   A+AFR    NE W
Subjt:  MKVFDGVARAFRVVELNEYW

XP_022154923.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] (e value: 1.2e-63; % identity: 54.55)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ DIR+++GVN+SYDK W++SEEA    RG P  SY LLP +GEA+KI NP T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+ VD NNQ+Y V +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP +FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
          +F   A+AF     NE W
Subjt:  MKVFDGVARAFRVVELNEYW

XP_038882416.1 protein FAR1-RELATED SEQUENCE 8-like [Benincasa hispida] (e value: 1.9e-67; % identity: 57.01)
Query:  QGVGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLV
        + + R+Y+PKDIV DI+Q++GV+LSYDKVW+  EEA +   GSP+ESY+ L +FGEAL+IEN  + F  +L++D  FKHVFMAL ASI GF + IRP+L+
Subjt:  QGVGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLV

Query:  VDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTG
        V GTHLRGK+ GKLLLATGVDGNNQ+YPV +  + GET++SW +F +Q+  A+GQV  +VIVSDRH SI K + TVFP +FH +CI HL  NL  +FK  
Subjt:  VDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTG

Query:  NFMKVFDGVARAFR
        + + +FD  A+A R
Subjt:  NFMKVFDGVARAFR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like (e value: 4.9e-66; % identity: 55.45)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ DIR+++GVN+SYDK W++SEEA    RG P  SY LLP +GEA+KI NP T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+ VD NNQ+YPV +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP +FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
          +F   A+AFR    NE W
Subjt:  MKVFDGVARAFRVVELNEYW

A0A6J1CNJ2 uncharacterized protein LOC111012733 (e value: 2.3e-63; % identity: 54.09)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ DIR+++GVN+SYDK W+ SEEA    RG P  SY LLP +GEA+KI N  T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+  D NNQ+YP+ +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP  FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
          +F   A+AFR    NE W
Subjt:  MKVFDGVARAFRVVELNEYW

A0A6J1DDQ3 protein FAR1-RELATED SEQUENCE 4-like (e value: 5.1e-63; % identity: 54.55)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYR KDI+ DIR+++GVN+SYDK W +SEEA    RG P  SY LL  +GEA KI NP T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+ VD NNQ+YPV +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP +FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
         ++F   A+AFR    NE W
Subjt:  MKVFDGVARAFRVVELNEYW

A0A6J1DJT1 uncharacterized protein LOC111020715 (e value: 1.5e-62; % identity: 54.5)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ D+R+++GVNLSYDK W++SEEA    RG P  SY LLP +GEALKI NP T+F+LEL+    FK+VFMALG SI GF + IRPVLVVD
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTG--
        G HL+GKF G LL+A+G D NNQ+YPV +A   GET  SW +F  Q++   G V  LV VS+RH +I KA+  VFP +FHC CI H+  NL   FK    
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTG--

Query:  NFMKVFDGVARAFRVVELNEYW
           ++F   A+A+R    N  W
Subjt:  NFMKVFDGVARAFRVVELNEYW

A0A6J1DLL7 protein FAR1-RELATED SEQUENCE 4-like (e value: 6.0e-64; % identity: 54.55)
Query:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD
        V RTYRPKDI+ DIR+++GVN+SYDK W++SEEA    RG P  SY LLP +GEA+KI NP T+F+LEL+D   FK+VFMA+G SI GF   IRPVLV+D
Subjt:  VGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVD

Query:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF
        G HL+GK+ G LL A+ VD NNQ+Y V +A  G E+  SW +F  Q++  V  V  LV +SDRHA+I KA+  VFP +FHC CI HL  NL   FKT   
Subjt:  GTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKTGNF

Query:  MKVFDGVARAFRVVELNEYW
          +F   A+AF     NE W
Subjt:  MKVFDGVARAFRVVELNEYW

SwissProt top hits (e value, % identity, alignment)
Q6NQJ7 Protein FAR1-RELATED SEQUENCE 4 (e value: 3.1e-09; % identity: 23.46)
Query:  LLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQM
        +L  F   ++ ENP   F ++  +D   ++VF      I  +K S   V+  + ++   K+   L+L  GV+ + Q   +G      +T  ++ +  +  
Subjt:  LLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQM

Query:  ERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFK-----TGNFM-KVFDGVARAFRVVELNEYW
          A+G     V+++D++ +I  A+  V P + HC C+WH+ + L  +          FM K+F  + R++   E +  W
Subjt:  ERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFK-----TGNFM-KVFDGVARAFRVVELNEYW

Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e value: 4.8e-13; % identity: 29.01)
Query:  GSPEESYRLLPRFGEALKIENPNTV---FDLELED--DGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGG
        G  ++S+RL+P+    L   N   V   +D    D     F+ +F A   SI GF+   RP++VVD  +L GK+  KL++A+  D  NQ +P+ +A    
Subjt:  GSPEESYRLLPRFGEALKIENPNTV---FDLELED--DGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGG

Query:  ETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAV-----KTVFPNSFHCVCIWHLGENL
         +  SW +F  ++   V Q  G+ ++S     I   +     +   P ++H  C++HL   L
Subjt:  ETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAV-----KTVFPNSFHCVCIWHLGENL

AT1G64255.1 MuDR family transposase (e value: 3.4e-11; % identity: 28.65)
Query:  YRPKDIVNDI----RQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTV---FDLELEDD-GRFKHVFMALGASIGGFKSSIRPV
        Y P   ++++    ++  G  L    V  A E+A     G  ++S+   P+   AL   N   V   +DL    +   F  VF A   SI GF+   RP+
Subjt:  YRPKDIVNDI----RQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTV---FDLELEDD-GRFKHVFMALGASIGGFKSSIRPV

Query:  LVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASI
        +VVD  +L  ++  KL++A+GVD  N+ +P+ +A     +   W +F   +   V Q  GL ++S  H  I
Subjt:  LVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASI

AT1G64260.1 MuDR family transposase (e value: 8.7e-15; % identity: 26.64)
Query:  RQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTV---FDLELEDD-GRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCG
        ++  G  L   K+     E      G  ++S+R++P+   A    N   V   +DL    D   F+ VF +   SI GF+   RP++VVD   L GK+  
Subjt:  RQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTV---FDLELEDD-GRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCG

Query:  KLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKT-----VFPNSFHCVCIWHLGENLKTHFKTGNFMKVFD
        KL++A+GVD  N+ +P+ +A     +  SW +FF ++   V Q   L ++S     I   V         P + H  C+ HL       F+  N   + +
Subjt:  KLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKT-----VFPNSFHCVCIWHLGENLKTHFKTGNFMKVFD

Query:  GVARAFRVVELNEY
              +  E + Y
Subjt:  GVARAFRVVELNEY

AT1G76320.1 FAR1-related sequence 4 (e value: 2.2e-10; % identity: 23.46)
Query:  LLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQM
        +L  F   ++ ENP   F ++  +D   ++VF      I  +K S   V+  + ++   K+   L+L  GV+ + Q   +G      +T  ++ +  +  
Subjt:  LLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQM

Query:  ERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFK-----TGNFM-KVFDGVARAFRVVELNEYW
          A+G     V+++D++ +I  A+  V P + HC C+WH+ + L  +          FM K+F  + R++   E +  W
Subjt:  ERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFK-----TGNFM-KVFDGVARAFRVVELNEYW

AT1G76320.2 FAR1-related sequence 4 (e value: 2.2e-10; % identity: 23.46)
Query:  LLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQM
        +L  F   ++ ENP   F ++  +D   ++VF      I  +K S   V+  + ++   K+   L+L  GV+ + Q   +G      +T  ++ +  +  
Subjt:  LLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIGGFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQM

Query:  ERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFK-----TGNFM-KVFDGVARAFRVVELNEYW
          A+G     V+++D++ +I  A+  V P + HC C+WH+ + L  +          FM K+F  + R++   E +  W
Subjt:  ERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFK-----TGNFM-KVFDGVARAFRVVELNEYW


Sequences
CDS sequence
ATGGCAAATAATCCGTATAAGGTGGAGCAAAATTTATCATCGGCATCAACCCCAAACAATATACCATCGGCATCTACATCCTTCCCATGTGATGAATCAAAAAATATTCA
TGTGTATAACTTGGGGGACGATGAAGACCATGGTAGTGAACTTTATGGTGGCCAAGATTGGGGGGACTACGGACATGATGAGGAGTACGATGATGACGAGGACACAGACA
CGAAGGTTGATGTAGCTGTGGGAGATGATGAGGAAGAGATTCACGTACAGTACACTGAGGTTCCTTCTGCGCCTGAACAGGTGGTGACATGCTTTTCCGTTCCAAATGCT
TCATCGAAGGCAATCATCATGACAGGCCAGGCAGGCCAAGAGTTTGGTTATTGGAAATTTGATCAAGGCGTCGGTCGTACTTATAGGCCAAAGGATATTGTGAACGATAT
TAGACAAGATTTTGGTGTGAATTTAAGCTATGACAAGGTTTGGCAGGCTAGTGAAGAAGCTTTTATTTTTGCTAGAGGGTCTCCAGAAGAATCTTACAGACTGTTACCGA
GATTTGGTGAAGCATTGAAAATAGAAAATCCCAATACAGTGTTCGACTTAGAACTTGAAGATGATGGACGCTTTAAGCATGTGTTTATGGCACTAGGTGCTTCTATTGGA
GGGTTCAAGAGCTCCATTCGTCCAGTGCTAGTGGTTGATGGAACACACTTACGGGGAAAATTTTGTGGGAAACTACTTCTTGCGACCGGTGTAGATGGAAACAACCAGTT
ATATCCTGTAGGGTGGGCCTTTGCCGGGGGAGAAACTGATCAATCATGGACGTACTTTTTTCGACAGATGGAACGTGCAGTTGGACAAGTTCCTGGTCTGGTCATTGTGT
CTGATAGACATGCCAGCATCGGTAAGGCGGTAAAAACTGTGTTCCCTAATTCATTTCACTGTGTGTGTATCTGGCACTTAGGGGAGAACCTGAAAACACATTTTAAGACC
GGGAACTTTATGAAAGTATTTGACGGAGTTGCTAGGGCATTTCGTGTGGTTGAACTCAATGAGTACTGGTAG
mRNA sequence
ATGGCAAATAATCCGTATAAGGTGGAGCAAAATTTATCATCGGCATCAACCCCAAACAATATACCATCGGCATCTACATCCTTCCCATGTGATGAATCAAAAAATATTCA
TGTGTATAACTTGGGGGACGATGAAGACCATGGTAGTGAACTTTATGGTGGCCAAGATTGGGGGGACTACGGACATGATGAGGAGTACGATGATGACGAGGACACAGACA
CGAAGGTTGATGTAGCTGTGGGAGATGATGAGGAAGAGATTCACGTACAGTACACTGAGGTTCCTTCTGCGCCTGAACAGGTGGTGACATGCTTTTCCGTTCCAAATGCT
TCATCGAAGGCAATCATCATGACAGGCCAGGCAGGCCAAGAGTTTGGTTATTGGAAATTTGATCAAGGCGTCGGTCGTACTTATAGGCCAAAGGATATTGTGAACGATAT
TAGACAAGATTTTGGTGTGAATTTAAGCTATGACAAGGTTTGGCAGGCTAGTGAAGAAGCTTTTATTTTTGCTAGAGGGTCTCCAGAAGAATCTTACAGACTGTTACCGA
GATTTGGTGAAGCATTGAAAATAGAAAATCCCAATACAGTGTTCGACTTAGAACTTGAAGATGATGGACGCTTTAAGCATGTGTTTATGGCACTAGGTGCTTCTATTGGA
GGGTTCAAGAGCTCCATTCGTCCAGTGCTAGTGGTTGATGGAACACACTTACGGGGAAAATTTTGTGGGAAACTACTTCTTGCGACCGGTGTAGATGGAAACAACCAGTT
ATATCCTGTAGGGTGGGCCTTTGCCGGGGGAGAAACTGATCAATCATGGACGTACTTTTTTCGACAGATGGAACGTGCAGTTGGACAAGTTCCTGGTCTGGTCATTGTGT
CTGATAGACATGCCAGCATCGGTAAGGCGGTAAAAACTGTGTTCCCTAATTCATTTCACTGTGTGTGTATCTGGCACTTAGGGGAGAACCTGAAAACACATTTTAAGACC
GGGAACTTTATGAAAGTATTTGACGGAGTTGCTAGGGCATTTCGTGTGGTTGAACTCAATGAGTACTGGTAG
Protein sequence
MANNPYKVEQNLSSASTPNNIPSASTSFPCDESKNIHVYNLGDDEDHGSELYGGQDWGDYGHDEEYDDDEDTDTKVDVAVGDDEEEIHVQYTEVPSAPEQVVTCFSVPNA
SSKAIIMTGQAGQEFGYWKFDQGVGRTYRPKDIVNDIRQDFGVNLSYDKVWQASEEAFIFARGSPEESYRLLPRFGEALKIENPNTVFDLELEDDGRFKHVFMALGASIG
GFKSSIRPVLVVDGTHLRGKFCGKLLLATGVDGNNQLYPVGWAFAGGETDQSWTYFFRQMERAVGQVPGLVIVSDRHASIGKAVKTVFPNSFHCVCIWHLGENLKTHFKT
GNFMKVFDGVARAFRVVELNEYW