; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038482 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038482
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMuDRA-like transposase
Genome locationchr2:18459548..18463044
RNA-Seq ExpressionLag0038482
SyntenyLag0038482
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155431.1 uncharacterized protein LOC111022579 isoform X1 [Momordica charantia]1.1e-4128.19Show/hide
Query:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------
        W + + EY GG ++ + V   I+Y + L  +Y ++  +  ++++ ++C+ ++  R++ P FE   +  +                               
Subjt:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------

Query:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE
                H+E +   +N        V+P   HV        + VN   D       +  ++   G   N  + +V  IF  K+DL M+LSV+ M+ NFE
Subjt:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE

Query:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA
        +RVKKS K++  I CV E CKWRIRA  L G D+F ++                                        KP++II DM Q  G+N+SY+ A
Subjt:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA

Query:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV
        W ARE+  +  +GS E                            GS  +     +G  IR FLN I PV+V+DG  ++ K+   L+ A  +DGNNQIYP+
Subjt:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV

Query:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA
         FGI   ETD+   +FF +++  IG++  L+  SDR+ SI K++  VF   PTA
Subjt:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA

XP_022155432.1 uncharacterized protein LOC111022579 isoform X2 [Momordica charantia]1.1e-4128.19Show/hide
Query:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------
        W + + EY GG ++ + V   I+Y + L  +Y ++  +  ++++ ++C+ ++  R++ P FE   +  +                               
Subjt:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------

Query:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE
                H+E +   +N        V+P   HV        + VN   D       +  ++   G   N  + +V  IF  K+DL M+LSV+ M+ NFE
Subjt:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE

Query:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA
        +RVKKS K++  I CV E CKWRIRA  L G D+F ++                                        KP++II DM Q  G+N+SY+ A
Subjt:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA

Query:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV
        W ARE+  +  +GS E                            GS  +     +G  IR FLN I PV+V+DG  ++ K+   L+ A  +DGNNQIYP+
Subjt:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV

Query:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA
         FGI   ETD+   +FF +++  IG++  L+  SDR+ SI K++  VF   PTA
Subjt:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA

XP_028949183.1 uncharacterized protein LOC103401792 [Malus domestica]2.4e-4140.16Show/hide
Query:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVF--------TLTKPKDIISDMPQKLGVNLSYDMAWSAR
        DI V  ++ SKK+LQ +L+++ MR+N+E++V++S K   +IRCV + C+WR+RAT L+  + F         + +PKDII DM  ++GVN+SY+ AW AR
Subjt:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVF--------TLTKPKDIISDMPQKLGVNLSYDMAWSAR

Query:  EHALVLARGSPENGRGSL------LQARVHG-------------------IGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGI
        EHA  + RGSPE    +L      L+++  G                   +G+ IR F  S+ PV+ VDG  ++GK+   L   V  DG NQIYP+ FG+
Subjt:  EHALVLARGSPENGRGSL------LQARVHG-------------------IGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGI

Query:  GSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF
        G  E D  WT+F  ++   IG++  L+  SDRH SI K V TVF
Subjt:  GSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF

XP_038887209.1 uncharacterized protein LOC120077397 [Benincasa hispida]2.4e-4941.72Show/hide
Query:  SNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK-------------------
        S++I MPGQ  + +D++V DIF+SKKDL+M LS+L +R NF++RVKKS   ++K+  +VEECK ++RA  LK C++F ++K                   
Subjt:  SNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK-------------------

Query:  ---------------------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPENGR------GSLLQARVHG-------------------IGSM
                             PKDI+ D+ Q+  V+LSYD AW ARE  LVL  GS E         G  LQ    G                   +G+ 
Subjt:  ---------------------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPENGR------GSLLQARVHG-------------------IGSM

Query:  IRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF
        IR FLN I PVL+VDG  +R K+S KLL A+ +D NNQIY V FGI   ETDE W +F +Q+ CVIGQ+  L+I SDRH SI K + TVF
Subjt:  IRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF

XP_038889297.1 uncharacterized protein LOC120079207 [Benincasa hispida]1.7e-4238.62Show/hide
Query:  SNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK-------------------
        S++I MPGQ  + +D+++ DIFLSKKDL+MRLS+L +R NF++R+KKS   ++K+ C+VEECKWRIRA  LK C++F + K                   
Subjt:  SNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK-------------------

Query:  ---------------------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPENGR------GSLLQARVHG-------------------IGSM
                             PKDI+ D+ Q+ GV+LS D AW ARE ALVL   SPE         G  LQ    G                   +   
Subjt:  ---------------------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPENGR------GSLLQARVHG-------------------IGSM

Query:  IRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF
        IR FLN I P+L+VDG H+RGK+S KLL A+G++GN +                +T    Q+ C IGQ++ L+I SDRH SI K +  VF
Subjt:  IRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF

TrEMBL top hitse value%identityAlignment
A0A6J1DJT1 uncharacterized protein LOC1110207154.6e-3836.17Show/hide
Query:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK---------------------------------
        D++  ++F +KK+L +R+ ++ MR NF+++VKKS  +++ + CV   C WR+RAT L+ C++F + K                                 
Subjt:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK---------------------------------

Query:  -------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPEN---------------GRGSLLQARVHG----------IGSMIRAFLNSICPVLVV
               PKDII DM ++ GVNLSYD AW + E AL L RG P +                 G++ +  + G          +G  IR FL  I PVLVV
Subjt:  -------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPEN---------------GRGSLLQARVHG----------IGSMIRAFLNSICPVLVV

Query:  DGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA
        DG H++GKF   LL A G D NNQIYPV F I  GET   W +F  Q++   G +N L+  S+RH +I KA+  VF   PTA
Subjt:  DGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA

A0A6J1DL12 uncharacterized protein LOC1110220776.1e-3834.78Show/hide
Query:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK---------------------------------
        DI V  IF SK +L+ +L VL M+ NFE+RVKKS K ++ + C+   CKW + A  ++G D FT++K                                 
Subjt:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK---------------------------------

Query:  -------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPEN---------------GRGSLLQARVH----------GIGSMIRAFLNSICPVLVV
               PKDI++DM ++ GVN+ Y+ AW A+E AL +  GSP+                  G++ +  +            +G  IR F + I PVLV+
Subjt:  -------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPEN---------------GRGSLLQARVH----------GIGSMIRAFLNSICPVLVV

Query:  DGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF
        DG H++GK+   +LTA  +DGNNQIYP+ FGI   E+D  W++F  +++  IG+++ L+  SDRH SI K+V  VF
Subjt:  DGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF

A0A6J1DPC2 uncharacterized protein LOC111022579 isoform X25.3e-4228.19Show/hide
Query:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------
        W + + EY GG ++ + V   I+Y + L  +Y ++  +  ++++ ++C+ ++  R++ P FE   +  +                               
Subjt:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------

Query:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE
                H+E +   +N        V+P   HV        + VN   D       +  ++   G   N  + +V  IF  K+DL M+LSV+ M+ NFE
Subjt:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE

Query:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA
        +RVKKS K++  I CV E CKWRIRA  L G D+F ++                                        KP++II DM Q  G+N+SY+ A
Subjt:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA

Query:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV
        W ARE+  +  +GS E                            GS  +     +G  IR FLN I PV+V+DG  ++ K+   L+ A  +DGNNQIYP+
Subjt:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV

Query:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA
         FGI   ETD+   +FF +++  IG++  L+  SDR+ SI K++  VF   PTA
Subjt:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA

A0A6J1DRN0 uncharacterized protein LOC111022579 isoform X15.3e-4228.19Show/hide
Query:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------
        W + + EY GG ++ + V   I+Y + L  +Y ++  +  ++++ ++C+ ++  R++ P FE   +  +                               
Subjt:  WKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQL--RSKAPAFETKENIGV-------------------------------

Query:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE
                H+E +   +N        V+P   HV        + VN   D       +  ++   G   N  + +V  IF  K+DL M+LSV+ M+ NFE
Subjt:  --------HQEVLGTAQN--------VDP---HV--------MSVNMAPDP-SMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFE

Query:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA
        +RVKKS K++  I CV E CKWRIRA  L G D+F ++                                        KP++II DM Q  G+N+SY+ A
Subjt:  YRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLT----------------------------------------KPKDIISDMPQKLGVNLSYDMA

Query:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV
        W ARE+  +  +GS E                            GS  +     +G  IR FLN I PV+V+DG  ++ K+   L+ A  +DGNNQIYP+
Subjt:  WSAREHALVLARGSPENG-------------------------RGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAVGIDGNNQIYPV

Query:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA
         FGI   ETD+   +FF +++  IG++  L+  SDR+ SI K++  VF   PTA
Subjt:  WFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTA

A0A6J1DTG5 uncharacterized protein LOC1110238432.0e-4136.59Show/hide
Query:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK---------------------------------
        D+   D+F +KK+L +++ ++ MR+NF+++VKKS  K++ +RCV  +C WR+RAT LK C +F + K                                 
Subjt:  DIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEECKWRIRATILKGCDVFTLTK---------------------------------

Query:  -------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPEN---------------GRGSLLQARVHG----------IGSMIRAFLNSICPVLVV
               PKDII DM ++ GVNLSYD AW + E AL L RG P +                 G++ +  + G          +G  IR FL+ I PVLVV
Subjt:  -------PKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPEN---------------GRGSLLQARVHG----------IGSMIRAFLNSICPVLVV

Query:  DGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF
        DG H++GKF   L +A G+D NNQIYPV F I  GET   W +F  QI+ V+  ++ L+  SDRH++I K +  VF
Subjt:  DGIHMRGKFSEKLLTAVGIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTCGGCCGGGTTGCTTGGAGAGACACCGGGAATAAGGCTAGAAAATACTTTTTTTTGGTAGAAATTGTCTTTCCCGGTCTCGATCTCGGTCGGGAAATC
CGACCGGAATTCTCGAATGTCTCGGGGACGGTGGTAGTGGGAGTTGCGTGCGTGAGTTCCATGTTTCGAGTCCCGCTATACGTTTTTCGTATTATCCAAGTATTG
AACACAGGACCGCACAGTGCTTCAGTAATTAAGGGAACGGATGTGCCAAAAAGCCAAATTCTAATTGTTGAACCCGATCTAATCTCGTCCGGAATCAGGAAAATC
CCGATCGAACCATCAATTTTCCGGTCGGGATTCATACATATTGAAGAACGGCCGAACCGATTTTTTCCCAACCGAACCAACATTTTCCCGACCGAGCCGTCACCG
AGATTGGACCGAGATGAGGATGCCTCGTGTTTGGGTATGTTTTGGTGGTGTTGGAAGGAGAGTGAAAAGGAGTACGATGGTGGAGAGGTGAGGGACTTAGACGTG
GATGTTGGAATTTCTTATAACGAGTTCTTAGGTCGGGTGTATGAAATAAGTAGTATAAATCTGAATGATAACAACATTGTTTTGAGGTGTCTGCTTCAACTGAGG
TCTAAAGCTCCCGCTTTTGAGACAAAGGAGAACATCGGGGTTCATCAAGAGGTACTTGGAACTGCACAAAATGTTGATCCTCATGTTATGTCGGTTAATATGGCA
CCCGACCCTTCTATGACAACTTGTGTATCTAATTCAATAGTCATGCCAGGTCAGTACTGTAATTACAAGGATATAGAAGTGGAAGATATATTCTTGTCTAAGAAG
GACTTGCAGATGAGACTGTCTGTTTTAGGGATGAGAGAGAACTTTGAATATAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGATTCGGTGTGTTGTTGAGGAA
TGTAAGTGGAGGATCCGAGCTACGATCCTGAAAGGTTGTGATGTCTTTACACTTACCAAACCGAAGGACATCATAAGTGATATGCCACAAAAGCTTGGTGTGAAT
CTGAGTTATGACATGGCATGGAGTGCTAGAGAACATGCCTTGGTTCTTGCCAGAGGCTCACCTGAAAATGGACGAGGGTCGTTACTTCAAGCACGTGTTCATGGA
ATTGGGTCCATGATTAGAGCGTTCTTGAACTCTATTTGTCCGGTTTTGGTAGTAGATGGGATCCACATGAGGGGAAAGTTTAGTGAAAAGCTCCTAACTGCAGTA
GGCATCGATGGAAACAACCAAATATACCCCGTATGGTTTGGCATTGGGAGTGGAGAAACTGATGAACCATGGACCTACTTTTTCCGACAAATCGAGTGTGTTATT
GGTCAACTTAATCCCCTAATGATTGCATCTGATAGACACAAGAGCATATTGAAAGCAGTACATACTGTGTTTCTGATGCAGCCCACTGCTACTGTATCCACCATC
TAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCTCGGCCGGGTTGCTTGGAGAGACACCGGGAATAAGGCTAGAAAATACTTTTTTTTGGTAGAAATTGTCTTTCCCGGTCTCGATCTCGGTCGGGAAATC
CGACCGGAATTCTCGAATGTCTCGGGGACGGTGGTAGTGGGAGTTGCGTGCGTGAGTTCCATGTTTCGAGTCCCGCTATACGTTTTTCGTATTATCCAAGTATTG
AACACAGGACCGCACAGTGCTTCAGTAATTAAGGGAACGGATGTGCCAAAAAGCCAAATTCTAATTGTTGAACCCGATCTAATCTCGTCCGGAATCAGGAAAATC
CCGATCGAACCATCAATTTTCCGGTCGGGATTCATACATATTGAAGAACGGCCGAACCGATTTTTTCCCAACCGAACCAACATTTTCCCGACCGAGCCGTCACCG
AGATTGGACCGAGATGAGGATGCCTCGTGTTTGGGTATGTTTTGGTGGTGTTGGAAGGAGAGTGAAAAGGAGTACGATGGTGGAGAGGTGAGGGACTTAGACGTG
GATGTTGGAATTTCTTATAACGAGTTCTTAGGTCGGGTGTATGAAATAAGTAGTATAAATCTGAATGATAACAACATTGTTTTGAGGTGTCTGCTTCAACTGAGG
TCTAAAGCTCCCGCTTTTGAGACAAAGGAGAACATCGGGGTTCATCAAGAGGTACTTGGAACTGCACAAAATGTTGATCCTCATGTTATGTCGGTTAATATGGCA
CCCGACCCTTCTATGACAACTTGTGTATCTAATTCAATAGTCATGCCAGGTCAGTACTGTAATTACAAGGATATAGAAGTGGAAGATATATTCTTGTCTAAGAAG
GACTTGCAGATGAGACTGTCTGTTTTAGGGATGAGAGAGAACTTTGAATATAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGATTCGGTGTGTTGTTGAGGAA
TGTAAGTGGAGGATCCGAGCTACGATCCTGAAAGGTTGTGATGTCTTTACACTTACCAAACCGAAGGACATCATAAGTGATATGCCACAAAAGCTTGGTGTGAAT
CTGAGTTATGACATGGCATGGAGTGCTAGAGAACATGCCTTGGTTCTTGCCAGAGGCTCACCTGAAAATGGACGAGGGTCGTTACTTCAAGCACGTGTTCATGGA
ATTGGGTCCATGATTAGAGCGTTCTTGAACTCTATTTGTCCGGTTTTGGTAGTAGATGGGATCCACATGAGGGGAAAGTTTAGTGAAAAGCTCCTAACTGCAGTA
GGCATCGATGGAAACAACCAAATATACCCCGTATGGTTTGGCATTGGGAGTGGAGAAACTGATGAACCATGGACCTACTTTTTCCGACAAATCGAGTGTGTTATT
GGTCAACTTAATCCCCTAATGATTGCATCTGATAGACACAAGAGCATATTGAAAGCAGTACATACTGTGTTTCTGATGCAGCCCACTGCTACTGTATCCACCATC
TAG
Protein sequenceShow/hide protein sequence
MNLGRVAWRDTGNKARKYFFLVEIVFPGLDLGREIRPEFSNVSGTVVVGVACVSSMFRVPLYVFRIIQVLNTGPHSASVIKGTDVPKSQILIVEPDLISSGIRKI
PIEPSIFRSGFIHIEERPNRFFPNRTNIFPTEPSPRLDRDEDASCLGMFWWCWKESEKEYDGGEVRDLDVDVGISYNEFLGRVYEISSINLNDNNIVLRCLLQLR
SKAPAFETKENIGVHQEVLGTAQNVDPHVMSVNMAPDPSMTTCVSNSIVMPGQYCNYKDIEVEDIFLSKKDLQMRLSVLGMRENFEYRVKKSNKKIFKIRCVVEE
CKWRIRATILKGCDVFTLTKPKDIISDMPQKLGVNLSYDMAWSAREHALVLARGSPENGRGSLLQARVHGIGSMIRAFLNSICPVLVVDGIHMRGKFSEKLLTAV
GIDGNNQIYPVWFGIGSGETDEPWTYFFRQIECVIGQLNPLMIASDRHKSILKAVHTVFLMQPTATVSTI