; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036086 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036086
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase
Genome locationchr3:38743609..38744211
RNA-Seq ExpressionLag0036086
SyntenyLag0036086
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ95504.1 integrase [Cucumis melo var. makuwa]3.4e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

TYJ98761.1 integrase [Cucumis melo var. makuwa]3.4e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

TYK07359.1 integrase [Cucumis melo var. makuwa]2.6e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        N+ AQ+  Q EIQAV S SS+  SSTSDD++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

TYK24556.1 integrase [Cucumis melo var. makuwa]3.4e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

TYK30104.1 integrase [Cucumis melo var. makuwa]3.4e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

TrEMBL top hitse value%identityAlignment
A0A5D3BJ80 Integrase1.6e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

A0A5D3CAM0 Integrase1.3e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        N+ AQ+  Q EIQAV S SS+  SSTSDD++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

A0A5D3CLV1 Integrase1.6e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

A0A5D3DLN8 Integrase1.6e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

A0A5D3E2J1 Integrase1.6e-7476.77Show/hide
Query:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL
        NE AQ+  Q EIQAV S SS+  SSTS+D++SPRRMR+IQEIYN+TN I+DD   +FALF  VDP+ FDEA+QDEK KIAMDQ+ID I+RNETWEL+ L 
Subjt:  NEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDD-VVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLL

Query:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
         NKQ LGVK VYRTKLKS+G VEKYK RLVVKGYKQEYGV+YEEIFAPVTRIET+RLILSLAAQNGWKVYQMDVK  FLNGHLKEEIFV QPLGYV++
Subjt:  VNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.5e-2034.5Show/hide
Query:  NGNEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEI-YN-STNIIDDDVVDFALFFDVDPINFDE-AVQDEKS--KIAMDQKIDTIKRNETW
        N NE  +      ++ +   + T        +    R++   +I YN   N ++  V++    F+  P +FDE   +D+KS  + A++ +++  K N TW
Subjt:  NGNEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEI-YN-STNIIDDDVVDFALFFDVDPINFDE-AVQDEKS--KIAMDQKIDTIKRNETW

Query:  ELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLG
         +     NK  +  + V+  K    G   +YK RLV +G+ Q+Y ++YEE FAPV RI + R ILSL  Q   KV+QMDVK  FLNG LKEEI++  P G
Subjt:  ELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-2140.15Show/hide
Query:  DVDPINFDEAV-QDEKSKI--AMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLI
        D +P +  E +   EK+++  AM ++++++++N T++L+ L   K+ L  K V++ K   + K+ +YK RLVVKG++Q+ G++++EIF+PV ++ ++R I
Subjt:  DVDPINFDEAV-QDEKSKI--AMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLI

Query:  LSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGY
        LSLAA    +V Q+DVK  FL+G L+EEI++EQP G+
Subjt:  LSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGY

P92520 Uncharacterized mitochondrial protein AtMg008207.0e-1444.71Show/hide
Query:  AMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQ
        AM +++D + RN+TW L+   VN+  LG K V++TKL S+G +++ K RLV KG+ QE G+ + E ++PV R  T+R IL++A Q
Subjt:  AMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-1832.39Show/hide
Query:  ALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDL-GVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVR
        +L  + +P    +A++DE+ + AM  +I+    N TW+L+    +   + G + ++  K  S+G + +YK RLV KGY Q  G++Y E F+PV +  ++R
Subjt:  ALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDL-GVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVR

Query:  LILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
        ++L +A    W + Q+DV   FL G L +++++ QP G++ K
Subjt:  LILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-1833.82Show/hide
Query:  DPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDL-GVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLA
        +P    +A++D++ + AM  +I+    N TW+L+        + G + ++  K  S+G + +YK RLV KGY Q  G++Y E F+PV +  ++R++L +A
Subjt:  DPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDL-GVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLA

Query:  AQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
            W + Q+DV   FL G L +E+++ QP G+V K
Subjt:  AQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.0e-2339.26Show/hide
Query:  DPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAA
        +P  ++EA +      AMD +I  ++   TWE+  L  NK+ +G K VY+ K  S+G +E+YK RLV KGY Q+ G+++ E F+PV ++ +V+LIL+++A
Subjt:  DPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAA

Query:  QNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK
           + ++Q+D+   FLNG L EEI+++ P GY  +
Subjt:  QNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.0e-1544.71Show/hide
Query:  AMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQ
        AM +++D + RN+TW L+   VN+  LG K V++TKL S+G +++ K RLV KG+ QE G+ + E ++PV R  T+R IL++A Q
Subjt:  AMDQKIDTIKRNETWELIYLLVNKQDLGVKLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGAATGAAGATGCTCAAGACTCAGTGCAAGTAGAGATTCAGGCAGTGGCGTCATATTCATCTACATTGCCATCTTCCACAAGTGATGATGATATGTCACCAAG
GAGAATGAGAAATATTCAAGAAATTTATAATTCTACTAACATAATTGACGATGATGTTGTTGATTTTGCATTGTTTTTCGATGTTGATCCTATAAATTTTGATGAGGCGG
TCCAAGATGAAAAATCGAAGATTGCAATGGATCAAAAAATTGATACAATAAAAAGAAATGAAACATGGGAGTTGATCTATCTTCTAGTAAACAAACAAGATCTTGGAGTA
AAATTGGTATACAGAACAAAGCTGAAGTCAAATGGTAAAGTTGAAAAATACAAGACAAGACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTAAATTATGAAGAGAT
CTTTGCTCCTGTGACAAGAATTGAGACAGTTCGATTGATTTTGTCTTTGGCTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATTCGTTTTTTTGAATGGAC
ACTTAAAGGAAGAGATATTCGTTGAACAACCTTTGGGTTATGTGAAAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGGGAATGAAGATGCTCAAGACTCAGTGCAAGTAGAGATTCAGGCAGTGGCGTCATATTCATCTACATTGCCATCTTCCACAAGTGATGATGATATGTCACCAAG
GAGAATGAGAAATATTCAAGAAATTTATAATTCTACTAACATAATTGACGATGATGTTGTTGATTTTGCATTGTTTTTCGATGTTGATCCTATAAATTTTGATGAGGCGG
TCCAAGATGAAAAATCGAAGATTGCAATGGATCAAAAAATTGATACAATAAAAAGAAATGAAACATGGGAGTTGATCTATCTTCTAGTAAACAAACAAGATCTTGGAGTA
AAATTGGTATACAGAACAAAGCTGAAGTCAAATGGTAAAGTTGAAAAATACAAGACAAGACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTAAATTATGAAGAGAT
CTTTGCTCCTGTGACAAGAATTGAGACAGTTCGATTGATTTTGTCTTTGGCTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATTCGTTTTTTTGAATGGAC
ACTTAAAGGAAGAGATATTCGTTGAACAACCTTTGGGTTATGTGAAAAAATGA
Protein sequenceShow/hide protein sequence
MNGNEDAQDSVQVEIQAVASYSSTLPSSTSDDDMSPRRMRNIQEIYNSTNIIDDDVVDFALFFDVDPINFDEAVQDEKSKIAMDQKIDTIKRNETWELIYLLVNKQDLGV
KLVYRTKLKSNGKVEKYKTRLVVKGYKQEYGVNYEEIFAPVTRIETVRLILSLAAQNGWKVYQMDVKFVFLNGHLKEEIFVEQPLGYVKK