CuGenDBv2

Lag0028255 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028255
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Plant transposase
Genome location: chr8:16743806..16745930
RNA-Seq Expression: Lag0028255
Synteny: Lag0028255
Gene Ontology terms: NA
InterPro domains: IPR004264 - Transposase, Tnp1/En/Spm-like
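The genome location in this record is written as `chr:start..end`. A minimal sketch for parsing such a string and computing the span length follows. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `Region` and `parse_region` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (assumed 1-based, inclusive on both ends)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Region(seqid, start, end)


region = parse_region("chr8:16743806..16745930")
print(region.seqid, region.length)  # chr8 2125
```

Under the inclusive-coordinate assumption the locus spans 2125 bp; with a half-open convention the figure would be one less.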


Homology
GenBank top hits | e value | % identity | Alignment
XP_031738366.1 uncharacterized protein LOC101217008 isoform X7 [Cucumis sativus] | 2.6e-87 | 56.84
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNI+SMHDW++FVKEKKSA FKAKSE+FK MKK QLPHTCSRKGYARLA+EM                  
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------

Query:  ------------------VRIEQIDNER----------TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDNNYMKLEEKYKKMEEQVFEM
                           RIEQIDNE            T S NV NDAISKVLGPD+G    LGFGVT  K   FSQR+ +Y KLEEKYKKME ++ EM
Subjt:  ------------------VRIEQIDNER----------TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDNNYMKLEEKYKKMEEQVFEM

Query:  RSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAY
        RS++S++LKSQ                             SIN+ N L KCK+LDWCG+GEVVAEGRWSSNDP V+VHHVPLGP AV+VWVDLP    A+
Subjt:  RSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAY

Query:  LWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        LWRPNSEM YI+DA+GS VAWP +KV++S
Subjt:  LWRPNSEMTYIEDAIGSTVAWPSNKVIIS

XP_038904085.1 uncharacterized protein LOC120090469 isoform X1 [Benincasa hispida] | 5.8e-95 | 59.7
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNIQSMHDW+DFVKEKKSA FKAKSERFK MKK QLPHTCSRKGYARLA+EM +                
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------

Query:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE
                           IEQ D E    + NV NDAISKVLGPD  HI  LGFGVT SK S            + SQRD++Y +LEEKYKKME ++ E
Subjt:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE

Query:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA
        MRS++S LLKSQ                             SINN N LRKCKLLDWCG+GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP    A
Subjt:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA

Query:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +LWRPNSEMTYI+DAIGSTVAWP +KVIIS
Subjt:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

XP_038904087.1 uncharacterized protein LOC120090469 isoform X2 [Benincasa hispida] | 5.8e-95 | 59.7
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNIQSMHDW+DFVKEKKSA FKAKSERFK MKK QLPHTCSRKGYARLA+EM +                
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------

Query:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE
                           IEQ D E    + NV NDAISKVLGPD  HI  LGFGVT SK S            + SQRD++Y +LEEKYKKME ++ E
Subjt:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE

Query:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA
        MRS++S LLKSQ                             SINN N LRKCKLLDWCG+GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP    A
Subjt:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA

Query:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +LWRPNSEMTYI+DAIGSTVAWP +KVIIS
Subjt:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

XP_038904088.1 uncharacterized protein LOC120090469 isoform X3 [Benincasa hispida] | 5.8e-95 | 59.7
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNIQSMHDW+DFVKEKKSA FKAKSERFK MKK QLPHTCSRKGYARLA+EM +                
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------

Query:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE
                           IEQ D E    + NV NDAISKVLGPD  HI  LGFGVT SK S            + SQRD++Y +LEEKYKKME ++ E
Subjt:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE

Query:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA
        MRS++S LLKSQ                             SINN N LRKCKLLDWCG+GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP    A
Subjt:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA

Query:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +LWRPNSEMTYI+DAIGSTVAWP +KVIIS
Subjt:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

XP_038904089.1 uncharacterized protein LOC120090469 isoform X4 [Benincasa hispida] | 5.8e-95 | 59.7
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNIQSMHDW+DFVKEKKSA FKAKSERFK MKK QLPHTCSRKGYARLA+EM +                
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVR----------------

Query:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE
                           IEQ D E    + NV NDAISKVLGPD  HI  LGFGVT SK S            + SQRD++Y +LEEKYKKME ++ E
Subjt:  -------------------IEQIDNERTTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLS------------TFSQRDNNYMKLEEKYKKMEEQVFE

Query:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA
        MRS++S LLKSQ                             SINN N LRKCKLLDWCG+GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP    A
Subjt:  MRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKA

Query:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +LWRPNSEMTYI+DAIGSTVAWP +KVIIS
Subjt:  YLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

TrEMBL top hits | e value | % identity | Alignment
A0A1S4DZ27 uncharacterized protein LOC103493280 isoform X2 | 2.8e-87 | 50.94
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNI+SMHDW+DFVKEK SA FKAKSE+FK MKK QLPHTCSRKGYARL +EM                  
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------

Query:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN----------------------------
                           RIEQIDNE   T S N  N+ ISKVLG DR HI  LGFG T  K S  SQ D+                            
Subjt:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN----------------------------

Query:  -----------------------NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCG
                               +Y KLEEKYKKME ++ EMRS++S+LLKSQ                             SIN+ N LRKCK+LDWCG
Subjt:  -----------------------NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCG

Query:  SGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP  P A+LWRPNSEMTY++DA+GST+AWP +KVIIS
Subjt:  SGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

A0A1S4DZ36 uncharacterized protein LOC103493280 isoform X1 | 2.8e-87 | 50.94
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNI+SMHDW+DFVKEK SA FKAKSE+FK MKK QLPHTCSRKGYARL +EM                  
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------

Query:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN----------------------------
                           RIEQIDNE   T S N  N+ ISKVLG DR HI  LGFG T  K S  SQ D+                            
Subjt:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN----------------------------

Query:  -----------------------NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCG
                               +Y KLEEKYKKME ++ EMRS++S+LLKSQ                             SIN+ N LRKCK+LDWCG
Subjt:  -----------------------NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCG

Query:  SGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP  P A+LWRPNSEMTY++DA+GST+AWP +KVIIS
Subjt:  SGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

A0A1S4DZ41 uncharacterized protein LOC103493280 isoform X5 | 2.8e-87 | 54.6
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNI+SMHDW+DFVKEK SA FKAKSE+FK MKK QLPHTCSRKGYARL +EM                  
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------

Query:  ------------------VRIEQIDNER-----------------------------TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN
                           RIEQIDNE                               T S NV NDAISKVLGP++G    LGFGV   K   FSQR+ 
Subjt:  ------------------VRIEQIDNER-----------------------------TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN

Query:  NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVP
        +Y KLEEKYKKME ++ EMRS++S+LLKSQ                             SIN+ N LRKCK+LDWCG+GEVVAEGRWSSNDP V+VHHVP
Subjt:  NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVP

Query:  LGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        LGP AVRVWVDLP  P A+LWRPNSEMTY++DA+GST+AWP +KVIIS
Subjt:  LGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

A0A5A7U615 Plant transposase | 2.8e-87 | 56.56
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------
        MGRLWRAGKSR+V KIQ+ S+ +EL K+KPSNIQS  DW+DFVKEKK ARFKA+S +FK MKK QLPHTCSRKGYARLA++M                  
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------

Query:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDNNYMKLEEKYKKMEEQVFEMRSMVSRLLK
                           RI +I+NE     S +VAN  ISKVLGPDRGHIR  GFGVT+S  S  SQ+D++Y KLEEK +KME ++ +MR ++S LLK
Subjt:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDNNYMKLEEKYKKMEEQVFEMRSMVSRLLK

Query:  SQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMT
        SQ                             SINN N LRKC LLD  G+GEVVAEGRWSSNDPNV VHHVPLGP AV+VWVDLP    A+LWRPNSEMT
Subjt:  SQ-----------------------------SINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMT

Query:  YIEDAIGSTVAWPSNKVIIS
        Y EDA+GSTVAWP +KVI+S
Subjt:  YIEDAIGSTVAWPSNKVIIS

A0A5D3D4T6 Plant transposase | 2.8e-87 | 50.94
Query:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------
        MGRLWRAGKSRIVS+IQ+ S  +EL K+KPSNI+SMHDW+DFVKEK SA FKAKSE+FK MKK QLPHTCSRKGYARL +EM                  
Subjt:  MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEM------------------

Query:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN----------------------------
                           RIEQIDNE   T S N  N+ ISKVLG DR HI  LGFG T  K S  SQ D+                            
Subjt:  ------------------VRIEQIDNER-TTPSTNVANDAISKVLGPDRGHIRGLGFGVTSSKLSTFSQRDN----------------------------

Query:  -----------------------NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCG
                               +Y KLEEKYKKME ++ EMRS++S+LLKSQ                             SIN+ N LRKCK+LDWCG
Subjt:  -----------------------NYMKLEEKYKKMEEQVFEMRSMVSRLLKSQ-----------------------------SINNKNDLRKCKLLDWCG

Query:  SGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
        +GEVVAEGRWSSNDP V+VHHVPLGP AVRVWVDLP  P A+LWRPNSEMTY++DA+GST+AWP +KVIIS
Subjt:  SGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPTNPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGGAAGGTTATGGAGAGCAGGAAAATCACGAATTGTGTCCAAAATTCAAAATGTTTCCCATGAAGACGAACTTTGTAAATTAAAGCCAAGTAATATACAATCTATGCA
TGATTGGATCGATTTCGTGAAGGAAAAGAAGAGTGCACGATTCAAGGCAAAAAGTGAAAGATTTAAAGTCATGAAGAAGAATCAACTTCCACATACATGTAGCCGCAAAG
GATATGCTCGATTGGCAGATGAAATGGTACGCATTGAACAAATTGACAATGAAAGAACCACTCCTTCAACTAATGTGGCCAATGATGCAATAAGTAAAGTTCTTGGTCCC
GATCGTGGTCATATTCGAGGACTTGGATTTGGAGTTACTTCTTCAAAGTTGTCTACATTCTCTCAAAGAGATAACAATTATATGAAACTTGAAGAAAAGTATAAAAAGAT
GGAGGAACAAGTGTTTGAAATGAGATCCATGGTGTCTCGTCTATTGAAATCTCAAAGTATTAACAATAAAAACGATCTTCGCAAGTGCAAGTTGCTAGATTGGTGTGGTT
CAGGAGAGGTTGTTGCTGAAGGTCGATGGTCTTCAAATGACCCAAACGTTCTTGTTCATCATGTTCCCCTTGGTCCGTTAGCAGTTAGAGTTTGGGTAGATTTACCAACG
AATCCCAAGGCATATTTATGGAGGCCTAACTCAGAAATGACATATATCGAGGATGCTATTGGTAGTACAGTTGCATGGCCTTCTAACAAAGTTATAATAAGTTGA
mRNA sequence
ATGGGAAGGTTATGGAGAGCAGGAAAATCACGAATTGTGTCCAAAATTCAAAATGTTTCCCATGAAGACGAACTTTGTAAATTAAAGCCAAGTAATATACAATCTATGCA
TGATTGGATCGATTTCGTGAAGGAAAAGAAGAGTGCACGATTCAAGGCAAAAAGTGAAAGATTTAAAGTCATGAAGAAGAATCAACTTCCACATACATGTAGCCGCAAAG
GATATGCTCGATTGGCAGATGAAATGGTACGCATTGAACAAATTGACAATGAAAGAACCACTCCTTCAACTAATGTGGCCAATGATGCAATAAGTAAAGTTCTTGGTCCC
GATCGTGGTCATATTCGAGGACTTGGATTTGGAGTTACTTCTTCAAAGTTGTCTACATTCTCTCAAAGAGATAACAATTATATGAAACTTGAAGAAAAGTATAAAAAGAT
GGAGGAACAAGTGTTTGAAATGAGATCCATGGTGTCTCGTCTATTGAAATCTCAAAGTATTAACAATAAAAACGATCTTCGCAAGTGCAAGTTGCTAGATTGGTGTGGTT
CAGGAGAGGTTGTTGCTGAAGGTCGATGGTCTTCAAATGACCCAAACGTTCTTGTTCATCATGTTCCCCTTGGTCCGTTAGCAGTTAGAGTTTGGGTAGATTTACCAACG
AATCCCAAGGCATATTTATGGAGGCCTAACTCAGAAATGACATATATCGAGGATGCTATTGGTAGTACAGTTGCATGGCCTTCTAACAAAGTTATAATAAGTTGA
Protein sequence
MGRLWRAGKSRIVSKIQNVSHEDELCKLKPSNIQSMHDWIDFVKEKKSARFKAKSERFKVMKKNQLPHTCSRKGYARLADEMVRIEQIDNERTTPSTNVANDAISKVLGP
DRGHIRGLGFGVTSSKLSTFSQRDNNYMKLEEKYKKMEEQVFEMRSMVSRLLKSQSINNKNDLRKCKLLDWCGSGEVVAEGRWSSNDPNVLVHHVPLGPLAVRVWVDLPT
NPKAYLWRPNSEMTYIEDAIGSTVAWPSNKVIIS
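
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The sketch below is one way to do that in plain Python, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and `translate` is a hypothetical helper rather than anything provided by CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1, assumed here), with codons
# enumerated in T, C, A, G order so they line up with AMINO.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks ignored) into protein.

    Trailing stop codons are dropped; a codon containing anything other
    than A, C, G or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First seven codons and last six codons of the CDS listed above.
print(translate("ATGGGAAGGTTATGGAGAGCA"))  # MGRLWRA
print(translate("AAAGTTATAATAAGTTGA"))     # KVIIS
```

Passing the full CDS (with its line breaks left in) and comparing the result with the protein sequence joined into one string gives a quick consistency check on the record.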