; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021255 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021255
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr7:5917033..5921187
RNA-Seq ExpressionLag0021255
SyntenyLag0021255
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138115.1 uncharacterized protein LOC111009363 isoform X1 [Momordica charantia]9.5e-11785.94Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEA LLQTCKSKAV DFTFGAL GGGVTWAGTWRLNKFIRLNLSGGAAA+FGLWRF+RSLNSC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP
        ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRP+IRWRYRNFFSDDVAH QR HDND KNNLHGNSH  SSN DS+SNQ  SY E D KGNALEFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP

Query:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET
        V TK G DATADPLDC+F  LA+ EEIQ S++S+T+ KSH RSRRYHRRHRRHN+T
Subjt:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET

XP_022138116.1 uncharacterized protein LOC111009363 isoform X2 [Momordica charantia]9.5e-11785.94Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEA LLQTCKSKAV DFTFGAL GGGVTWAGTWRLNKFIRLNLSGGAAA+FGLWRF+RSLNSC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP
        ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRP+IRWRYRNFFSDDVAH QR HDND KNNLHGNSH  SSN DS+SNQ  SY E D KGNALEFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP

Query:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET
        V TK G DATADPLDC+F  LA+ EEIQ S++S+T+ KSH RSRRYHRRHRRHN+T
Subjt:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata]2.3e-11884.09Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEAN+LQTCKSKAV DFTFG L GGGVTWAGTWRLNKF+RLNLSGGA A+FGL RF+RSL+SC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV
        ANIVVTKYHNDPRTMQHISKHF+YE+VFDDSTLDRPKIRWRYRNFFSDDVAHAQR H NDPK+NLHGN HDSSNRDS+ NQ DSYG+ D KGNA EF PV
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV

Query:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV
         TK G D ATADPLD IF +L REEEIQ SS SS SPKSH RS+RY+RRHRRHN+TMPT FEHV
Subjt:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV

XP_022980008.1 uncharacterized protein LOC111479542 [Cucurbita maxima]8.6e-11884.09Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEAN+LQTCKSKAV DFTFG L GGGVTWAGTWRLNKF+RLNLSGGA A+FGL RF+RSL+SC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV
        ANI+VTK HNDPRTMQHISKHF+YE+VFDDSTLDRPKIRWRYRNFFSDDVAHAQRAH NDPK+NLHGN HDSSNRDS+ NQ DSYGE D KGNA EF PV
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV

Query:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV
         TK G D ATADPLD IF +L REEEIQ SS SS SPKSH RS+RY+RRHRRHN+TMPT FEHV
Subjt:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo]3.8e-11883.71Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEAN+LQTCKSKAV DFTFG L GGGVTWAGTWRLNKF+RLNLSGGA A+FGL RF+RSL+SC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV
        ANIVVTKYHNDPRTMQHISKHF+YE+VFDDSTLDRPKIRWRYRNFFSDDVAHAQR H NDPK+NLHGN HDSSNRDS+ NQ DSYG+ D KGNA EF PV
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV

Query:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV
         TK G D ATADPLD IF ++ REEEIQ SS SS SPKSH RS+RY+RRHRRHN+TMPT FEHV
Subjt:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV

TrEMBL top hitse value%identityAlignment
A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X14.5e-10475.76Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGE L ELE VLRSK N LT EEA LLQTC+SKAV DFTFG + GGG+TWAGTWRLNKF RLNLSGGAAA+ G WRF+RSLNSC+D+IL+LDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV
        ANIVVT+YHNDPR MQ+ISKHF+YE+VFDDST DRPKIRWRYRNFFSDDVAH+QR H ND  NN+H NSH    RDSS++Q DSYG+SD KGNA EFKPV
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV

Query:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV
         TK G D ATADPLDCIF +LAREEEIQ S+ S+ SPK H RSRRY+RRHR+ N+T PT+FE+V
Subjt:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X14.6e-11785.94Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEA LLQTCKSKAV DFTFGAL GGGVTWAGTWRLNKFIRLNLSGGAAA+FGLWRF+RSLNSC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP
        ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRP+IRWRYRNFFSDDVAH QR HDND KNNLHGNSH  SSN DS+SNQ  SY E D KGNALEFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP

Query:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET
        V TK G DATADPLDC+F  LA+ EEIQ S++S+T+ KSH RSRRYHRRHRRHN+T
Subjt:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET

A0A6J1C8T0 uncharacterized protein LOC111009363 isoform X24.6e-11785.94Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEA LLQTCKSKAV DFTFGAL GGGVTWAGTWRLNKFIRLNLSGGAAA+FGLWRF+RSLNSC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP
        ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRP+IRWRYRNFFSDDVAH QR HDND KNNLHGNSH  SSN DS+SNQ  SY E D KGNALEFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHD-SSNRDSSSNQGDSYGESDGKGNALEFKP

Query:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET
        V TK G DATADPLDC+F  LA+ EEIQ S++S+T+ KSH RSRRYHRRHRRHN+T
Subjt:  VHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNET

A0A6J1GVC2 uncharacterized protein LOC1114578781.1e-11884.09Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEAN+LQTCKSKAV DFTFG L GGGVTWAGTWRLNKF+RLNLSGGA A+FGL RF+RSL+SC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV
        ANIVVTKYHNDPRTMQHISKHF+YE+VFDDSTLDRPKIRWRYRNFFSDDVAHAQR H NDPK+NLHGN HDSSNRDS+ NQ DSYG+ D KGNA EF PV
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV

Query:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV
         TK G D ATADPLD IF +L REEEIQ SS SS SPKSH RS+RY+RRHRRHN+TMPT FEHV
Subjt:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV

A0A6J1IXZ4 uncharacterized protein LOC1114795424.1e-11884.09Show/hide
Query:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLT EEAN+LQTCKSKAV DFTFG L GGGVTWAGTWRLNKF+RLNLSGGA A+FGL RF+RSL+SC+DHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV
        ANI+VTK HNDPRTMQHISKHF+YE+VFDDSTLDRPKIRWRYRNFFSDDVAHAQRAH NDPK+NLHGN HDSSNRDS+ NQ DSYGE D KGNA EF PV
Subjt:  ANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPV

Query:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV
         TK G D ATADPLD IF +L REEEIQ SS SS SPKSH RS+RY+RRHRRHN+TMPT FEHV
Subjt:  HTKLGMD-ATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05430.1 unknown protein3.5e-1631.02Show/hide
Query:  ALFELEQVLRSK--QNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNK---FIRLNLSGGAAA----IFGLWRFNRSLNSCIDHILALDGS
        AL +L  VL SK  Q  +T EE+  + +C  KA+    F +  GGG+TW  T +L K     R+ L+ G AA    +   W  ++   S +DHIL+ D +
Subjt:  ALFELEQVLRSK--QNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNK---FIRLNLSGGAAA----IFGLWRFNRSLNSCIDHILALDGS

Query:  RMQKELANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQG--------DSYG
        RMQKEL N++V     +    Q +SKHFY E V+ D   D+P++RWR R  F++  +     +    + N +G  + S  R S  +          +S G
Subjt:  RMQKELANIVVTKYHNDPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQG--------DSYG

Query:  ESDGKGNALEFKPVHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPR-SRRYHRRHRRHNETMPTS
         SDG+                A  D LD +F      E I     S  + K+  R  +R  RR R  N    T+
Subjt:  ESDGKGNALEFKPVHTKLGMDATADPLDCIFDSLAREEEIQQSSTSSTSPKSHPR-SRRYHRRHRRHNETMPTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAAGCTTTATTCGAACTCGAACAAGTTCTCAGGTCCAAACAGAACAGCTTGACGTTCGAGGAAGCGAATTTGCTCCAAACATGTAAGTCTAAGGCTGTTCTAGA
TTTTACATTTGGAGCTCTCTTTGGAGGTGGTGTGACATGGGCAGGAACATGGAGGCTGAATAAGTTCATTCGGTTAAATCTTTCTGGAGGAGCTGCTGCGATATTTGGAT
TATGGAGATTTAACAGGTCCCTAAATTCATGCATCGATCATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAGTTGGCAAATATTGTAGTGACGAAATATCACAAT
GATCCTCGTACAATGCAGCACATATCCAAGCATTTTTATTATGAGAAAGTGTTTGACGATTCAACATTGGACCGGCCAAAAATAAGGTGGCGTTATCGAAATTTCTTTAG
TGATGATGTTGCTCATGCTCAGAGGGCACATGACAATGACCCTAAGAATAACTTGCATGGAAATTCCCACGATTCATCCAACCGCGACTCCAGTTCCAACCAGGGTGACT
CCTATGGTGAGTCTGATGGCAAAGGAAATGCACTTGAGTTCAAGCCAGTCCATACTAAGCTGGGCATGGATGCTACCGCAGACCCTCTGGATTGTATTTTCGATTCACTG
GCAAGAGAAGAAGAAATTCAGCAATCGAGTACCTCTAGCACATCACCGAAATCTCACCCTCGTAGTAGAAGATACCACCGTCGGCATCGAAGACATAATGAGACAATGCC
AACAAGCTTTGAACATGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGAAGCTTTATTCGAACTCGAACAAGTTCTCAGGTCCAAACAGAACAGCTTGACGTTCGAGGAAGCGAATTTGCTCCAAACATGTAAGTCTAAGGCTGTTCTAGA
TTTTACATTTGGAGCTCTCTTTGGAGGTGGTGTGACATGGGCAGGAACATGGAGGCTGAATAAGTTCATTCGGTTAAATCTTTCTGGAGGAGCTGCTGCGATATTTGGAT
TATGGAGATTTAACAGGTCCCTAAATTCATGCATCGATCATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAGTTGGCAAATATTGTAGTGACGAAATATCACAAT
GATCCTCGTACAATGCAGCACATATCCAAGCATTTTTATTATGAGAAAGTGTTTGACGATTCAACATTGGACCGGCCAAAAATAAGGTGGCGTTATCGAAATTTCTTTAG
TGATGATGTTGCTCATGCTCAGAGGGCACATGACAATGACCCTAAGAATAACTTGCATGGAAATTCCCACGATTCATCCAACCGCGACTCCAGTTCCAACCAGGGTGACT
CCTATGGTGAGTCTGATGGCAAAGGAAATGCACTTGAGTTCAAGCCAGTCCATACTAAGCTGGGCATGGATGCTACCGCAGACCCTCTGGATTGTATTTTCGATTCACTG
GCAAGAGAAGAAGAAATTCAGCAATCGAGTACCTCTAGCACATCACCGAAATCTCACCCTCGTAGTAGAAGATACCACCGTCGGCATCGAAGACATAATGAGACAATGCC
AACAAGCTTTGAACATGTGTAG
Protein sequenceShow/hide protein sequence
MGEALFELEQVLRSKQNSLTFEEANLLQTCKSKAVLDFTFGALFGGGVTWAGTWRLNKFIRLNLSGGAAAIFGLWRFNRSLNSCIDHILALDGSRMQKELANIVVTKYHN
DPRTMQHISKHFYYEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDSSNRDSSSNQGDSYGESDGKGNALEFKPVHTKLGMDATADPLDCIFDSL
AREEEIQQSSTSSTSPKSHPRSRRYHRRHRRHNETMPTSFEHV