; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030687 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030687
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr11:407184..408776
RNA-Seq ExpressionLag0030687
SyntenyLag0030687
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145897.1 uncharacterized protein LOC101214743 [Cucumis sativus]3.7e-10783.81Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVH-GRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFS
        MSGGVGP C DISLPKEQE +HKEA + K      GRRKAAAFLS RQLNALAVV+IFSASGMVCAEDL FV+FSIMYMYF+SRVAFP + G G+A VF 
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVH-GRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFS

Query:  PENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSG
        PEN R+LRLYV F A+VGLFLP+AYILEGFFE+DKEGIKAASPHVFLLASQVF+EGVA ND FSTPIRVFVPVFYNSRRIFTL EWLR+EFAKEDKEYSG
Subjt:  PENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSG

Query:  SVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        SVRRL+VGRALAVANMALWSFNLFGFLLPV+LP AFKRYYSLYKSKD
Subjt:  SVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

XP_008437515.1 PREDICTED: uncharacterized protein LOC103482907 [Cucumis melo]5.0e-11287.8Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFSP
        MSGGVGPTCSDISLPKEQE +HKEA + K   V GRRKAAAFLS RQLNALAVV+IFSASGMVCAEDLAFVVFSIMYMYFISRVAFP M G G+A VF P
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFSP

Query:  ENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS
        EN R+LRLYV F A++GLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVF+EGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLRDEFAKEDKEYSGS
Subjt:  ENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS

Query:  VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        VRRL+VGRALAVANMALWSFNLFGFLLPV+LP AFKRYYSLYKSKD
Subjt:  VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

XP_022156297.1 uncharacterized protein LOC111023219 [Momordica charantia]1.4e-10682.94Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKP-AAVHG---RRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGE-AT
        MSGGVGPTCSDISLP EQE LHKE C+ K  A VHG   R+ AAAFLSFRQLNALAVVVIFSASGMVCAEDLAFV+FS+ YMYFISRVAFP   GGE   
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKP-AAVHG---RRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGE-AT

Query:  VFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKE
        VFSP   ++LRLYV   A +GLFLPI YILEGFFEDDKEGIKAASPHVFLLASQVF+EGVA NDRFSTPIRVFVPVFYNSRRIFT+AEWLRDEFAKEDKE
Subjt:  VFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKE

Query:  YSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY-SLYKSKDH
        +SGSVRRLLVGRALAVANMALWSFNLFGFLLPV+LP A  RYY SL+KSKDH
Subjt:  YSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY-SLYKSKDH

XP_022973006.1 uncharacterized protein LOC111471524 [Cucurbita maxima]2.1e-10277.69Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLK------------PAAVHGRR--KAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAF
        MSGGVGPTCSDISLP EQE+LHKE+C+ K            P  V  +R   AAAFLSF+QLNALAVVVIFSASGMVCAEDLAFVVFS+MYMYFISRVAF
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLK------------PAAVHGRR--KAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAF

Query:  PTMAG-GEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWL
        P +AG GE TVFS EN R+LRLY FFT VVG FLPIAYILEGFFE+DKEGIKAASPHVFLLASQ F+EGVA NDRFSTPIRVFVPV YN+RR+FTL EWL
Subjt:  PTMAG-GEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWL

Query:  RDEFAKEDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        RDEFAKEDKEYSGSVRR ++GR LAV NMA+WSFNLFG LLP+++P AFKRYYS+ KSKD
Subjt:  RDEFAKEDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

XP_038906981.1 uncharacterized protein LOC120092829 [Benincasa hispida]3.5e-11387.01Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLK--------PAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMA-G
        MSGGVGPTCSDISLPKEQE LHKEA + K        P+   GRRKAAAFLSFRQLNALAVV+IFSASGMVCAEDLAFVVFS+MYMYFISRVAFPT+   
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLK--------PAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMA-G

Query:  GEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAK
        GE TVFSPEN +MLRLYVFF AVVGLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVF+EGVA  DRFSTPIRVFVPVFYNSRRIFTLAEWLRDEF K
Subjt:  GEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAK

Query:  EDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        EDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPV+LP AFKRYYSLYKSKD
Subjt:  EDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

TrEMBL top hitse value%identityAlignment
A0A0A0KK26 Uncharacterized protein1.8e-10783.81Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVH-GRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFS
        MSGGVGP C DISLPKEQE +HKEA + K      GRRKAAAFLS RQLNALAVV+IFSASGMVCAEDL FV+FSIMYMYF+SRVAFP + G G+A VF 
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVH-GRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFS

Query:  PENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSG
        PEN R+LRLYV F A+VGLFLP+AYILEGFFE+DKEGIKAASPHVFLLASQVF+EGVA ND FSTPIRVFVPVFYNSRRIFTL EWLR+EFAKEDKEYSG
Subjt:  PENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSG

Query:  SVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        SVRRL+VGRALAVANMALWSFNLFGFLLPV+LP AFKRYYSLYKSKD
Subjt:  SVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

A0A1S3AUS6 uncharacterized protein LOC1034829072.4e-11287.8Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFSP
        MSGGVGPTCSDISLPKEQE +HKEA + K   V GRRKAAAFLS RQLNALAVV+IFSASGMVCAEDLAFVVFSIMYMYFISRVAFP M G G+A VF P
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFSP

Query:  ENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS
        EN R+LRLYV F A++GLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVF+EGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLRDEFAKEDKEYSGS
Subjt:  ENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS

Query:  VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        VRRL+VGRALAVANMALWSFNLFGFLLPV+LP AFKRYYSLYKSKD
Subjt:  VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

A0A5D3C472 Uncharacterized protein2.4e-11287.8Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFSP
        MSGGVGPTCSDISLPKEQE +HKEA + K   V GRRKAAAFLS RQLNALAVV+IFSASGMVCAEDLAFVVFSIMYMYFISRVAFP M G G+A VF P
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAG-GEATVFSP

Query:  ENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS
        EN R+LRLYV F A++GLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVF+EGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLRDEFAKEDKEYSGS
Subjt:  ENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS

Query:  VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        VRRL+VGRALAVANMALWSFNLFGFLLPV+LP AFKRYYSLYKSKD
Subjt:  VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

A0A6J1DRN6 uncharacterized protein LOC1110232196.9e-10782.94Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKP-AAVHG---RRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGE-AT
        MSGGVGPTCSDISLP EQE LHKE C+ K  A VHG   R+ AAAFLSFRQLNALAVVVIFSASGMVCAEDLAFV+FS+ YMYFISRVAFP   GGE   
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKP-AAVHG---RRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGE-AT

Query:  VFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKE
        VFSP   ++LRLYV   A +GLFLPI YILEGFFEDDKEGIKAASPHVFLLASQVF+EGVA NDRFSTPIRVFVPVFYNSRRIFT+AEWLRDEFAKEDKE
Subjt:  VFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKE

Query:  YSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY-SLYKSKDH
        +SGSVRRLLVGRALAVANMALWSFNLFGFLLPV+LP A  RYY SL+KSKDH
Subjt:  YSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY-SLYKSKDH

A0A6J1I7I0 uncharacterized protein LOC1114715241.0e-10277.69Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLK------------PAAVHGRR--KAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAF
        MSGGVGPTCSDISLP EQE+LHKE+C+ K            P  V  +R   AAAFLSF+QLNALAVVVIFSASGMVCAEDLAFVVFS+MYMYFISRVAF
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLK------------PAAVHGRR--KAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAF

Query:  PTMAG-GEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWL
        P +AG GE TVFS EN R+LRLY FFT VVG FLPIAYILEGFFE+DKEGIKAASPHVFLLASQ F+EGVA NDRFSTPIRVFVPV YN+RR+FTL EWL
Subjt:  PTMAG-GEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWL

Query:  RDEFAKEDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD
        RDEFAKEDKEYSGSVRR ++GR LAV NMA+WSFNLFG LLP+++P AFKRYYS+ KSKD
Subjt:  RDEFAKEDKEYSGSVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYYSLYKSKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27990.1 unknown protein2.3e-3036.73Show/hide
Query:  QLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLL
        +L  +A +++FSASG+V   D+ F  F+ +Y+  +SR+AFP+     A+       ++ RLYV     +GLFLP+AY+L GF   D   +++A+PH+FLL
Subjt:  QLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGEATVFSPENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLL

Query:  ASQVFVEGV-AWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS---VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY
        + Q+  E V +    FS P+R  VP+ Y   RIF +  W +D +  +    + +   V     GR LA+AN+  +  NL  FL+P FLP AF++Y+
Subjt:  ASQVFVEGV-AWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGS---VRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY

AT5G23920.1 unknown protein2.7e-3937.67Show/hide
Query:  QEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMY-FISRVAFPTMAGGEATVFSPENVRMLRLYVFFTAVVG
        +E+  K+  + KP+      K       RQL  L+ +++ +A G+V   ++AFV+   +Y+Y F+SR AFP     +    S    ++ + Y   TA++G
Subjt:  QEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMY-FISRVAFPTMAGGEATVFSPENVRMLRLYVFFTAVVG

Query:  LFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGSVRRLLVGRALAVANMAL
        L  P+ YI +G +  D  G  AA+PH+FLL+ Q F E + ++D++S PI +  PVFYN+RRIF L +W++ EF+  D +  G   RL  GR +A  N  +
Subjt:  LFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGSVRRLLVGRALAVANMAL

Query:  WSFNLFGFLLPVFLPMAFKRYYS
        W +NLFG LLPVFLP + + Y+S
Subjt:  WSFNLFGFLLPVFLPMAFKRYYS

AT5G52420.1 unknown protein1.6e-6349.58Show/hide
Query:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFP--TMAGGEATVFS
        MSGGVGPT +DI+LPKE+E  H+       + V    K A F SFRQLN LA++++ SASG+V  +D  F + +++Y +F+S++ FP       +A + S
Subjt:  MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFP--TMAGGEATVFS

Query:  PENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSG
          N ++ R+YV    +VGL +PI YI EG  EDDK G+ AA+PHVFLLASQ+F+EG+A    FS P R+ VP+ YN+RR+ TL EW+  EF++ED   + 
Subjt:  PENVRMLRLYVFFTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSG

Query:  SVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY
        S RR+  G+ LA AN+ +WSFNLFG L+PV+LP AFKRYY
Subjt:  SVRRLLVGRALAVANMALWSFNLFGFLLPVFLPMAFKRYY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGTGGGGTGGGGCCTACGTGCAGCGACATAAGCCTCCCGAAGGAACAGGAAGTCCTCCACAAAGAAGCCTGCAATCTCAAGCCCGCCGCCGTCCATGGGCGGAG
GAAGGCGGCGGCGTTTCTGAGTTTCCGGCAACTGAACGCGCTGGCGGTGGTGGTGATCTTCTCGGCCAGCGGAATGGTGTGCGCGGAGGACTTGGCGTTTGTGGTATTCT
CGATCATGTACATGTACTTCATCTCGAGAGTGGCGTTTCCGACGATGGCGGGCGGGGAGGCGACGGTGTTCAGCCCGGAGAACGTCAGGATGCTCCGGCTGTATGTGTTC
TTCACCGCCGTGGTGGGGCTGTTCTTGCCGATTGCGTACATTCTGGAGGGGTTCTTTGAAGATGATAAAGAAGGCATCAAGGCCGCTTCTCCACATGTGTTTCTTCTCGC
CAGCCAGGTTTTCGTGGAAGGTGTGGCGTGGAACGACAGATTCTCGACGCCAATCCGCGTGTTCGTGCCTGTTTTCTACAACTCGAGGAGGATCTTCACGCTCGCGGAGT
GGCTGCGGGACGAGTTCGCCAAGGAAGACAAGGAATACAGCGGCTCGGTGCGGCGGTTGCTGGTCGGAAGAGCGCTTGCTGTGGCGAACATGGCGCTTTGGAGCTTCAAT
CTGTTTGGGTTCTTATTGCCTGTGTTTCTGCCAATGGCTTTCAAGAGATATTATTCTCTTTACAAATCCAAAGATCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGTGGGGTGGGGCCTACGTGCAGCGACATAAGCCTCCCGAAGGAACAGGAAGTCCTCCACAAAGAAGCCTGCAATCTCAAGCCCGCCGCCGTCCATGGGCGGAG
GAAGGCGGCGGCGTTTCTGAGTTTCCGGCAACTGAACGCGCTGGCGGTGGTGGTGATCTTCTCGGCCAGCGGAATGGTGTGCGCGGAGGACTTGGCGTTTGTGGTATTCT
CGATCATGTACATGTACTTCATCTCGAGAGTGGCGTTTCCGACGATGGCGGGCGGGGAGGCGACGGTGTTCAGCCCGGAGAACGTCAGGATGCTCCGGCTGTATGTGTTC
TTCACCGCCGTGGTGGGGCTGTTCTTGCCGATTGCGTACATTCTGGAGGGGTTCTTTGAAGATGATAAAGAAGGCATCAAGGCCGCTTCTCCACATGTGTTTCTTCTCGC
CAGCCAGGTTTTCGTGGAAGGTGTGGCGTGGAACGACAGATTCTCGACGCCAATCCGCGTGTTCGTGCCTGTTTTCTACAACTCGAGGAGGATCTTCACGCTCGCGGAGT
GGCTGCGGGACGAGTTCGCCAAGGAAGACAAGGAATACAGCGGCTCGGTGCGGCGGTTGCTGGTCGGAAGAGCGCTTGCTGTGGCGAACATGGCGCTTTGGAGCTTCAAT
CTGTTTGGGTTCTTATTGCCTGTGTTTCTGCCAATGGCTTTCAAGAGATATTATTCTCTTTACAAATCCAAAGATCATTGA
Protein sequenceShow/hide protein sequence
MSGGVGPTCSDISLPKEQEVLHKEACNLKPAAVHGRRKAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVVFSIMYMYFISRVAFPTMAGGEATVFSPENVRMLRLYVF
FTAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFVEGVAWNDRFSTPIRVFVPVFYNSRRIFTLAEWLRDEFAKEDKEYSGSVRRLLVGRALAVANMALWSFN
LFGFLLPVFLPMAFKRYYSLYKSKDH