CuGenDBv2

MS017300 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS017300
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF674)
Genome location: scaffold33:1464046..1464915
RNA-Seq Expression: MS017300
Synteny: MS017300
Gene Ontology terms: NA
InterPro domains: IPR007750 - Protein of unknown function DUF674
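The genome location field uses a `scaffold:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("scaffold33:1464046..1464915")
print(seqid, start, end, length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 870 bp.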


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147723.1 uncharacterized protein LOC101207526 [Cucumis sativus] (e value: 2.7e-42; % identity: 51.43)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI
        +LKL+I+SK +R L+GEADKN IDFLFNLL LPLG VIRLLKK+ M G L NLYGSVE  N+ Y+QPNQSKD +LKP +SF  ST L P+         +
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI

Query:  PPCN-----NTSPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN
          C      N + + T V P  RS  +  C       A ++     GE  GFVKG+ TY+VMDDL+VKP+S  S ITLL + NIK+V +LEEKV+TLD++
Subjt:  PPCN-----NTSPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN

Query:  EVIYIHKYSL
        + + + K SL
Subjt:  EVIYIHKYSL

XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo] (e value: 1.4e-43; % identity: 52.86)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI
        RLKL+I+S+ +R L+GEADKN IDFLFNLL LPLG VIRLLKK+ MVG L NLY SVE  N+ Y+QPNQSKD +LKP +SF  ST L P+          
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI

Query:  PPCN-----NTSPSGTNVQPPSR---SREASLC----AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN
          C      N + + T V P  R   SRE +L     A ++     GE  GFVKG+ TY+VMDDL+VKP+S  S ITLL + NIK+V +LEEKVITLD+N
Subjt:  PPCN-----NTSPSGTNVQPPSR---SREASLC----AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN

Query:  EVIYIHKYSL
        + + + + SL
Subjt:  EVIYIHKYSL

XP_008465479.1 PREDICTED: uncharacterized protein LOC103503094 [Cucumis melo] (e value: 2.7e-42; % identity: 52.53)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP
        LKL+I+ K +R LYGEADK FIDFL N+L LPLG VI LLKK  MVGCLGNLY SVET N++Y+QPNQS+D VLKP + F   T L P+      +++ P
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP

Query:  P----CNNT---------SPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEK
        P    CN +         S S + V P    + +  C       A+++PA+  G     VK LATYIVMDDLTVK +S+FSI TLLK+ NIKDVDSLEEK
Subjt:  P----CNNT---------SPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEK

Query:  VITLDINEVIYIHKYSL
        VITLD++E + + + SL
Subjt:  VITLDINEVIYIHKYSL

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia] (e value: 4.5e-45; % identity: 50.89)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFI--CSTSLRPSFWDDRSSS
        RLKL+I+SK QR L+GEADKN IDFLFNLL LPLG VIRLLKK+ MVGCLGNLY SVET N+ Y+QPNQSKD +LKP +SF    ST L P+     +++
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFI--CSTSLRPSFWDDRSSS

Query:  SIPPCNNTSPS------------------------GTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKD
        +   CN+T+ +                        GT V+PPS S      AK +         GFVKG+ TY+VMDDL+VKP+S  S I LL + N+K+
Subjt:  SIPPCNNTSPS------------------------GTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKD

Query:  VDSLEEKVITLDINEVIYIHKYSL
        V +LEEKV+TLD+NE + + K SL
Subjt:  VDSLEEKVITLDINEVIYIHKYSL

XP_031741245.1 uncharacterized protein LOC105435653 [Cucumis sativus] (e value: 1.4e-43; % identity: 53.55)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP
        LKL+I+ K +R LYGEADK FIDFL N+L LPLG VIRLLKK+ MVGCLGNLY SVET N +Y+QPNQS+D VLKP + F   T L P+  D   +++ P
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP

Query:  P----------CNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEK----LGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDI
        P          C        +   PS S+  S   +       G +     GFVK LATYIV DDLTVK +S+FSI TLLK+ NIKDVDSLEEKVITLD+
Subjt:  P----------CNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEK----LGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDI

Query:  NEVIYIHKYSL
        NE + + + SL
Subjt:  NEVIYIHKYSL

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CGQ2 uncharacterized protein LOC103500268 (e value: 7.0e-44; % identity: 52.86)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI
        RLKL+I+S+ +R L+GEADKN IDFLFNLL LPLG VIRLLKK+ MVG L NLY SVE  N+ Y+QPNQSKD +LKP +SF  ST L P+          
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI

Query:  PPCN-----NTSPSGTNVQPPSR---SREASLC----AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN
          C      N + + T V P  R   SRE +L     A ++     GE  GFVKG+ TY+VMDDL+VKP+S  S ITLL + NIK+V +LEEKVITLD+N
Subjt:  PPCN-----NTSPSGTNVQPPSR---SREASLC----AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN

Query:  EVIYIHKYSL
        + + + + SL
Subjt:  EVIYIHKYSL

A0A1S3CPD6 uncharacterized protein LOC103503094 (e value: 1.3e-42; % identity: 52.53)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP
        LKL+I+ K +R LYGEADK FIDFL N+L LPLG VI LLKK  MVGCLGNLY SVET N++Y+QPNQS+D VLKP + F   T L P+      +++ P
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP

Query:  P----CNNT---------SPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEK
        P    CN +         S S + V P    + +  C       A+++PA+  G     VK LATYIVMDDLTVK +S+FSI TLLK+ NIKDVDSLEEK
Subjt:  P----CNNT---------SPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEK

Query:  VITLDINEVIYIHKYSL
        VITLD++E + + + SL
Subjt:  VITLDINEVIYIHKYSL

A0A5A7U8V2 DUF674 domain-containing protein (e value: 7.0e-44; % identity: 52.86)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI
        RLKL+I+S+ +R L+GEADKN IDFLFNLL LPLG VIRLLKK+ MVG L NLY SVE  N+ Y+QPNQSKD +LKP +SF  ST L P+          
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSI

Query:  PPCN-----NTSPSGTNVQPPSR---SREASLC----AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN
          C      N + + T V P  R   SRE +L     A ++     GE  GFVKG+ TY+VMDDL+VKP+S  S ITLL + NIK+V +LEEKVITLD+N
Subjt:  PPCN-----NTSPSGTNVQPPSR---SREASLC----AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDIN

Query:  EVIYIHKYSL
        + + + + SL
Subjt:  EVIYIHKYSL

A0A5A7V731 Putative DNA polymerase zeta catalytic subunit (e value: 1.3e-42; % identity: 52.53)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP
        LKL+I+ K +R LYGEADK FIDFL N+L LPLG VI LLKK  MVGCLGNLY SVET N++Y+QPNQS+D VLKP + F   T L P+      +++ P
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIP

Query:  P----CNNT---------SPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEK
        P    CN +         S S + V P    + +  C       A+++PA+  G     VK LATYIVMDDLTVK +S+FSI TLLK+ NIKDVDSLEEK
Subjt:  P----CNNT---------SPSGTNVQPPSRSREASLC-------AKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEK

Query:  VITLDINEVIYIHKYSL
        VITLD++E + + + SL
Subjt:  VITLDINEVIYIHKYSL

A0A6J1CBJ8 uncharacterized protein LOC111010013 (e value: 2.2e-45; % identity: 50.89)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFI--CSTSLRPSFWDDRSSS
        RLKL+I+SK QR L+GEADKN IDFLFNLL LPLG VIRLLKK+ MVGCLGNLY SVET N+ Y+QPNQSKD +LKP +SF    ST L P+     +++
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFI--CSTSLRPSFWDDRSSS

Query:  SIPPCNNTSPS------------------------GTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKD
        +   CN+T+ +                        GT V+PPS S      AK +         GFVKG+ TY+VMDDL+VKP+S  S I LL + N+K+
Subjt:  SIPPCNNTSPS------------------------GTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKD

Query:  VDSLEEKVITLDINEVIYIHKYSL
        V +LEEKV+TLD+NE + + K SL
Subjt:  VDSLEEKVITLDINEVIYIHKYSL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G09110.1 Protein of unknown function (DUF674) (e value: 1.2e-08; % identity: 25.87)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKK-----EAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRS
        L+L+I+ +  R +  EA K+F+D L +LL LP+G ++RLL+K      ++VGCL NLY SV   +    +    K  +L P  +            DD  
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKK-----EAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRS

Query:  SSSIPPCNNTSPSG------TNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDINEV
        ++    C N   +       +NV    + R  S   +  P         F     ++++ DDL V   S   ++ +L +      D L+E +I +   E+
Subjt:  SSSIPPCNNTSPSG------TNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDINEV

Query:  I
        +
Subjt:  I

AT5G01120.1 Protein of unknown function (DUF674) (e value: 1.5e-09; % identity: 27.03)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRS
        LKL+I+ +  + ++ EA  +F+D LF+   LP+G ++RLL+     +   +GC  N+Y SV +    +      K  +L P               DD  
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRS

Query:  SSSIPPCNNTSPSGTNVQPPSRSREASLCA-----------KSEPAAPYGEKLG-FVKGLAT-YIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVI
        ++    C     SG   +  S  +  S C+           + E     G +   FV+G  T +I+ DDL V+  S  S + +LK+L   D D L E ++
Subjt:  SSSIPPCNNTSPSGTNVQPPSRSREASLCA-----------KSEPAAPYGEKLG-FVKGLAT-YIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVI

Query:  TLDINEVIYIHKYSLLVGIFVS
         +++ EV      +LLV +F S
Subjt:  TLDINEVIYIHKYSLLVGIFVS

AT5G01150.1 Protein of unknown function (DUF674) (e value: 2.9e-10; % identity: 25.58)
Query:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSV----------ETFNEAYIQPNQSKD---RVLKPNLS---
        L+L+++ +  + +  EA ++F+D LF+LL LP+G ++RLL+     +   +GC  NLY SV          E   +  + P   KD   + LK N++   
Subjt:  LKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSV----------ETFNEAYIQPNQSKD---RVLKPNLS---

Query:  ----FICSTSLRPSFWDDRSSSSIPPCNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVD
            F CS     S+    S+ S   C         +Q  +  ++       +          FV G  ++++ DDL V   S   ++  LK L   DV 
Subjt:  ----FICSTSLRPSFWDDRSSSSIPPCNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVD

Query:  SLEEKVITLDINEVI
         L E+++ + + EV+
Subjt:  SLEEKVITLDINEVI

AT5G43240.1 Protein of unknown function (DUF674) (e value: 2.1e-08; % identity: 27.06)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSV----------ETFNEAYIQP---NQSKDRVLKPNLS--
        +LKL+I+ +  + ++ EA K+F+D LF+   LP+G ++RLL+     ++  +GC  N+Y SV          E   +  + P   N  K R LK  +   
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSV----------ETFNEAYIQP---NQSKDRVLKPNLS--

Query:  -----FICSTSL-RPSFWDDRSSSSIPPCNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEKLG-FVKGLAT-YIVMDDLTVKPLSEFSIITLLKELNI
             F+C   + R    +  S+ +   C+         Q   R   AS        A  G + G FV+   T +++ DDL V+  S    + +LK+L  
Subjt:  -----FICSTSL-RPSFWDDRSSSSIPPCNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEKLG-FVKGLAT-YIVMDDLTVKPLSEFSIITLLKELNI

Query:  KDVDSLEEKVITLDINEV
         D + L+EK+  +++ EV
Subjt:  KDVDSLEEKVITLDINEV

AT5G43240.3 Protein of unknown function (DUF674) (e value: 2.1e-08; % identity: 27.06)
Query:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSV----------ETFNEAYIQP---NQSKDRVLKPNLS--
        +LKL+I+ +  + ++ EA K+F+D LF+   LP+G ++RLL+     ++  +GC  N+Y SV          E   +  + P   N  K R LK  +   
Subjt:  RLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLK-----KEAMVGCLGNLYGSV----------ETFNEAYIQP---NQSKDRVLKPNLS--

Query:  -----FICSTSL-RPSFWDDRSSSSIPPCNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEKLG-FVKGLAT-YIVMDDLTVKPLSEFSIITLLKELNI
             F+C   + R    +  S+ +   C+         Q   R   AS        A  G + G FV+   T +++ DDL V+  S    + +LK+L  
Subjt:  -----FICSTSL-RPSFWDDRSSSSIPPCNNTSPSGTNVQPPSRSREASLCAKSEPAAPYGEKLG-FVKGLAT-YIVMDDLTVKPLSEFSIITLLKELNI

Query:  KDVDSLEEKVITLDINEV
         D + L+EK+  +++ EV
Subjt:  KDVDSLEEKVITLDINEV
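In the alignments above, the unlabeled row between each Query and Subjt row is the BLAST midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the snippet below counts identities and positives for one Query/midline pair. The function name `midline_stats` is hypothetical, and the example uses only the short final block of the first GenBank hit, so it does not reproduce the % identity reported for the full alignment.

```python
def midline_stats(query: str, midline: str):
    """Return (identities, positives) for one alignment block.

    Letters in the midline that match the query count as identities;
    '+' characters count as conservative substitutions. Positives are
    identities plus conservative substitutions.
    """
    # Pad or trim the midline so it lines up with the query row;
    # trailing spaces are often lost when the page is copied.
    midline = midline.ljust(len(query))[:len(query)]
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    conservative = sum(1 for m in midline if m == "+")
    return identities, identities + conservative

# Final block of the XP_004147723.1 alignment, copied from the record.
print(midline_stats("EVIYIHKYSL", "+ + + K SL"))
```

Summing these counts over every block of a hit and dividing by the total aligned length would give a percent identity, provided the complete alignment is available.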


Sequences
CDS sequence
GCAAGATTGAAGCTTGTGATAAACTCGAAAGCACAAAGAGGTCTCTATGGTGAAGCGGACAAGAATTTCATTGACTTTCTTTTCAACCTATTATGTCTTCCGCTTGGGGC
TGTAATTAGGCTGTTGAAGAAGGAAGCCATGGTGGGGTGCTTGGGGAATCTCTACGGCAGTGTAGAGACCTTCAACGAGGCATATATACAGCCAAACCAGAGCAAAGACA
GAGTCTTGAAACCCAACCTCTCATTCATCTGTTCCACTTCACTTCGACCGAGCTTTTGGGATGATCGTTCTTCCAGTAGTATTCCACCTTGTAATAACACGAGCCCATCT
GGTACAAATGTGCAGCCTCCAAGTCGAAGCAGAGAGGCGTCACTTTGTGCAAAAAGTGAACCAGCAGCTCCATATGGGGAAAAATTGGGATTTGTGAAGGGCTTGGCTAC
TTACATTGTCATGGATGACCTTACTGTGAAGCCCCTTTCTGAATTCTCCATCATTACTCTCTTGAAAGAGTTGAATATCAAAGATGTGGATTCTTTGGAGGAGAAAGTTA
TCACTTTGGATATCAATGAGGTAATATATATCCACAAATATTCACTTTTGGTAGGGATTTTTGTTTCTCATATTCTT
mRNA sequence
GCAAGATTGAAGCTTGTGATAAACTCGAAAGCACAAAGAGGTCTCTATGGTGAAGCGGACAAGAATTTCATTGACTTTCTTTTCAACCTATTATGTCTTCCGCTTGGGGC
TGTAATTAGGCTGTTGAAGAAGGAAGCCATGGTGGGGTGCTTGGGGAATCTCTACGGCAGTGTAGAGACCTTCAACGAGGCATATATACAGCCAAACCAGAGCAAAGACA
GAGTCTTGAAACCCAACCTCTCATTCATCTGTTCCACTTCACTTCGACCGAGCTTTTGGGATGATCGTTCTTCCAGTAGTATTCCACCTTGTAATAACACGAGCCCATCT
GGTACAAATGTGCAGCCTCCAAGTCGAAGCAGAGAGGCGTCACTTTGTGCAAAAAGTGAACCAGCAGCTCCATATGGGGAAAAATTGGGATTTGTGAAGGGCTTGGCTAC
TTACATTGTCATGGATGACCTTACTGTGAAGCCCCTTTCTGAATTCTCCATCATTACTCTCTTGAAAGAGTTGAATATCAAAGATGTGGATTCTTTGGAGGAGAAAGTTA
TCACTTTGGATATCAATGAGGTAATATATATCCACAAATATTCACTTTTGGTAGGGATTTTTGTTTCTCATATTCTT
Protein sequence
ARLKLVINSKAQRGLYGEADKNFIDFLFNLLCLPLGAVIRLLKKEAMVGCLGNLYGSVETFNEAYIQPNQSKDRVLKPNLSFICSTSLRPSFWDDRSSSSIPPCNNTSPS
GTNVQPPSRSREASLCAKSEPAAPYGEKLGFVKGLATYIVMDDLTVKPLSEFSIITLLKELNIKDVDSLEEKVITLDINEVIYIHKYSLLVGIFVSHIL
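The protein sequence corresponds to a frame-1 translation of the CDS. The sketch below illustrates this on the first 27 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order
# (assumption: NCBI translation table 1 applies to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1.

    Unknown codons (for example those containing N) become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 27 nucleotides of the CDS listed above.
print(translate("GCAAGATTGAAGCTTGTGATAAACTCG"))
```

The output for this prefix, ARLKLVINS, matches the start of the protein sequence in the record.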