; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017304 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017304
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF674)
Genome locationscaffold33:1476188..1476787
RNA-Seq ExpressionMS017304
SyntenyMS017304
Gene Ontology termsNA
InterPro domainsIPR007750 - Protein of unknown function DUF674


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo]2.4e-4052.43Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-----LLLPNIDAPSAA
        +RL L+IDSE +RV++ E DK  IDFLFNLLSLPLG VI LLKK+ MVG L N+YES+E +LN+ Y LQ NQSKD LLKPK+S     LLLPNI++  A 
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-----LLLPNIDAPSAA

Query:  TATIYVCGHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTS-------VGRQQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLI
            Y+CG+     + +  P A CP+C   M +    V P  +       VG   GFVK   TY+VMDDLSV  +S  S  T+L+KFN+K+V +LEEK+I
Subjt:  TATIYVCGHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTS-------VGRQQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLI

Query:  TLDVNK
        TLDVN+
Subjt:  TLDVNK

XP_008465479.1 PREDICTED: uncharacterized protein LOC103503094 [Cucumis melo]2.1e-4757.08Show/hide
Query:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI
        MEQTD + L L+ID + ERV+Y E DKKFIDFL N+LSLPLG VI LLKK  MVGCLGN+YES+ET LN++Y LQ NQS+DT+LKPK+     + LLPN+
Subjt:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI

Query:  DAPSAAT--ATIYVCGHTW-SCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQ---GFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
          P+AAT  A  Y  G T+ SC ++ S   +A CP C   + +   YV P  +  +     G VKD  TY+VMDDL+V HISDFSI T+L KFN+KDVDS
Subjt:  DAPSAAT--ATIYVCGHTW-SCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQ---GFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        LEEK+ITLDV++
Subjt:  LEEKLITLDVNK

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia]1.8e-4655.66Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-------LLLPNIDAPS
        +RL L+IDS+ +RV++ E DK  IDFLFNLLSLPLG VI LLKK+ MVGCLGN+YES+ET LN+ Y LQ NQSKD LLKPK+S       +LLPNID  +
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-------LLLPNIDAPS

Query:  AATATIYVCGHTWSCT---SFSDGPNAFCPTCAGFMRKIGKYVRP----RTSVGR----QQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
        AAT T Y+C  T       S SDGPNA CP C   M ++G +V+P     T+V      + GFVK   TY+VMDDLSV  +S  S   +L+KFNVK+V +
Subjt:  AATATIYVCGHTWSCT---SFSDGPNAFCPTCAGFMRKIGKYVRP----RTSVGR----QQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        LEEK++TLDVN+
Subjt:  LEEKLITLDVNK

XP_022139194.1 uncharacterized protein LOC111010162 [Momordica charantia]5.8e-4252.22Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNIDAPSAA
        +RL  +IDS+ +RV++ E D+ FIDFLFNLLSLPLG V+  LKK+ MVGCLGN+YES+ET LN+ Y LQ NQSKD+LLKP       ++LLP+ D  S+ 
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNIDAPSAA

Query:  TATIYVC---GHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQGFVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLITLD
        + T YVC    +      F+D PNA CP C   M K G++V+P      + GFVK    TY+VMDDLSV  +S  S   +L+KFNV  V  LEEK+ITLD
Subjt:  TATIYVC---GHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQGFVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLITLD

Query:  VNK
        VN+
Subjt:  VNK

XP_031741245.1 uncharacterized protein LOC105435653 [Cucumis sativus]4.2e-4857.08Show/hide
Query:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI
        MEQTD + L L+ID + ERV+Y E DKKFIDFL N+LSLPLG VI LLKK+ MVGCLGN+YES+ET LN +Y LQ NQS+D +LKPK+     + L+PN+
Subjt:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI

Query:  D-APSAAT-ATIYVCGHTWSCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQ----QGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
        D  P+AAT   I+ C     C  + S   +AFCP+C+  + +  +YV P    G Q    +GFVKD  TY+V DDL+V HISDFSI T+L KFN+KDVDS
Subjt:  D-APSAAT-ATIYVCGHTWSCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQ----QGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        LEEK+ITLDVN+
Subjt:  LEEKLITLDVNK

TrEMBL top hitse value%identityAlignment
A0A1S3CPD6 uncharacterized protein LOC1035030941.0e-4757.08Show/hide
Query:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI
        MEQTD + L L+ID + ERV+Y E DKKFIDFL N+LSLPLG VI LLKK  MVGCLGN+YES+ET LN++Y LQ NQS+DT+LKPK+     + LLPN+
Subjt:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI

Query:  DAPSAAT--ATIYVCGHTW-SCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQ---GFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
          P+AAT  A  Y  G T+ SC ++ S   +A CP C   + +   YV P  +  +     G VKD  TY+VMDDL+V HISDFSI T+L KFN+KDVDS
Subjt:  DAPSAAT--ATIYVCGHTW-SCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQ---GFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        LEEK+ITLDV++
Subjt:  LEEKLITLDVNK

A0A5A7U8V2 DUF674 domain-containing protein1.2e-4052.43Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-----LLLPNIDAPSAA
        +RL L+IDSE +RV++ E DK  IDFLFNLLSLPLG VI LLKK+ MVG L N+YES+E +LN+ Y LQ NQSKD LLKPK+S     LLLPNI++  A 
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-----LLLPNIDAPSAA

Query:  TATIYVCGHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTS-------VGRQQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLI
            Y+CG+     + +  P A CP+C   M +    V P  +       VG   GFVK   TY+VMDDLSV  +S  S  T+L+KFN+K+V +LEEK+I
Subjt:  TATIYVCGHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTS-------VGRQQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLI

Query:  TLDVNK
        TLDVN+
Subjt:  TLDVNK

A0A5A7V731 Putative DNA polymerase zeta catalytic subunit1.0e-4757.08Show/hide
Query:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI
        MEQTD + L L+ID + ERV+Y E DKKFIDFL N+LSLPLG VI LLKK  MVGCLGN+YES+ET LN++Y LQ NQS+DT+LKPK+     + LLPN+
Subjt:  MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNI

Query:  DAPSAAT--ATIYVCGHTW-SCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQ---GFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
          P+AAT  A  Y  G T+ SC ++ S   +A CP C   + +   YV P  +  +     G VKD  TY+VMDDL+V HISDFSI T+L KFN+KDVDS
Subjt:  DAPSAAT--ATIYVCGHTW-SCTSF-SDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQ---GFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        LEEK+ITLDV++
Subjt:  LEEKLITLDVNK

A0A6J1CBJ8 uncharacterized protein LOC1110100138.5e-4755.66Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-------LLLPNIDAPS
        +RL L+IDS+ +RV++ E DK  IDFLFNLLSLPLG VI LLKK+ MVGCLGN+YES+ET LN+ Y LQ NQSKD LLKPK+S       +LLPNID  +
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLS-------LLLPNIDAPS

Query:  AATATIYVCGHTWSCT---SFSDGPNAFCPTCAGFMRKIGKYVRP----RTSVGR----QQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
        AAT T Y+C  T       S SDGPNA CP C   M ++G +V+P     T+V      + GFVK   TY+VMDDLSV  +S  S   +L+KFNVK+V +
Subjt:  AATATIYVCGHTWSCT---SFSDGPNAFCPTCAGFMRKIGKYVRP----RTSVGR----QQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        LEEK++TLDVN+
Subjt:  LEEKLITLDVNK

A0A6J1CC87 uncharacterized protein LOC1110101622.8e-4252.22Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNIDAPSAA
        +RL  +IDS+ +RV++ E D+ FIDFLFNLLSLPLG V+  LKK+ MVGCLGN+YES+ET LN+ Y LQ NQSKD+LLKP       ++LLP+ D  S+ 
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKL-----SLLLPNIDAPSAA

Query:  TATIYVC---GHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQGFVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLITLD
        + T YVC    +      F+D PNA CP C   M K G++V+P      + GFVK    TY+VMDDLSV  +S  S   +L+KFNV  V  LEEK+ITLD
Subjt:  TATIYVC---GHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQGFVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLITLD

Query:  VNK
        VN+
Subjt:  VNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09110.1 Protein of unknown function (DUF674)1.5e-1130.1Show/hide
Query:  RMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKK-----EAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLSL------LLPN
        +  L L+ID E  RV+ AE  K F+D L +LL+LP+G ++ LL+K      ++VGCL N+Y+S+     +    +S   K  LL P+ +       L  N
Subjt:  RMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKK-----EAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLSL------LLPN

Query:  IDAPSAATATIYVCGH---TWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQG---FVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS
        ID   A     +VC +   T +C       +     C   M +          V  QQ    F     ++V+ DDL V   S   +  VL+ F     D 
Subjt:  IDAPSAATATIYVCGH---TWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQG---FVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLI
        L+E LI
Subjt:  LEEKLI

AT5G01120.1 Protein of unknown function (DUF674)6.3e-1025.47Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKP---------KLSLLL
        + L L+ID E  +VV+AE    F+D LF+  +LP+G ++ LL+     +   +GC  N+Y S+  S+   +F  +   K  LL P          + L +
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKP---------KLSLLL

Query:  PNIDAPSAATATIYVCGHTWSCT-SFSDGPNAFCPTCAGFMRKIGKYVRP--RTSVGRQQGFVKDSET-YVVMDDLSVTHISDFSIATVLDKFNVKDVDS
         + +A       ++V   +  C+  +S+   + C +C  FM ++ ++     R +  + + FV+ + T +++ DDL V   S  S   VL      D D 
Subjt:  PNIDAPSAATATIYVCGHTWSCT-SFSDGPNAFCPTCAGFMRKIGKYVRP--RTSVGRQQGFVKDSET-YVVMDDLSVTHISDFSIATVLDKFNVKDVDS

Query:  LEEKLITLDVNK
        L E ++ +++ +
Subjt:  LEEKLITLDVNK

AT5G01150.1 Protein of unknown function (DUF674)1.3e-1026.57Show/hide
Query:  RMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPK--LSLLLPNIDAP
        +  L L++D E  +VV AE  + F+D LF+LL+LP+G ++ LL+     +   +GC  N+Y S+    ++ +  ++   K  L+ PK    L    +   
Subjt:  RMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPK--LSLLLPNIDAP

Query:  SAATATIYVCGHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQG-----FVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKL
           T  I     +  C  +S+   + C  C  FM +  +       +GR Q      FV    ++V+ DDL V+  S   +   L      DV  L E+L
Subjt:  SAATATIYVCGHTWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQG-----FVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKL

Query:  ITLDVNK
        + + V +
Subjt:  ITLDVNK

AT5G43240.1 Protein of unknown function (DUF674)1.8e-0928.02Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKP------KLSLLLPNI
        ++L L+ID E  +VV+ E  K F+D LF+  +LP+G ++ LL+     ++  +GC  N+Y S+  S+   +FL +   K  LL P      K   L   +
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKP------KLSLLLPNI

Query:  DAPSAATATIYVCG---HTWSCTSFSDGPNAFCPTCAGFMRKI----GKYVRPRTSVGRQQG-FVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDV
        D   A     +VC        CT      N    +C   M ++    G+        G + G FV+ D  ++++ DDL V   S      VL      D 
Subjt:  DAPSAATATIYVCG---HTWSCTSFSDGPNAFCPTCAGFMRKI----GKYVRPRTSVGRQQG-FVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDV

Query:  DSLEEKL
        + L+EK+
Subjt:  DSLEEKL

AT5G43240.3 Protein of unknown function (DUF674)1.8e-0928.02Show/hide
Query:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKP------KLSLLLPNI
        ++L L+ID E  +VV+ E  K F+D LF+  +LP+G ++ LL+     ++  +GC  N+Y S+  S+   +FL +   K  LL P      K   L   +
Subjt:  MRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLK-----KEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKP------KLSLLLPNI

Query:  DAPSAATATIYVCG---HTWSCTSFSDGPNAFCPTCAGFMRKI----GKYVRPRTSVGRQQG-FVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDV
        D   A     +VC        CT      N    +C   M ++    G+        G + G FV+ D  ++++ DDL V   S      VL      D 
Subjt:  DAPSAATATIYVCG---HTWSCTSFSDGPNAFCPTCAGFMRKI----GKYVRPRTSVGRQQG-FVK-DSETYVVMDDLSVTHISDFSIATVLDKFNVKDV

Query:  DSLEEKL
        + L+EK+
Subjt:  DSLEEKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAACCGATAGGATGAGATTAAATCTTGTGATAGACTCGGAAGCAGAAAGAGTTGTTTATGCGGAAACAGACAAGAAATTCATCGACTTTCTTTTCAATCTGCT
TTCCCTCCCACTGGGGGCCGTGATTCACCTCCTGAAAAAGGAAGCCATGGTTGGTTGCTTGGGAAATGTTTACGAGAGCTTAGAAACAAGCTTAAACGAGGCCTATTTTC
TGCAGTCAAACCAGAGCAAAGACACGCTCTTAAAACCCAAACTCTCGTTGCTTTTGCCAAATATTGATGCCCCATCTGCAGCTACTGCTACAATTTATGTATGTGGTCAT
ACTTGGTCTTGTACTTCATTTTCTGATGGGCCTAATGCGTTTTGTCCCACTTGTGCTGGTTTCATGAGGAAAATTGGTAAATATGTGCGGCCTCGGACAAGTGTAGGAAG
GCAACAGGGATTTGTGAAGGATTCTGAAACTTATGTTGTGATGGATGATCTGAGTGTCACCCACATTTCTGACTTCTCCATCGCTACTGTTTTGGACAAGTTCAATGTCA
AGGATGTGGATTCTTTGGAGGAGAAACTCATCACCTTGGATGTCAACAAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACAAACCGATAGGATGAGATTAAATCTTGTGATAGACTCGGAAGCAGAAAGAGTTGTTTATGCGGAAACAGACAAGAAATTCATCGACTTTCTTTTCAATCTGCT
TTCCCTCCCACTGGGGGCCGTGATTCACCTCCTGAAAAAGGAAGCCATGGTTGGTTGCTTGGGAAATGTTTACGAGAGCTTAGAAACAAGCTTAAACGAGGCCTATTTTC
TGCAGTCAAACCAGAGCAAAGACACGCTCTTAAAACCCAAACTCTCGTTGCTTTTGCCAAATATTGATGCCCCATCTGCAGCTACTGCTACAATTTATGTATGTGGTCAT
ACTTGGTCTTGTACTTCATTTTCTGATGGGCCTAATGCGTTTTGTCCCACTTGTGCTGGTTTCATGAGGAAAATTGGTAAATATGTGCGGCCTCGGACAAGTGTAGGAAG
GCAACAGGGATTTGTGAAGGATTCTGAAACTTATGTTGTGATGGATGATCTGAGTGTCACCCACATTTCTGACTTCTCCATCGCTACTGTTTTGGACAAGTTCAATGTCA
AGGATGTGGATTCTTTGGAGGAGAAACTCATCACCTTGGATGTCAACAAG
Protein sequenceShow/hide protein sequence
MEQTDRMRLNLVIDSEAERVVYAETDKKFIDFLFNLLSLPLGAVIHLLKKEAMVGCLGNVYESLETSLNEAYFLQSNQSKDTLLKPKLSLLLPNIDAPSAATATIYVCGH
TWSCTSFSDGPNAFCPTCAGFMRKIGKYVRPRTSVGRQQGFVKDSETYVVMDDLSVTHISDFSIATVLDKFNVKDVDSLEEKLITLDVNK