; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G12850 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G12850
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptiontRNA-uridine aminocarboxypropyltransferase
Genome locationctg1838:5596863..5598787
RNA-Seq ExpressionCucsat.G12850
SyntenyCucsat.G12850
Gene Ontology termsGO:0008033 - tRNA processing (biological process)
GO:0016432 - tRNA-uridine aminocarboxypropyltransferase activity (molecular function)
InterPro domainsIPR005636 - DTW
IPR039262 - DTW domain-containing protein DTWD2/YfiP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036817.1 DTW domain-containing protein 2 [Cucumis melo var. makuwa]7.02e-12589.32Show/hide
Query:  MADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECS
        MADVLPSSRRPVCPSCSKPARICLCSRFRS S+ENSVG+IILQHS EKNHPLNSARI KLGLKNV+IATVSDVNFEA FTIRLPEPNSAAQNLDPD ECS
Subjt:  MADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECS

Query:  FRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEF
        FRNGH   QK QIQD SIVDKLN TK N+GAAIS TIGK+GVVNSFDHIWMQ PGFQELKINE LASPEIRASLAKGFIVKK+QKR+LVESKGLEEYAEF
Subjt:  FRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEF

Query:  EIQVPP
        EIQVPP
Subjt:  EIQVPP

XP_004150112.3 uncharacterized protein LOC101217537 [Cucumis sativus]1.14e-178100Show/hide
Query:  MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSAR
        MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSAR
Subjt:  MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSAR

Query:  IVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGF
        IVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGF
Subjt:  IVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGF

Query:  QELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
        QELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
Subjt:  QELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP

XP_008454653.1 PREDICTED: uncharacterized protein LOC103495005 isoform X1 [Cucumis melo]9.37e-14587.87Show/hide
Query:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI
        MIHF F IRFGNPK+LLNP PNR QSKLF+NL MADVLPSSRRPVCPSCSKPARICLCSRFRS S+ENSVG+IILQHS EKNHPLNSARI KLGLKNV+I
Subjt:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI

Query:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS
        ATVSDVNFEA FTIRLPEPNSAAQNLDPD ECSFRNGH   QK QIQD SIVDKLN TK N+GAAIS TIGK+GVVNSFDHIWMQ PGFQELKINE LAS
Subjt:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS

Query:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
        PEIRASLAKGFIVKK+QKR+LVESKGLEEYAEFEIQVPP
Subjt:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP

XP_008454654.1 PREDICTED: uncharacterized protein LOC103495005 isoform X2 [Cucumis melo]1.69e-15688.14Show/hide
Query:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI
        MIHF F IRFGNPK+LLNP PNR QSKLF+NL MADVLPSSRRPVCPSCSKPARICLCSRFRS S+ENSVG+IILQHS EKNHPLNSARI KLGLKNV+I
Subjt:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI

Query:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS
        ATVSDVNFEA FTIRLPEPNSAAQNLDPD ECSFRNGH   QK QIQD SIVDKLN TK N+GAAIS TIGK+GVVNSFDHIWMQ PGFQELKINE LAS
Subjt:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS

Query:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKSKENVQ
        PEIRASLAKGFIVKK+QKR+LVESKGLEEYAEFEIQVPPNFR NMGKSKENVQ
Subjt:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKSKENVQ

XP_038897268.1 uncharacterized protein LOC120085387 [Benincasa hispida]3.49e-12477.37Show/hide
Query:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI
        MIHF F IRFG+PKN LN  PN  QSKL  NL MA+V PSSRRPVCPSCSKPARICLCSR +S S+ENSVGVIILQHSLEKNHPLNSARI KLGLKNVEI
Subjt:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI

Query:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQ----DGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINE
        ATVSDVNFEARFTIRLPEPNSAAQN DPD+ECSFRNG    + PQIQ      S+V+KL  T  +EGA I+ TIGK+GVVNSFDHIWM  P  QELKINE
Subjt:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQ----DGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINE

Query:  FLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
         LASPEIRAS+AKGFIVKK+QKRQL  SK LE+YAEFEI+VPP
Subjt:  FLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP

TrEMBL top hitse value%identityAlignment
A0A0A0LD09 tRNA-uridine aminocarboxypropyltransferase5.51e-179100Show/hide
Query:  MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSAR
        MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSAR
Subjt:  MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSAR

Query:  IVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGF
        IVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGF
Subjt:  IVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGF

Query:  QELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
        QELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
Subjt:  QELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP

A0A1S3BZW7 tRNA-uridine aminocarboxypropyltransferase4.53e-14587.87Show/hide
Query:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI
        MIHF F IRFGNPK+LLNP PNR QSKLF+NL MADVLPSSRRPVCPSCSKPARICLCSRFRS S+ENSVG+IILQHS EKNHPLNSARI KLGLKNV+I
Subjt:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI

Query:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS
        ATVSDVNFEA FTIRLPEPNSAAQNLDPD ECSFRNGH   QK QIQD SIVDKLN TK N+GAAIS TIGK+GVVNSFDHIWMQ PGFQELKINE LAS
Subjt:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS

Query:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
        PEIRASLAKGFIVKK+QKR+LVESKGLEEYAEFEIQVPP
Subjt:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP

A0A1S3C0C5 tRNA-uridine aminocarboxypropyltransferase8.18e-15788.14Show/hide
Query:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI
        MIHF F IRFGNPK+LLNP PNR QSKLF+NL MADVLPSSRRPVCPSCSKPARICLCSRFRS S+ENSVG+IILQHS EKNHPLNSARI KLGLKNV+I
Subjt:  MIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEI

Query:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS
        ATVSDVNFEA FTIRLPEPNSAAQNLDPD ECSFRNGH   QK QIQD SIVDKLN TK N+GAAIS TIGK+GVVNSFDHIWMQ PGFQELKINE LAS
Subjt:  ATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLAS

Query:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKSKENVQ
        PEIRASLAKGFIVKK+QKR+LVESKGLEEYAEFEIQVPPNFR NMGKSKENVQ
Subjt:  PEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKSKENVQ

A0A5A7T5E2 tRNA-uridine aminocarboxypropyltransferase3.40e-12589.32Show/hide
Query:  MADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECS
        MADVLPSSRRPVCPSCSKPARICLCSRFRS S+ENSVG+IILQHS EKNHPLNSARI KLGLKNV+IATVSDVNFEA FTIRLPEPNSAAQNLDPD ECS
Subjt:  MADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEIATVSDVNFEARFTIRLPEPNSAAQNLDPDIECS

Query:  FRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEF
        FRNGH   QK QIQD SIVDKLN TK N+GAAIS TIGK+GVVNSFDHIWMQ PGFQELKINE LASPEIRASLAKGFIVKK+QKR+LVESKGLEEYAEF
Subjt:  FRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEF

Query:  EIQVPP
        EIQVPP
Subjt:  EIQVPP

A0A6J1INS0 tRNA-uridine aminocarboxypropyltransferase4.28e-11873.12Show/hide
Query:  MIHFSFPIRFGNPKNLLNPQPNRH-QSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVE
        M+HF F IRFGN KN LN  P  H Q KL   L MADV PSS R VCPSCSKPARICLCSR +S S+ENSVGVIILQHSLEK HPLNS+RI KLGLKNVE
Subjt:  MIHFSFPIRFGNPKNLLNPQPNRH-QSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVE

Query:  IATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQ----DGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKIN
        IATVSDVNFEARFTIRLPE NS       D E SF NG    QKPQIQ    D S+VDK  CTK++EGAAI+ TIGK+GVVNSFDHIWM  P  Q+LKI+
Subjt:  IATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQ----DGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKIN

Query:  EFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKS
        E  ASPEIRAS+AKGFIVKK+QKR++ ESK LEEYAEFEI+VPPNFRWNMGKS
Subjt:  EFLASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41750.1 DTW domain-containing protein2.3e-0435.09Show/hide
Query:  RRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNV
        RR +C +C +P  ICLC    +  +  +  +IIL H  E  H LN+  ++   L NV
Subjt:  RRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNV

AT5G54880.1 DTW domain-containing protein5.7e-2733.06Show/hide
Query:  SRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEIATVSDVNFEARFTIRLPEPNSA---AQNLDP---------
        ++RP CPSC KP+++CLC + RS   +N V + ILQHSLE+ H LNS RI +LGLKNV + TV DV+ EA F IR+     +     +LD          
Subjt:  SRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVEIATVSDVNFEARFTIRLPEPNSA---AQNLDP---------

Query:  -DIECSFRNG------------------HGTIQ------------KPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEF
         +++ SF+ G                  HG+++            +   +D  + DK+    + E   I + + KHGV+++  H  M     + L  +  
Subjt:  -DIECSFRNG------------------HGTIQ------------KPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEF

Query:  LASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP
        LASP     LAKGF+V K            E   EFE++VPP
Subjt:  LASPEIRASLAKGFIVKKMQKRQLVESKGLEEYAEFEIQVPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGTCGACGACAAAGCGCCGCTGAGAAGGATGATTCATTTTTCATTTCCAATACGATTCGGGAACCCTAAAAACCTGCTTAATCCCCAGCCAAATCGTCATCAATC
CAAGCTTTTCCATAACCTACCAATGGCGGATGTTCTTCCCAGTTCCAGAAGACCCGTTTGTCCATCCTGCTCGAAACCGGCCCGTATCTGTCTCTGTTCGCGGTTTCGTA
GTTCCAGTGTCGAGAACTCAGTTGGGGTAATCATTCTTCAGCACAGTTTAGAGAAAAACCATCCGCTTAATTCTGCTAGAATCGTAAAATTAGGTCTCAAGAATGTGGAG
ATCGCCACCGTTTCCGACGTTAACTTCGAGGCTCGCTTTACTATTCGGCTGCCTGAACCTAATTCCGCAGCACAAAATCTTGATCCTGATATAGAATGTAGTTTCAGAAA
TGGTCATGGTACGATTCAAAAGCCACAAATTCAGGATGGGTCTATTGTCGATAAACTTAACTGCACAAAAACTAACGAAGGAGCTGCAATTTCTGTTACGATTGGGAAAC
ATGGCGTCGTTAACTCATTCGATCACATCTGGATGCAACTACCGGGTTTTCAGGAGCTTAAAATCAATGAGTTCTTGGCATCTCCAGAAATTAGGGCATCATTAGCAAAA
GGGTTCATCGTCAAGAAAATGCAGAAGCGGCAGCTAGTTGAAAGCAAGGGGCTAGAAGAATATGCTGAGTTCGAGATTCAGGTTCCTCCCAATTTTAGATGGAACATGGG
CAAAAGCAAAGAGAATGTACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCGTCGACGACAAAGCGCCGCTGAGAAGGATGATTCATTTTTCATTTCCAATACGATTCGGGAACCCTAAAAACCTGCTTAATCCCCAGCCAAATCGTCATCAATC
CAAGCTTTTCCATAACCTACCAATGGCGGATGTTCTTCCCAGTTCCAGAAGACCCGTTTGTCCATCCTGCTCGAAACCGGCCCGTATCTGTCTCTGTTCGCGGTTTCGTA
GTTCCAGTGTCGAGAACTCAGTTGGGGTAATCATTCTTCAGCACAGTTTAGAGAAAAACCATCCGCTTAATTCTGCTAGAATCGTAAAATTAGGTCTCAAGAATGTGGAG
ATCGCCACCGTTTCCGACGTTAACTTCGAGGCTCGCTTTACTATTCGGCTGCCTGAACCTAATTCCGCAGCACAAAATCTTGATCCTGATATAGAATGTAGTTTCAGAAA
TGGTCATGGTACGATTCAAAAGCCACAAATTCAGGATGGGTCTATTGTCGATAAACTTAACTGCACAAAAACTAACGAAGGAGCTGCAATTTCTGTTACGATTGGGAAAC
ATGGCGTCGTTAACTCATTCGATCACATCTGGATGCAACTACCGGGTTTTCAGGAGCTTAAAATCAATGAGTTCTTGGCATCTCCAGAAATTAGGGCATCATTAGCAAAA
GGGTTCATCGTCAAGAAAATGCAGAAGCGGCAGCTAGTTGAAAGCAAGGGGCTAGAAGAATATGCTGAGTTCGAGATTCAGGTTCCTCCCAATTTTAGATGGAACATGGG
CAAAAGCAAAGAGAATGTACAATGA
Protein sequenceShow/hide protein sequence
MPVDDKAPLRRMIHFSFPIRFGNPKNLLNPQPNRHQSKLFHNLPMADVLPSSRRPVCPSCSKPARICLCSRFRSSSVENSVGVIILQHSLEKNHPLNSARIVKLGLKNVE
IATVSDVNFEARFTIRLPEPNSAAQNLDPDIECSFRNGHGTIQKPQIQDGSIVDKLNCTKTNEGAAISVTIGKHGVVNSFDHIWMQLPGFQELKINEFLASPEIRASLAK
GFIVKKMQKRQLVESKGLEEYAEFEIQVPPNFRWNMGKSKENVQ