; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G08220 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G08220
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:5976312..5979516
RNA-Seq ExpressionCSPI07G08220
SyntenyCSPI07G08220
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052114.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.6e-2754.61Show/hide
Query:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF
        G+QR GPFAKYLE+CGIV QY MP KPSMNGV +R+          LVL I TS VVQLRL L GLTKENWTQELLA ILLG  +I              
Subjt:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF

Query:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK
            N   ++DV  + PI DFT++P IEQDNN+VL   +VQT+Q QE+PL +
Subjt:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK

KYP41105.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.0e-2630.26Show/hide
Query:  MDLDLALRIDKPTSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY-------------
        MDLDLA R++KPT T E L+   +EKWE SN M LMI++ S+PE F+GSI+ES+N K     +E YF  N K +A++   K   ++Y             
Subjt:  MDLDLALRIDKPTSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY-------------

Query:  -----------------AKWHVSLAYVPTDI--------------------------MDQGEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK----
                            H+ L  +PT                             ++ ++     A +L +CGI  QYTMP KPSMNGVA+R+    
Subjt:  -----------------AKWHVSLAYVPTDI--------------------------MDQGEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK----

Query:  EDLVL---------------AISTSEVVQLRLGLIGLTK---ENWT-----------------------------QELLAVILLGILRISPSFKFYDPTS
        +D+V                A+ T+  +  R+    + K   E WT                                ++   +G    S  +KFY+PT+
Subjt:  EDLVL---------------AISTSEVVQLRLGLIGLTK---ENWT-----------------------------QELLAVILLGILRISPSFKFYDPTS

Query:  RSFFETENARFLEDVEF------------EAPILD-------FTI---KPII------------EQDNNKVLVE-PKVQTRQPQEVPLEK
        RSFFET NARFLEDVEF            E P++D        TI    P+I             QDN +VL + P  Q +QPQEVPL +
Subjt:  RSFFETENARFLEDVEF------------EAPILD-------FTI---KPII------------EQDNNKVLVE-PKVQTRQPQEVPLEK

RVW95606.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.7e-2727.81Show/hide
Query:  MDLDLALRIDKP---TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGE-----------------------
        MDLD ALR D+P   TS       + +EKWE SN M+LMI++HSIPE+ RG + E   AK    +IE  FA N K E                       
Subjt:  MDLDLALRIDKP---TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGE-----------------------

Query:  ---------------------------------------------------------------------------AKKGLLKKDCLKYAKWHVSLAYVPT
                                                                                    K G +KK C KY  WH   A V  
Subjt:  ---------------------------------------------------------------------------AKKGLLKKDCLKYAKWHVSLAYVPT

Query:  DIMDQ-------------------GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGIL
         +  +                   GEQR  PFAKYL +CGIV QYTM    S NGV +R+       +  ++V+  +    L +  W + +   +   IL
Subjt:  DIMDQ-------------------GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGIL

Query:  RISPS------------------------FKFYDPTSRSFFETENARFLEDVE-----------FEAPILDFTIKPIIEQDNNKVLVEPKVQTRQP----
           PS                        FKFYDP++RSFFET N +F+EDVE           FE    +F   PII      ++ +  +Q  QP    
Subjt:  RISPS------------------------FKFYDPTSRSFFETENARFLEDVE-----------FEAPILDFTIKPIIEQDNNKVLVEPKVQTRQP----

Query:  ---------------------------------------QEVPL-EKKEVIMALVAHLDLELHQMDVKTVFLNENIVQTIYMT-PSNDVSLLYDIKRLLK
                                                E+P+ +   +IMAL+AH DL+LHQMDVKT FLN NI +TIYM  P N  S   D K+L+ 
Subjt:  ---------------------------------------QEVPL-EKKEVIMALVAHLDLELHQMDVKTVFLNENIVQTIYMT-PSNDVSLLYDIKRLLK

Query:  KNFEIKD
         +F+ K+
Subjt:  KNFEIKD

RVX19364.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.5e-2831Show/hide
Query:  MDLDLALRIDKP-----TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY--------
        MDLD ALR ++P      ST EQ +T  +EKWE SNCM+LMI++HSIPE+ R +I E   AK    +I   FA N K E    +LK + +          
Subjt:  MDLDLALRIDKP-----TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY--------

Query:  --AKWHVSL-------AYVPTD------------------------------IMDQG-----------EQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVA
          A  H+S+       + +PTD                                D+G           EQ LGPFAKYL +CGIV QYTMP  PS NGVA
Subjt:  --AKWHVSL-------AYVPTD------------------------------IMDQG-----------EQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVA

Query:  KRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSFFETENARFLEDVEFEA--PILDFTIKPIIEQDNNKV----L
        +R+   +  +     V+ R       ++      ++   +G    S  FKFYD ++R FFET NA+F+EDVE     P+     K    QD  K+    +
Subjt:  KRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSFFETENARFLEDVEFEA--PILDFTIKPIIEQDNNKV----L

Query:  VEP----KVQTRQPQE----VPLEKKE-------------------------------------------------------------------------
        +EP    +  T+QPQE    VPL +                                                                           
Subjt:  VEP----KVQTRQPQE----VPLEKKE-------------------------------------------------------------------------

Query:  --VIMALVAHLDLELHQMDVKTVFLNENIVQTIYMTPSNDVSLLYDIKRLLKKNFEIKDLEYGSFVTTRIW
          +IMALVAH DLELHQM+VK  FLN NI +TIYM  S +     D K+ + +   +K   YG   T+R W
Subjt:  --VIMALVAHLDLELHQMDVKTVFLNENIVQTIYMTPSNDVSLLYDIKRLLKKNFEIKDLEYGSFVTTRIW

TYJ98082.1 uncharacterized protein E5676_scaffold565G00130 [Cucumis melo var. makuwa]1.6e-2754.61Show/hide
Query:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF
        G+QR GPFAKYLE+CGIV QY MP KPSMNGV +R+          LVL I TS VVQLRL L GLTKENWTQELLA ILLG  +I              
Subjt:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF

Query:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK
            N   ++DV  + PI DFT++P IEQDNN+VL   +VQT+Q QE+PL +
Subjt:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK

TrEMBL top hitse value%identityAlignment
A0A151RFD4 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-2730.26Show/hide
Query:  MDLDLALRIDKPTSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY-------------
        MDLDLA R++KPT T E L+   +EKWE SN M LMI++ S+PE F+GSI+ES+N K     +E YF  N K +A++   K   ++Y             
Subjt:  MDLDLALRIDKPTSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY-------------

Query:  -----------------AKWHVSLAYVPTDI--------------------------MDQGEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK----
                            H+ L  +PT                             ++ ++     A +L +CGI  QYTMP KPSMNGVA+R+    
Subjt:  -----------------AKWHVSLAYVPTDI--------------------------MDQGEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK----

Query:  EDLVL---------------AISTSEVVQLRLGLIGLTK---ENWT-----------------------------QELLAVILLGILRISPSFKFYDPTS
        +D+V                A+ T+  +  R+    + K   E WT                                ++   +G    S  +KFY+PT+
Subjt:  EDLVL---------------AISTSEVVQLRLGLIGLTK---ENWT-----------------------------QELLAVILLGILRISPSFKFYDPTS

Query:  RSFFETENARFLEDVEF------------EAPILD-------FTI---KPII------------EQDNNKVLVE-PKVQTRQPQEVPLEK
        RSFFET NARFLEDVEF            E P++D        TI    P+I             QDN +VL + P  Q +QPQEVPL +
Subjt:  RSFFETENARFLEDVEF------------EAPILD-------FTI---KPII------------EQDNNKVLVE-PKVQTRQPQEVPLEK

A0A438IFS9 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-2727.81Show/hide
Query:  MDLDLALRIDKP---TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGE-----------------------
        MDLD ALR D+P   TS       + +EKWE SN M+LMI++HSIPE+ RG + E   AK    +IE  FA N K E                       
Subjt:  MDLDLALRIDKP---TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGE-----------------------

Query:  ---------------------------------------------------------------------------AKKGLLKKDCLKYAKWHVSLAYVPT
                                                                                    K G +KK C KY  WH   A V  
Subjt:  ---------------------------------------------------------------------------AKKGLLKKDCLKYAKWHVSLAYVPT

Query:  DIMDQ-------------------GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGIL
         +  +                   GEQR  PFAKYL +CGIV QYTM    S NGV +R+       +  ++V+  +    L +  W + +   +   IL
Subjt:  DIMDQ-------------------GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGIL

Query:  RISPS------------------------FKFYDPTSRSFFETENARFLEDVE-----------FEAPILDFTIKPIIEQDNNKVLVEPKVQTRQP----
           PS                        FKFYDP++RSFFET N +F+EDVE           FE    +F   PII      ++ +  +Q  QP    
Subjt:  RISPS------------------------FKFYDPTSRSFFETENARFLEDVE-----------FEAPILDFTIKPIIEQDNNKVLVEPKVQTRQP----

Query:  ---------------------------------------QEVPL-EKKEVIMALVAHLDLELHQMDVKTVFLNENIVQTIYMT-PSNDVSLLYDIKRLLK
                                                E+P+ +   +IMAL+AH DL+LHQMDVKT FLN NI +TIYM  P N  S   D K+L+ 
Subjt:  ---------------------------------------QEVPL-EKKEVIMALVAHLDLELHQMDVKTVFLNENIVQTIYMT-PSNDVSLLYDIKRLLK

Query:  KNFEIKD
         +F+ K+
Subjt:  KNFEIKD

A0A438KDT7 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-2831Show/hide
Query:  MDLDLALRIDKP-----TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY--------
        MDLD ALR ++P      ST EQ +T  +EKWE SNCM+LMI++HSIPE+ R +I E   AK    +I   FA N K E    +LK + +          
Subjt:  MDLDLALRIDKP-----TSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKY--------

Query:  --AKWHVSL-------AYVPTD------------------------------IMDQG-----------EQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVA
          A  H+S+       + +PTD                                D+G           EQ LGPFAKYL +CGIV QYTMP  PS NGVA
Subjt:  --AKWHVSL-------AYVPTD------------------------------IMDQG-----------EQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVA

Query:  KRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSFFETENARFLEDVEFEA--PILDFTIKPIIEQDNNKV----L
        +R+   +  +     V+ R       ++      ++   +G    S  FKFYD ++R FFET NA+F+EDVE     P+     K    QD  K+    +
Subjt:  KRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSFFETENARFLEDVEFEA--PILDFTIKPIIEQDNNKV----L

Query:  VEP----KVQTRQPQE----VPLEKKE-------------------------------------------------------------------------
        +EP    +  T+QPQE    VPL +                                                                           
Subjt:  VEP----KVQTRQPQE----VPLEKKE-------------------------------------------------------------------------

Query:  --VIMALVAHLDLELHQMDVKTVFLNENIVQTIYMTPSNDVSLLYDIKRLLKKNFEIKDLEYGSFVTTRIW
          +IMALVAH DLELHQM+VK  FLN NI +TIYM  S +     D K+ + +   +K   YG   T+R W
Subjt:  --VIMALVAHLDLELHQMDVKTVFLNENIVQTIYMTPSNDVSLLYDIKRLLKKNFEIKDLEYGSFVTTRIW

A0A5A7U9T3 Retrovirus-related Pol polyprotein from transposon TNT 1-947.7e-2854.61Show/hide
Query:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF
        G+QR GPFAKYLE+CGIV QY MP KPSMNGV +R+          LVL I TS VVQLRL L GLTKENWTQELLA ILLG  +I              
Subjt:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF

Query:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK
            N   ++DV  + PI DFT++P IEQDNN+VL   +VQT+Q QE+PL +
Subjt:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK

A0A5D3BE89 Integrase catalytic domain-containing protein7.7e-2854.61Show/hide
Query:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF
        G+QR GPFAKYLE+CGIV QY MP KPSMNGV +R+          LVL I TS VVQLRL L GLTKENWTQELLA ILLG  +I              
Subjt:  GEQRLGPFAKYLEKCGIVSQYTMPEKPSMNGVAKRK--------EDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSF

Query:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK
            N   ++DV  + PI DFT++P IEQDNN+VL   +VQT+Q QE+PL +
Subjt:  FETENARFLEDVEFEAPILDFTIKPIIEQDNNKVLVEPKVQTRQPQEVPLEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTAGACCTTGCATTGAGGATCGATAAACCTACTTCTACTAAGGAACAACTGAATACGACTAATATTGAGAAGTGGGAACTGTCAAATTGCATGAACCTGATGAT
TATTAGGCACTCCATTCCAGAGTCTTTTCGGGGTTCTATCACTGAAAGTGAAAATGCCAAAAAGTGTTTTGTCGAAATTGAAAAATATTTTGCTAAAAATGGAAAAGGGG
AAGCAAAGAAAGGTCTTCTCAAGAAAGATTGTCTCAAGTATGCCAAATGGCATGTTAGTTTAGCTTATGTACCTACAGATATAATGGATCAGGGTGAACAACGTTTAGGA
CCCTTTGCCAAATACCTAGAAAAATGTGGAATCGTCTCGCAATACACTATGCCAGAAAAACCTAGCATGAATGGTGTAGCGAAAAGGAAAGAAGACCTAGTATTAGCCAT
CTCAACGTCTGAGGTTGTCCAACTAAGGCTAGGCCTTATAGGCCTAACAAAAGAAAATTGGACCCAAGAACTATTAGCTGTTATTTTGTTGGGTATTCTAAGGATTTCTC
CGAGTTTTAAGTTTTATGATCCCACTTCGAGATCGTTTTTTGAGACTGAAAATGCTAGATTCCTTGAGGATGTTGAGTTTGAGGCTCCAATTCTTGACTTCACTATAAAA
CCAATTATAGAACAAGACAACAACAAAGTCCTTGTTGAACCTAAAGTTCAAACTCGACAACCTCAAGAAGTGCCATTAGAGAAAAAAGAAGTAATCATGGCATTAGTAGC
TCACTTAGATTTAGAGCTACATCAGATGGATGTGAAAACTGTGTTTCTCAATGAAAACATTGTTCAGACGATTTATATGACTCCAAGTAATGATGTAAGTTTATTGTATG
ACATTAAGAGGCTTCTCAAAAAGAATTTTGAGATAAAGGATCTTGAATATGGTTCTTTTGTCACTACAAGGATTTGGGAAATTCGACATTTGTACAAAAACTCTTCGAGA
AAACCTCTTCTGACGTTAGATGCTCCATCGGGAAGATGTCGAGCATGCTACGTTAGGGAAGCAATTTTCGATTCACAAACAAGAACGTCAAGAATGTCGTATCCTCGACG
CCATCAATGCCAACGGATGGTTGACGTCGATAGTGGCTTACGCTCAATGTCTCCTCCGTATATTATCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTAGACCTTGCATTGAGGATCGATAAACCTACTTCTACTAAGGAACAACTGAATACGACTAATATTGAGAAGTGGGAACTGTCAAATTGCATGAACCTGATGAT
TATTAGGCACTCCATTCCAGAGTCTTTTCGGGGTTCTATCACTGAAAGTGAAAATGCCAAAAAGTGTTTTGTCGAAATTGAAAAATATTTTGCTAAAAATGGAAAAGGGG
AAGCAAAGAAAGGTCTTCTCAAGAAAGATTGTCTCAAGTATGCCAAATGGCATGTTAGTTTAGCTTATGTACCTACAGATATAATGGATCAGGGTGAACAACGTTTAGGA
CCCTTTGCCAAATACCTAGAAAAATGTGGAATCGTCTCGCAATACACTATGCCAGAAAAACCTAGCATGAATGGTGTAGCGAAAAGGAAAGAAGACCTAGTATTAGCCAT
CTCAACGTCTGAGGTTGTCCAACTAAGGCTAGGCCTTATAGGCCTAACAAAAGAAAATTGGACCCAAGAACTATTAGCTGTTATTTTGTTGGGTATTCTAAGGATTTCTC
CGAGTTTTAAGTTTTATGATCCCACTTCGAGATCGTTTTTTGAGACTGAAAATGCTAGATTCCTTGAGGATGTTGAGTTTGAGGCTCCAATTCTTGACTTCACTATAAAA
CCAATTATAGAACAAGACAACAACAAAGTCCTTGTTGAACCTAAAGTTCAAACTCGACAACCTCAAGAAGTGCCATTAGAGAAAAAAGAAGTAATCATGGCATTAGTAGC
TCACTTAGATTTAGAGCTACATCAGATGGATGTGAAAACTGTGTTTCTCAATGAAAACATTGTTCAGACGATTTATATGACTCCAAGTAATGATGTAAGTTTATTGTATG
ACATTAAGAGGCTTCTCAAAAAGAATTTTGAGATAAAGGATCTTGAATATGGTTCTTTTGTCACTACAAGGATTTGGGAAATTCGACATTTGTACAAAAACTCTTCGAGA
AAACCTCTTCTGACGTTAGATGCTCCATCGGGAAGATGTCGAGCATGCTACGTTAGGGAAGCAATTTTCGATTCACAAACAAGAACGTCAAGAATGTCGTATCCTCGACG
CCATCAATGCCAACGGATGGTTGACGTCGATAGTGGCTTACGCTCAATGTCTCCTCCGTATATTATCTCTTAA
Protein sequenceShow/hide protein sequence
MDLDLALRIDKPTSTKEQLNTTNIEKWELSNCMNLMIIRHSIPESFRGSITESENAKKCFVEIEKYFAKNGKGEAKKGLLKKDCLKYAKWHVSLAYVPTDIMDQGEQRLG
PFAKYLEKCGIVSQYTMPEKPSMNGVAKRKEDLVLAISTSEVVQLRLGLIGLTKENWTQELLAVILLGILRISPSFKFYDPTSRSFFETENARFLEDVEFEAPILDFTIK
PIIEQDNNKVLVEPKVQTRQPQEVPLEKKEVIMALVAHLDLELHQMDVKTVFLNENIVQTIYMTPSNDVSLLYDIKRLLKKNFEIKDLEYGSFVTTRIWEIRHLYKNSSR
KPLLTLDAPSGRCRACYVREAIFDSQTRTSRMSYPRRHQCQRMVDVDSGLRSMSPPYIIS