CuGenDBv2

Clc09G15140 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G15140
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111469947
Genome location: ClcChr09:20753660..20755332
RNA-Seq Expression: Clc09G15140
Synteny: Clc09G15140
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
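
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("ClcChr09:20753660..20755332")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1673 bp on ClcChr09.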


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022930025.1 uncharacterized protein LOC111436463, partial [Cucurbita moschata]; e value 1.4e-164; identity 64.63%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLKDKLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGL SPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

XP_022930059.1 uncharacterized protein LOC111436530, partial [Cucurbita moschata]; e value 1.8e-164; identity 64.42%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+  F
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

XP_022946091.1 uncharacterized protein LOC111450286, partial [Cucurbita moschata]; e value 3.6e-165; identity 64.63%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

XP_022971168.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111469947 [Cucurbita maxima]; e value 3.6e-165; identity 64.42%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKL+YSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

XP_022973897.1 uncharacterized protein LOC111472489, partial [Cucurbita maxima]; e value 9.7e-163; identity 63.79%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPR +KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHG PKSIVSDRDVKFLSHFWRVLWGKLGTKL+YSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWE CLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1EQJ1 uncharacterized protein LOC111436530; e value 8.6e-165; identity 64.42%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+  F
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

A0A6J1EVV9 uncharacterized protein LOC111436463; e value 6.6e-165; identity 64.63%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLKDKLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGL SPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

A0A6J1G2Q3 uncharacterized protein LOC111450286; e value 1.7e-165; identity 64.63%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

A0A6J1I622 LOW QUALITY PROTEIN: uncharacterized protein LOC111469947; e value 1.7e-165; identity 64.42%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPRT+KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKL+YSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWEDCLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

A0A6J1I8S0 uncharacterized protein LOC111472489; e value 4.7e-163; identity 63.79%
Query:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF
        NFLGFVVSSNGVEVDEEK++AI+DWPTPK+VSEVRSFHGLA FYRRFIK+FSTIASPLNELVKKNVSF+W+   E+AFNTLK+KLSSAPLLALPNF+ TF
Subjt:  NFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTF

Query:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------
        EI+CDASG GIGAVLMQNQRPLMFFSEKL GASL+ PTYDKELYAL                                                      
Subjt:  EIKCDASGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYAL------------------------------------------------------

Query:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV
          GKE IVAD LSR                                                                         RLQPHGLYSPLPV
Subjt:  --GKEKIVADPLSR-------------------------------------------------------------------------RLQPHGLYSPLPV

Query:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH
        P  PWIDISMDFVLGLPR +KG DSIFVVVDRFSKMAHFIPCHKT+D KHIADLFFR+VVRLHG PKSIVSDRDVKFLSHFWRVLWGKLGTKL+YSTTCH
Subjt:  PTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCH

Query:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF
        PQTDGQTEVVNRTMT MLRAIIDKNLKTWE CLPFIEFAYNRVVHST+KCT FEI+YGF+PLTP+D LP+P  EF
Subjt:  PQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTPMDFLPMPPNEF

SwissProt top hits (columns: hit, e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein; e value 1.2e-38; identity 23.15%
Query:  KVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDL
        +V F+G+ +S  G    +E I  +  W  PK+  E+R F G   + R+FI   S +  PLN L+KK+V + W P    A   +K  L S P+L   +F  
Subjt:  KVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDL

Query:  TFEIKCDASGEGIGAVLMQNQ-----RPLMFFSEKLNGASLQNPTYDKELYAL-----------------------------------------------
           ++ DAS   +GAVL Q        P+ ++S K++ A L     DKE+ A+                                               
Subjt:  TFEIKCDASGEGIGAVLMQNQ-----RPLMFFSEKLNGASLQNPTYDKELYAL-----------------------------------------------

Query:  -------------GKEKIVADPLSRRL-------------------------------------------------------------------------
                     G    +AD LSR +                                                                         
Subjt:  -------------GKEKIVADPLSRRL-------------------------------------------------------------------------

Query:  --------------------------------------------------------------QPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIF
                                                                      +P+G   P+P    PW  +SMDF+  LP +  G +++F
Subjt:  --------------------------------------------------------------QPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIF

Query:  VVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLK
        VVVDRFSKMA  +PC K+   +  A +F ++V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N+T+  +LR +   +  
Subjt:  VVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLK

Query:  TWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSP-LTPMD
        TW D +  ++ +YN  +HS ++ T FEI++ +SP L+P++
Subjt:  TWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSP-LTPMD

P0CT41 Transposon Tf2-12 polyprotein; e value 1.2e-38; identity 23.15%
Query:  KVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDL
        +V F+G+ +S  G    +E I  +  W  PK+  E+R F G   + R+FI   S +  PLN L+KK+V + W P    A   +K  L S P+L   +F  
Subjt:  KVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDL

Query:  TFEIKCDASGEGIGAVLMQNQ-----RPLMFFSEKLNGASLQNPTYDKELYAL-----------------------------------------------
           ++ DAS   +GAVL Q        P+ ++S K++ A L     DKE+ A+                                               
Subjt:  TFEIKCDASGEGIGAVLMQNQ-----RPLMFFSEKLNGASLQNPTYDKELYAL-----------------------------------------------

Query:  -------------GKEKIVADPLSRRL-------------------------------------------------------------------------
                     G    +AD LSR +                                                                         
Subjt:  -------------GKEKIVADPLSRRL-------------------------------------------------------------------------

Query:  --------------------------------------------------------------QPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIF
                                                                      +P+G   P+P    PW  +SMDF+  LP +  G +++F
Subjt:  --------------------------------------------------------------QPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIF

Query:  VVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLK
        VVVDRFSKMA  +PC K+   +  A +F ++V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N+T+  +LR +   +  
Subjt:  VVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLK

Query:  TWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSP-LTPMD
        TW D +  ++ +YN  +HS ++ T FEI++ +SP L+P++
Subjt:  TWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSP-LTPMD

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value 2.5e-44; identity 26.89%
Query:  EKVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFD
        E+  FLG+ +    +   + K  AIRD+PTPK+V + + F G+  +YRRFI + S IA P+   +       W  K + A   LK  L ++P+L   N  
Subjt:  EKVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFD

Query:  LTFEIKCDASGEGIGAVL--MQNQRPLM----FFSEKLNGA----------------------------------------SLQN---------------
          + +  DAS +GIGAVL  + N+  L+    +FS+ L  A                                        SLQN               
Subjt:  LTFEIKCDASGEGIGAVL--MQNQRPLM----FFSEKLNGA----------------------------------------SLQN---------------

Query:  PTYDKEL-YALGKEKIVADPLSR-----------------------------------------------------------------------------
         TYD  L Y  G + +VAD +SR                                                                             
Subjt:  PTYDKEL-YALGKEKIVADPLSR-----------------------------------------------------------------------------

Query:  ------------------------------------------------------------------RLQPHGLYSPLPVPTNPWIDISMDFVLGLPRTQK
                                                                          R + HGL  PLP+    W+DISMDFV GLP T  
Subjt:  ------------------------------------------------------------------RLQPHGLYSPLPVPTNPWIDISMDFVLGLPRTQK

Query:  GKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAI
          + I VVVDRFSK AHFI   KT D   + DL FR +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E   +T+  +LRA 
Subjt:  GKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAI

Query:  IDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTP
        +  N++ W   LP IEF YN     T   + FEI  G+ P TP
Subjt:  IDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTP

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value 6.6e-45; identity 27.07%
Query:  EKVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFD
        E+  FLG+ +    +   + K  AIRD+PTPK+V + + F G+  +YRRFI + S IA P+   +       W  K + A + LKD L ++P+L   N  
Subjt:  EKVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFD

Query:  LTFEIKCDASGEGIGAVL--MQNQRPLM----FFSEKLNGA----------------------------------------SLQN---------------
          + +  DAS +GIGAVL  + N+  L+    +FS+ L  A                                        SLQN               
Subjt:  LTFEIKCDASGEGIGAVL--MQNQRPLM----FFSEKLNGA----------------------------------------SLQN---------------

Query:  PTYDKEL-YALGKEKIVADPLSR-----------------------------------------------------------------------------
         TYD  L Y  G + +VAD +SR                                                                             
Subjt:  PTYDKEL-YALGKEKIVADPLSR-----------------------------------------------------------------------------

Query:  ------------------------------------------------------------------RLQPHGLYSPLPVPTNPWIDISMDFVLGLPRTQK
                                                                          R + HGL  PLP+    W+DISMDFV GLP T  
Subjt:  ------------------------------------------------------------------RLQPHGLYSPLPVPTNPWIDISMDFVLGLPRTQK

Query:  GKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAI
          + I VVVDRFSK AHFI   KT D   + DL FR +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E   +T+  +LRA 
Subjt:  GKDSIFVVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAI

Query:  IDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTP
           N++ W   LP IEF YN     T   + FEI  G+ P TP
Subjt:  IDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSPLTP

Q9UR07 Transposon Tf2-11 polyprotein; e value 1.2e-38; identity 23.15%
Query:  KVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDL
        +V F+G+ +S  G    +E I  +  W  PK+  E+R F G   + R+FI   S +  PLN L+KK+V + W P    A   +K  L S P+L   +F  
Subjt:  KVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDL

Query:  TFEIKCDASGEGIGAVLMQNQ-----RPLMFFSEKLNGASLQNPTYDKELYAL-----------------------------------------------
           ++ DAS   +GAVL Q        P+ ++S K++ A L     DKE+ A+                                               
Subjt:  TFEIKCDASGEGIGAVLMQNQ-----RPLMFFSEKLNGASLQNPTYDKELYAL-----------------------------------------------

Query:  -------------GKEKIVADPLSRRL-------------------------------------------------------------------------
                     G    +AD LSR +                                                                         
Subjt:  -------------GKEKIVADPLSRRL-------------------------------------------------------------------------

Query:  --------------------------------------------------------------QPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIF
                                                                      +P+G   P+P    PW  +SMDF+  LP +  G +++F
Subjt:  --------------------------------------------------------------QPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIF

Query:  VVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLK
        VVVDRFSKMA  +PC K+   +  A +F ++V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N+T+  +LR +   +  
Subjt:  VVVDRFSKMAHFIPCHKTNDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLK

Query:  TWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSP-LTPMD
        TW D +  ++ +YN  +HS ++ T FEI++ +SP L+P++
Subjt:  TWEDCLPFIEFAYNRVVHSTSKCTSFEIIYGFSP-LTPMD

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value 4.9e-19; identity 42.31%
Query:  KVNFLG--FVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNF
        ++ +LG   ++S  GV  D  K+ A+  WP PK+ +E+R F GL G+YRRF+K++  I  PL EL+KKN S  W     +AF  LK  +++ P+LALP+ 
Subjt:  KVNFLG--FVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNF

Query:  DLTF
         L F
Subjt:  DLTF


Sequences
CDS sequence
ATGGAAAAAGTCAATTTTCTTGGATTTGTAGTTTCATCTAATGGTGTTGAAGTCGATGAGGAGAAGATAAGGGCTATAAGAGATTGGCCTACACCTAAAAGTGTAAGTGA
GGTAAGAAGTTTCCATGGTCTTGCCGGATTCTATCGTAGGTTCATTAAGGATTTTAGTACAATTGCATCTCCTTTAAATGAACTGGTTAAGAAGAATGTTTCTTTTGTTT
GGAAACCAAAACATGAAGTTGCTTTTAATACTTTGAAAGATAAATTGAGTTCAGCTCCATTGCTTGCATTACCTAATTTTGACTTAACATTTGAAATAAAATGTGATGCT
AGTGGAGAAGGAATAGGTGCTGTTTTGATGCAAAATCAAAGACCTTTAATGTTTTTTAGTGAAAAGTTGAATGGTGCTTCTTTGCAGAACCCAACTTATGATAAAGAACT
TTATGCTTTGGGTAAGGAGAAGATTGTTGCAGATCCTTTATCACGCAGGCTTCAACCCCATGGTTTGTATTCTCCTTTACCTGTTCCTACTAATCCTTGGATTGATATAT
CAATGGATTTTGTTTTAGGCTTACCAAGAACTCAAAAAGGTAAGGATAGTATCTTTGTTGTGGTTGATAGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACT
AATGATACAAAACATATTGCTGACTTGTTCTTTCGGAAGGTTGTGAGATTACATGGCATTCCTAAAAGCATTGTTAGTGATCGTGATGTTAAATTTTTAAGCCACTTTTG
GCGTGTTTTATGGGGTAAATTGGGAACTAAGCTTGTGTACTCAACTACTTGTCATCCTCAAACGGATGGACAAACCGAGGTTGTTAATAGAACCATGACTACTATGCTTA
GGGCTATTATTGATAAGAATCTAAAGACTTGGGAGGATTGTTTACCGTTCATCGAATTTGCATATAATAGGGTTGTTCATAGCACTAGTAAATGCACTTCTTTTGAAATC
ATTTATGGTTTTAGTCCTCTAACTCCTATGGATTTTTTACCTATGCCTCCTAATGAGTTTGTGAATTTTGATGCGAACGCAAAGGTCGAATCTGTTCATAAGCTGCATAA
GCAAGTTAAGAGCAAATTGAGAAGCAAAATTCAAAAGTTGCTGCAAAAATAA
mRNA sequence
ATGGAAAAAGTCAATTTTCTTGGATTTGTAGTTTCATCTAATGGTGTTGAAGTCGATGAGGAGAAGATAAGGGCTATAAGAGATTGGCCTACACCTAAAAGTGTAAGTGA
GGTAAGAAGTTTCCATGGTCTTGCCGGATTCTATCGTAGGTTCATTAAGGATTTTAGTACAATTGCATCTCCTTTAAATGAACTGGTTAAGAAGAATGTTTCTTTTGTTT
GGAAACCAAAACATGAAGTTGCTTTTAATACTTTGAAAGATAAATTGAGTTCAGCTCCATTGCTTGCATTACCTAATTTTGACTTAACATTTGAAATAAAATGTGATGCT
AGTGGAGAAGGAATAGGTGCTGTTTTGATGCAAAATCAAAGACCTTTAATGTTTTTTAGTGAAAAGTTGAATGGTGCTTCTTTGCAGAACCCAACTTATGATAAAGAACT
TTATGCTTTGGGTAAGGAGAAGATTGTTGCAGATCCTTTATCACGCAGGCTTCAACCCCATGGTTTGTATTCTCCTTTACCTGTTCCTACTAATCCTTGGATTGATATAT
CAATGGATTTTGTTTTAGGCTTACCAAGAACTCAAAAAGGTAAGGATAGTATCTTTGTTGTGGTTGATAGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACT
AATGATACAAAACATATTGCTGACTTGTTCTTTCGGAAGGTTGTGAGATTACATGGCATTCCTAAAAGCATTGTTAGTGATCGTGATGTTAAATTTTTAAGCCACTTTTG
GCGTGTTTTATGGGGTAAATTGGGAACTAAGCTTGTGTACTCAACTACTTGTCATCCTCAAACGGATGGACAAACCGAGGTTGTTAATAGAACCATGACTACTATGCTTA
GGGCTATTATTGATAAGAATCTAAAGACTTGGGAGGATTGTTTACCGTTCATCGAATTTGCATATAATAGGGTTGTTCATAGCACTAGTAAATGCACTTCTTTTGAAATC
ATTTATGGTTTTAGTCCTCTAACTCCTATGGATTTTTTACCTATGCCTCCTAATGAGTTTGTGAATTTTGATGCGAACGCAAAGGTCGAATCTGTTCATAAGCTGCATAA
GCAAGTTAAGAGCAAATTGAGAAGCAAAATTCAAAAGTTGCTGCAAAAATAA
Protein sequence
MEKVNFLGFVVSSNGVEVDEEKIRAIRDWPTPKSVSEVRSFHGLAGFYRRFIKDFSTIASPLNELVKKNVSFVWKPKHEVAFNTLKDKLSSAPLLALPNFDLTFEIKCDA
SGEGIGAVLMQNQRPLMFFSEKLNGASLQNPTYDKELYALGKEKIVADPLSRRLQPHGLYSPLPVPTNPWIDISMDFVLGLPRTQKGKDSIFVVVDRFSKMAHFIPCHKT
NDTKHIADLFFRKVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTTMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTSKCTSFEI
IYGFSPLTPMDFLPMPPNEFVNFDANAKVESVHKLHKQVKSKLRSKIQKLLQK
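
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the helper name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed in this record.
print(translate("ATGGAAAAAGTCAATTTTCTTGGATTTGTA"))  # MEKVNFLGFV
```

Passing the full CDS (with its line breaks left in) should reproduce the listed protein if the standard-code assumption holds.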