; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025941 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025941
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:24747830..24751465
RNA-Seq ExpressionLag0025941
SyntenyLag0025941
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOX99415.1 Uncharacterized protein TCM_008095 [Theobroma cacao]2.4e-14634.38Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQV+  +LKA+++ WDP+YRCF F  +DM PTI
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLG----------------------------------------------------KILCGYVEERVVK
        EEY +LL + +    + Y   Q+   +R L+ ++G                                                    K+L G++E  ++ 
Subjt:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLG----------------------------------------------------KILCGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI
         F ++    +P   +LA+T RSLNYCR KG GRF+GCA LL IW++SH      F+C   KF   ++    PI EF +S W  +   +  W + F EL  
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI

Query:  EDVIWRAPWMSTRPMIYKCVRQIWV----------------------RQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYK
         +V WRAPWM   P++YKC  + WV                       QF+P TH L   EFAY +     +I++I +AWK  +++  G++ D+ T  Y+
Subjt:  EDVIWRAPWMSTRPMIYKCVRQIWV----------------------RQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYK

Query:  TWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEA
         WH  R K V V PK   +  L   V P+ +      E    +    N+S  Q  E LQ E ++      E  K++ +++G   K E L  + SA  +E 
Subjt:  TWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEA

Query:  RDLNRRMHRLRRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQ---------TITEDATDRLMKDYTYLKEQYDRL---------
          L R +    R+N+ ++  L++ ND    QV  QQ  I  L +  +ELE  ++   Q         T  ++  +   + YT    Q D L         
Subjt:  RDLNRRMHRLRRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQ---------TITEDATDRLMKDYTYLKEQYDRL---------

Query:  -------------SDDFGFARQNHATLRSKAEHML-------------TQIRRVTRRADELAEDARTLSKVIAP-----------TQPNSKNVLKFLGKL
                     S  F   +Q    ++   EH++             ++I R  + +DE   + R  ++++              Q   +  L  + +L
Subjt:  -------------SDDFGFARQNHATLRSKAEHML-------------TQIRRVTRRADELAEDARTLSKVIAP-----------TQPNSKNVLKFLGKL

Query:  RISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKEL
         +SL        +PA +E P  + S  +   P Y P +  P  Q     HP    P + N   P+ +QP    F   + + P  +I +P+LDDP+ +++L
Subjt:  RISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKEL

Query:  TG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPA
               G+     +K ++LEERLRA+EG D FG +DAT+LCLVPDV++P KFKVP+FEKYDG  CP  H+ MYCRKMAA   +DKLLI+ FQDSL G A
Subjt:  TG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPA

Query:  SRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDI
        +RWY+QLD   I +WK LA +F+ QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A  NF+D+
Subjt:  SRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDI

Query:  MTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        +  GE IE  IK GK I+    +++KKG   KKKEG+VQ +  +S+Q + N   P Y Y PPY    P+I N +    +PYV
Subjt:  MTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

EOY09468.1 Uncharacterized protein TCM_024883 [Theobroma cacao]3.9e-14435.94Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTP--
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQ++  +LKA+++ WDP+YRCF F  IDM    
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTP--

Query:  ----------TIEEYQSLLHMPVRTEVEAYSYDQELTMKR------ALSTLLG--------KILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYC
                  T  E    L      E   +S+ +   MK+       L   LG        K+L G++E R++  F ++    +P   +LAET RSLNYC
Subjt:  ----------TIEEYQSLLHMPVRTEVEAYSYDQELTMKR------ALSTLLG--------KILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYC

Query:  RTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVR
        R KG GRF+GCA LL IW++SH      F+C   KF   ++    PI EF +S W  +   +  W +   EL   +V WRAPWM   P++YKC  + WV+
Subjt:  RTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVR

Query:  ----------------------QFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAK
                              QF+P TH L   EFAY++ GF K RI++I +AWK  +++  G++ D+ T  Y+ WH  R K V V PK      L   
Subjt:  ----------------------QFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAK

Query:  VIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMH-RLRRDNEVSQATLKSR
        V P+ +      E    +    N+S  Q  E LQ E ++      E  K++ +++G   K E L  + S   T   +L R +  R  R NE     L++ 
Subjt:  VIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMH-RLRRDNEVSQATLKSR

Query:  ND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTL
        ND    QV  QQ  I  L +  +ELE  ++   Q            +Y  LK+Q  R+ +     RQ +     + ++++ Q+R V  +A  +A     L
Subjt:  ND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTL

Query:  SKVIAPTQPNSKNVLKFLG--------------------------------KLRISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ---
           I P     + ++K+L                                 KL +SL        +PA +E P  + S  +   P Y P +  P  Q   
Subjt:  SKVIAPTQPNSKNVLKFLG--------------------------------KLRISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ---

Query:  --HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDV
          HP      + N   P+ +QP    F     V P   I VP+LDDP+ +++L        E     +K ++LEERLRA+EG D FG +DAT+LCLVPDV
Subjt:  --HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDV

Query:  ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKST
        ++P KFKVPEFEKYD   CP  H+ M CRKMAA   +DKLLIH FQDSL G A+RWY+QLD   I +WK+LA +F+ QYKH  ++APDRL LQ MEKK +
Subjt:  ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKST

Query:  ESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQ
        E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A+ NF+D++  GE IE  IK GK I+    +++KKG   +KKEG+VQ +  +S+Q
Subjt:  ESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQ

Query:  PYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
         +       Y+  PPY    P+I N +    +PYV
Subjt:  PYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]1.2e-15837.04Show/hide
Query:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V QWSE+ Q   G S    +E  + +   + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTI
Subjt:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI-----------------------------------------LC-----------GYVEERVVK
        EEYQ+L+ +PV    + Y Y ++LT++R+LS  +G I                                         LC           GYVEE VVK
Subjt:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI-----------------------------------------LC-----------GYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI
        +F  IE GV+P+IP++AETFRSLN+CR +G G+F  CAP+L+IW+ SH++ P  F  P+IKF+S WNK +N I EF  + W    PE  A+         
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI

Query:  EDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKL
                 ++  P++   +RQ W +QF P    L+D+EF+Y+     D+I + V+AWK + K++S +  + TTE Y  W A+R   + ++P       +
Subjt:  EDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKL

Query:  NAKVIPDQQTEQ-AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL
        +  ++P+ + ++   +EKE   LR+ N  L  ENE+L+ EVK+ + + V   ++L+      +K+++LE  +                     EV Q ++
Subjt:  NAKVIPDQQTEQ-AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL

Query:  KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELA
        +    Q+++ +S+   L + +   E  +      I   A + +  DY  L E+Y  +S DF   +  + TLR K        E    ++R++ R AD+ +
Subjt:  KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELA

Query:  EDARTLSKVIAPTQPNSKNVLKFLG------KLRISLDPAYNE--------RPSNRQSTEGTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLP
          ARTL + + PT+   K +  FLG       LR  ++   ++             +S  GT+ +     N  +E  P   P F  ++         + P
Subjt:  EDARTLSKVIAPTQPNSKNVLKFLG------KLRISLDPAYNE--------RPSNRQSTEGTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLP

Query:  TEVP-PKVTITV---PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMY
        T  P P    T     + ++P     + GG+K+S    S  +LE LEERLR +E                            +FEKY+G SCPK+HL+MY
Subjt:  TEVP-PKVTITV---PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMY

Query:  CRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELS
        CRKM+AY  +DKLLIHCFQDSL+ PASRWYMQLD + +  WK+LADSFLKQYK+NIDMAPDRLDLQRMEKK+ E+FKEYAQRWR+ AAQVQPPL DKEL+
Subjt:  CRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELS

Query:  TMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSR
         MFINTL++P+YD+M+GSASTNFSD++TIGERIE+G+K+G+I D A  S  ++ +  KKKEGEV  +    R
Subjt:  TMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSR

KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.8e-17839.57Show/hide
Query:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI------
        WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +      
Subjt:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI------

Query:  -----------------------------------LC-----------GYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI
                                           LC           GYV+ +V+KLF ++E GV+PIIP+LAETFRSLNYCR KG         G+  
Subjt:  -----------------------------------LC-----------GYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI

Query:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKC----------------
         C PLLYIW+ SH+K P EF+CP + FSS WN ++N ISEF  + W  + P + AW +FF++L  E+VIW+A WM  + +IY+C                
Subjt:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKC----------------

Query:  ------VRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTEQ
              +RQ+W++QFIP TH LK      DKG                        ++  T  Y+ W ANR K ++ +S ++  + K  +   P+Q  E+
Subjt:  ------VRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTEQ

Query:  AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSE
        +       EL + N  L QENE+L+ E  Q +     L+ EL + K  +  Q++LE ++  LD E R +N+    ++ +    QAT+      +  + +E
Subjt:  AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSE

Query:  IASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNV
         +  ++++K   D +  +  T  ++++ R+ ++Y  L   Y ++  D+    ++   L  + +  +  +R V++RA+  AE A   S       P +   
Subjt:  IASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNV

Query:  LKFLGKLRISLDPAYNERPSNRQSTEGTTPKYHPLYN-------IPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKV
         ++  ++    D   ++      +      K   L +       +   Q   P  ++   P++  PGF            T  +++ P ++   T   ++
Subjt:  LKFLGKLRISLDPAYNERPSNRQSTEGTTPKYHPLYN-------IPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKV

Query:  SSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTH
         S +KL+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSL  PASRWY+QLD+ H
Subjt:  SSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTH

Query:  ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGI
        I  WK+LAD+FLKQYK NIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ IGERIEYGI
Subjt:  ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGI

Query:  KHGKIIDMAGS-STAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        KHG++ +        KKG  SKKKEGEV  IGF NS +     G+ +Y    P      YINN S    + YV
Subjt:  KHGKIIDMAGS-STAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]7.3e-13541.45Show/hide
Query:  PWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVS-PKMKTKIKLNAKVIP
        PW          +RQ+W++QFIP TH L+D +FAYD   C+ +  + V AWK + KI+    ++  T  Y+ W  NR K V+ +  ++  +++  +   P
Subjt:  PWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVS-PKMKTKIKLNAKVIP

Query:  DQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQV
        DQ  E+  +      L + N  L QENE+LQ E  Q +     L++EL + K  +  Q++LEK +  LD E R +N+    L+ +  + QAT++S+++ +
Subjt:  DQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQV

Query:  L---------------------KQQSEIA--------------SLH-ELMKELEDCISLRNQ--------TITEDATDRLMKDYTYLKEQYDRLSDDFGF
                              K++++I               SLH ++++  ED   L+N         T  +++++R++++Y  LK  Y ++  D+  
Subjt:  L---------------------KQQSEIA--------------SLH-ELMKELEDCISLRNQ--------TITEDATDRLMKDYTYLKEQYDRLSDDFGF

Query:  ARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFL----------GKLRISLDPAYNERPSNRQSTE------GTTPKYHP
         R++  TL  + +  +  +R V+RRA+  AE A  L       +P+S ++ +FL          GK ++ ++ A +  P   Q T+      G TP++  
Subjt:  ARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFL----------GKLRISLDPAYNERPSNRQSTE------GTTPKYHP

Query:  LYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPK
               QH           V   P F +P  VP      +  L+     +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVP +I+P K
Subjt:  LYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPK

Query:  FKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKE
        FKVP F+KYDG+SCP++HLIMYCRKMAA++ NDKLLIHCFQDSL GPA+RWY+QLD+ HI  WK+LAD+FLKQYKHNIDMAPDRLDLQRMEKKS+ESFKE
Subjt:  FKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKE

Query:  YAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGS-STAKKGVPSKKKEGEVQMIGFNSRQPYPN
        YAQRWRD AA+VQPPL DKE++ MF+NTL++PFYD+MIG+A+TNFSDI+ IGERIEYGIKHG++I+ +      KKG   KKKEGEV  IGF      PN
Subjt:  YAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGS-STAKKGVPSKKKEGEVQMIGFNSRQPYPN

Query:  AG
         G
Subjt:  AG

TrEMBL top hitse value%identityAlignment
A0A061E378 G-patch domain-containing protein1.2e-14634.38Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQV+  +LKA+++ WDP+YRCF F  +DM PTI
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLG----------------------------------------------------KILCGYVEERVVK
        EEY +LL + +    + Y   Q+   +R L+ ++G                                                    K+L G++E  ++ 
Subjt:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLG----------------------------------------------------KILCGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI
         F ++    +P   +LA+T RSLNYCR KG GRF+GCA LL IW++SH      F+C   KF   ++    PI EF +S W  +   +  W + F EL  
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI

Query:  EDVIWRAPWMSTRPMIYKCVRQIWV----------------------RQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYK
         +V WRAPWM   P++YKC  + WV                       QF+P TH L   EFAY +     +I++I +AWK  +++  G++ D+ T  Y+
Subjt:  EDVIWRAPWMSTRPMIYKCVRQIWV----------------------RQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYK

Query:  TWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEA
         WH  R K V V PK   +  L   V P+ +      E    +    N+S  Q  E LQ E ++      E  K++ +++G   K E L  + SA  +E 
Subjt:  TWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEA

Query:  RDLNRRMHRLRRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQ---------TITEDATDRLMKDYTYLKEQYDRL---------
          L R +    R+N+ ++  L++ ND    QV  QQ  I  L +  +ELE  ++   Q         T  ++  +   + YT    Q D L         
Subjt:  RDLNRRMHRLRRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQ---------TITEDATDRLMKDYTYLKEQYDRL---------

Query:  -------------SDDFGFARQNHATLRSKAEHML-------------TQIRRVTRRADELAEDARTLSKVIAP-----------TQPNSKNVLKFLGKL
                     S  F   +Q    ++   EH++             ++I R  + +DE   + R  ++++              Q   +  L  + +L
Subjt:  -------------SDDFGFARQNHATLRSKAEHML-------------TQIRRVTRRADELAEDARTLSKVIAP-----------TQPNSKNVLKFLGKL

Query:  RISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKEL
         +SL        +PA +E P  + S  +   P Y P +  P  Q     HP    P + N   P+ +QP    F   + + P  +I +P+LDDP+ +++L
Subjt:  RISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKEL

Query:  TG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPA
               G+     +K ++LEERLRA+EG D FG +DAT+LCLVPDV++P KFKVP+FEKYDG  CP  H+ MYCRKMAA   +DKLLI+ FQDSL G A
Subjt:  TG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPA

Query:  SRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDI
        +RWY+QLD   I +WK LA +F+ QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A  NF+D+
Subjt:  SRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDI

Query:  MTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        +  GE IE  IK GK I+    +++KKG   KKKEG+VQ +  +S+Q + N   P Y Y PPY    P+I N +    +PYV
Subjt:  MTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

A0A061E733 Gag-pro-like protein1.1e-12536.42Show/hide
Query:  NVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSL
        + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQ           H     +     S ++   + +    
Subjt:  NVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSL

Query:  LHMPVRTEVEAYSYDQ------ELTMKRALSTLL--GKILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVK
          +P  + +  Y   Q      +L M  A+  L+   K+L G+++  ++  F ++    +P   +LAET RSLNYCR KG GRF+GCA LL IW++SH  
Subjt:  LHMPVRTEVEAYSYDQ------ELTMKRALSTLL--GKILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVK

Query:  CPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAY-DKGFCKD
            F+C   KF   ++    PI EF +S W  +   +  W + F EL    V WRAPWM   P             F+P TH L   EFAY + GF K 
Subjt:  CPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAY-DKGFCKD

Query:  RIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVE
        RI++I +AWK  +++  G++ D+ T  Y+ WH  R K V V PK   +  L   V P+ +      E    +    N+S  Q  E LQ E ++      E
Subjt:  RIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVE

Query:  LEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQ---ATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRL
        + ++ N+++    K E L  + SA  +E          LRR+ +V +     L++ ND    QV  QQ  I  L +  +ELE  +     T  +   +RL
Subjt:  LEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQ---ATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRL

Query:  MKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDAR-TLSKVIAPTQPNSKNVLKFLGKLRISLDPAYNERPSNRQS-TEGT
         +  T ++E   +++     + Q     R +A  M  + R    R +   E+ R  L+K++      SK      GK  I  +PA +E P  + S  +  
Subjt:  MKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDAR-TLSKVIAPTQPNSKNVLKFLGKLRISLDPAYNERPSNRQS-TEGT

Query:  TPKYHPLYNIPVEQ-----HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKEL-----TGGEKVSSSEKLEVLEERLRAVEGT
         P Y P +  P  Q     HP    P + N   P+ +QP    F   + + P   I VP+LDDP+ +++L       GE     +K ++LEERLRA+EG 
Subjt:  TPKYHPLYNIPVEQ-----HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKEL-----TGGEKVSSSEKLEVLEERLRAVEGT

Query:  DVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNI
        D FG +DAT+LCLVPDV++P KFKVPEFEKYDG  CP  H+ MYCRKMAA   +DKLLIH FQDSL G A+RWY+QLD   I +WK+LA +F+ QYKH  
Subjt:  DVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNI

Query:  DMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVP
        ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A+ NF+D++  GE IE  IK GKI     +S+ K  + 
Subjt:  DMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVP

Query:  SKKKEG
         + K+G
Subjt:  SKKKEG

A0A061EXR3 G-patch domain-containing protein1.9e-14435.94Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTP--
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQ++  +LKA+++ WDP+YRCF F  IDM    
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTP--

Query:  ----------TIEEYQSLLHMPVRTEVEAYSYDQELTMKR------ALSTLLG--------KILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYC
                  T  E    L      E   +S+ +   MK+       L   LG        K+L G++E R++  F ++    +P   +LAET RSLNYC
Subjt:  ----------TIEEYQSLLHMPVRTEVEAYSYDQELTMKR------ALSTLLG--------KILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYC

Query:  RTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVR
        R KG GRF+GCA LL IW++SH      F+C   KF   ++    PI EF +S W  +   +  W +   EL   +V WRAPWM   P++YKC  + WV+
Subjt:  RTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVR

Query:  ----------------------QFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAK
                              QF+P TH L   EFAY++ GF K RI++I +AWK  +++  G++ D+ T  Y+ WH  R K V V PK      L   
Subjt:  ----------------------QFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAK

Query:  VIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMH-RLRRDNEVSQATLKSR
        V P+ +      E    +    N+S  Q  E LQ E ++      E  K++ +++G   K E L  + S   T   +L R +  R  R NE     L++ 
Subjt:  VIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMH-RLRRDNEVSQATLKSR

Query:  ND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTL
        ND    QV  QQ  I  L +  +ELE  ++   Q            +Y  LK+Q  R+ +     RQ +     + ++++ Q+R V  +A  +A     L
Subjt:  ND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTL

Query:  SKVIAPTQPNSKNVLKFLG--------------------------------KLRISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ---
           I P     + ++K+L                                 KL +SL        +PA +E P  + S  +   P Y P +  P  Q   
Subjt:  SKVIAPTQPNSKNVLKFLG--------------------------------KLRISL--------DPAYNERPSNRQS-TEGTTPKYHPLYNIPVEQ---

Query:  --HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDV
          HP      + N   P+ +QP    F     V P   I VP+LDDP+ +++L        E     +K ++LEERLRA+EG D FG +DAT+LCLVPDV
Subjt:  --HPF---PFFKNEQVPVHNQP---GFSLPTEVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDV

Query:  ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKST
        ++P KFKVPEFEKYD   CP  H+ M CRKMAA   +DKLLIH FQDSL G A+RWY+QLD   I +WK+LA +F+ QYKH  ++APDRL LQ MEKK +
Subjt:  ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKST

Query:  ESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQ
        E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A+ NF+D++  GE IE  IK GK I+    +++KKG   +KKEG+VQ +  +S+Q
Subjt:  ESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSRQ

Query:  PYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
         +       Y+  PPY    P+I N +    +PYV
Subjt:  PYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

A0A5A7SUT0 Reverse transcriptase6.0e-15937.04Show/hide
Query:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V QWSE+ Q   G S    +E  + +   + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTI
Subjt:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI-----------------------------------------LC-----------GYVEERVVK
        EEYQ+L+ +PV    + Y Y ++LT++R+LS  +G I                                         LC           GYVEE VVK
Subjt:  EEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI-----------------------------------------LC-----------GYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI
        +F  IE GV+P+IP++AETFRSLN+CR +G G+F  CAP+L+IW+ SH++ P  F  P+IKF+S WNK +N I EF  + W    PE  A+         
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKI

Query:  EDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKL
                 ++  P++   +RQ W +QF P    L+D+EF+Y+     D+I + V+AWK + K++S +  + TTE Y  W A+R   + ++P       +
Subjt:  EDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKL

Query:  NAKVIPDQQTEQ-AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL
        +  ++P+ + ++   +EKE   LR+ N  L  ENE+L+ EVK+ + + V   ++L+      +K+++LE  +                     EV Q ++
Subjt:  NAKVIPDQQTEQ-AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL

Query:  KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELA
        +    Q+++ +S+   L + +   E  +      I   A + +  DY  L E+Y  +S DF   +  + TLR K        E    ++R++ R AD+ +
Subjt:  KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELA

Query:  EDARTLSKVIAPTQPNSKNVLKFLG------KLRISLDPAYNE--------RPSNRQSTEGTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLP
          ARTL + + PT+   K +  FLG       LR  ++   ++             +S  GT+ +     N  +E  P   P F  ++         + P
Subjt:  EDARTLSKVIAPTQPNSKNVLKFLG------KLRISLDPAYNE--------RPSNRQSTEGTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLP

Query:  TEVP-PKVTITV---PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMY
        T  P P    T     + ++P     + GG+K+S    S  +LE LEERLR +E                            +FEKY+G SCPK+HL+MY
Subjt:  TEVP-PKVTITV---PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMY

Query:  CRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELS
        CRKM+AY  +DKLLIHCFQDSL+ PASRWYMQLD + +  WK+LADSFLKQYK+NIDMAPDRLDLQRMEKK+ E+FKEYAQRWR+ AAQVQPPL DKEL+
Subjt:  CRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELS

Query:  TMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSR
         MFINTL++P+YD+M+GSASTNFSD++TIGERIE+G+K+G+I D A  S  ++ +  KKKEGEV  +    R
Subjt:  TMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKKKEGEVQMIGFNSR

A0A5A7T1W2 Retrotrans_gag domain-containing protein8.9e-17939.57Show/hide
Query:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI------
        WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +      
Subjt:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPVRTEVEAYSYDQELTMKRALSTLLGKI------

Query:  -----------------------------------LC-----------GYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI
                                           LC           GYV+ +V+KLF ++E GV+PIIP+LAETFRSLNYCR KG         G+  
Subjt:  -----------------------------------LC-----------GYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI

Query:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKC----------------
         C PLLYIW+ SH+K P EF+CP + FSS WN ++N ISEF  + W  + P + AW +FF++L  E+VIW+A WM  + +IY+C                
Subjt:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKC----------------

Query:  ------VRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTEQ
              +RQ+W++QFIP TH LK      DKG                        ++  T  Y+ W ANR K ++ +S ++  + K  +   P+Q  E+
Subjt:  ------VRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTEQ

Query:  AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSE
        +       EL + N  L QENE+L+ E  Q +     L+ EL + K  +  Q++LE ++  LD E R +N+    ++ +    QAT+      +  + +E
Subjt:  AAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSE

Query:  IASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNV
         +  ++++K   D +  +  T  ++++ R+ ++Y  L   Y ++  D+    ++   L  + +  +  +R V++RA+  AE A   S       P +   
Subjt:  IASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNV

Query:  LKFLGKLRISLDPAYNERPSNRQSTEGTTPKYHPLYN-------IPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKV
         ++  ++    D   ++      +      K   L +       +   Q   P  ++   P++  PGF            T  +++ P ++   T   ++
Subjt:  LKFLGKLRISLDPAYNERPSNRQSTEGTTPKYHPLYN-------IPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKV

Query:  SSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTH
         S +KL+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSL  PASRWY+QLD+ H
Subjt:  SSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTH

Query:  ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGI
        I  WK+LAD+FLKQYK NIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ IGERIEYGI
Subjt:  ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGI

Query:  KHGKIIDMAGS-STAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        KHG++ +        KKG  SKKKEGEV  IGF NS +     G+ +Y    P      YINN S    + YV
Subjt:  KHGKIIDMAGS-STAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAGCAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGGAAGGCTAAATTTACAAGCAAATATGGCTATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAGTACGAACGGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCTATGTGGATATGT
TGAGGAACGCGTGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCCTGTATTGGCAGAAACTTTTCGTTCATTAAATTATTGCAGAACAAAAGGAA
CAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAAT
AAGCTGCAAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTCCGAACTAAAGATAGAAGATGTAAT
ATGGAGAGCCCCTTGGATGTCAACTAGGCCAATGATCTATAAATGTGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTG
CTTATGATAAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTCAGTTTCATGATGATACCACAGAGGCGTAC
AAAACATGGCATGCGAACAGAGCTAAAACCGTGCTTGTGTCACCAAAAATGAAAACCAAGATAAAACTTAATGCAAAGGTGATACCAGATCAACAGACAGAACAAGCAGC
ACGCGAAAAGGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGTTGCAATTGGAGGTAAAGCAAGGTTTGTTGCGCAATGTTGAACTAG
AAAAAGAGTTGAACCGATTAAAGGGCAGTGTTAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCGTTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGA
TTAAGAAGGGATAATGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCCTCACTTCATGAGTTGATGAAAGAGCTCGA
AGATTGCATTAGTTTGAGGAACCAAACGATTACTGAGGATGCCACTGATAGATTAATGAAAGACTATACCTATTTAAAGGAGCAGTACGACAGATTAAGCGATGATTTTG
GGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACT
CTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATGTGCTCAAGTTTCTGGGAAAACTTCGTATAAGTTTAGATCCAGCATACAATGAACGACCCTCCAATCG
GCAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGCACCCGTTTCCATTTTTCAAGAATGAGCAAGTGCCTGTACACAATCAAC
CTGGATTTTCACTACCCACAGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCT
AGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCC
TCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGCCCTAAAAACCATCTCATCATGTATTGCAGGAAGATGGCAGCATACGTCCAAAATGACAAAC
TGTTAATTCACTGCTTCCAGGACAGTCTTATTGGTCCAGCATCTCGATGGTATATGCAATTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTA
AAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGAAAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATAC
TGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAGCTGTCAACCATGTTTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGTGCCTCTACCAATT
TCTCTGACATAATGACAATTGGAGAGAGAATCGAGTACGGAATTAAGCATGGAAAGATAATTGATATGGCTGGATCATCGACAGCGAAAAAGGGGGTTCCATCGAAGAAA
AAAGAGGGAGAAGTTCAGATGATTGGTTTCAATTCAAGACAACCATACCCTAATGCCGGAGTGCCACAATATCACTATCCACCTCCATATGTTTACCCTCAACCATATAT
CAATAATACGTCAGCCCAATACTCATCCCCTTACGTCCAAATCCTCGTCCTACTCAAGGCTACCAGCCTCGGAATCAACAGAACACTCCTTATGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAGCAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGGAAGGCTAAATTTACAAGCAAATATGGCTATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAGTACGAACGGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCTATGTGGATATGT
TGAGGAACGCGTGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCCTGTATTGGCAGAAACTTTTCGTTCATTAAATTATTGCAGAACAAAAGGAA
CAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAAT
AAGCTGCAAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTCCGAACTAAAGATAGAAGATGTAAT
ATGGAGAGCCCCTTGGATGTCAACTAGGCCAATGATCTATAAATGTGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTG
CTTATGATAAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTCAGTTTCATGATGATACCACAGAGGCGTAC
AAAACATGGCATGCGAACAGAGCTAAAACCGTGCTTGTGTCACCAAAAATGAAAACCAAGATAAAACTTAATGCAAAGGTGATACCAGATCAACAGACAGAACAAGCAGC
ACGCGAAAAGGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGTTGCAATTGGAGGTAAAGCAAGGTTTGTTGCGCAATGTTGAACTAG
AAAAAGAGTTGAACCGATTAAAGGGCAGTGTTAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCGTTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGA
TTAAGAAGGGATAATGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCCTCACTTCATGAGTTGATGAAAGAGCTCGA
AGATTGCATTAGTTTGAGGAACCAAACGATTACTGAGGATGCCACTGATAGATTAATGAAAGACTATACCTATTTAAAGGAGCAGTACGACAGATTAAGCGATGATTTTG
GGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACT
CTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATGTGCTCAAGTTTCTGGGAAAACTTCGTATAAGTTTAGATCCAGCATACAATGAACGACCCTCCAATCG
GCAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGCACCCGTTTCCATTTTTCAAGAATGAGCAAGTGCCTGTACACAATCAAC
CTGGATTTTCACTACCCACAGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCT
AGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCC
TCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGCCCTAAAAACCATCTCATCATGTATTGCAGGAAGATGGCAGCATACGTCCAAAATGACAAAC
TGTTAATTCACTGCTTCCAGGACAGTCTTATTGGTCCAGCATCTCGATGGTATATGCAATTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTA
AAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGAAAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATAC
TGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAGCTGTCAACCATGTTTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGTGCCTCTACCAATT
TCTCTGACATAATGACAATTGGAGAGAGAATCGAGTACGGAATTAAGCATGGAAAGATAATTGATATGGCTGGATCATCGACAGCGAAAAAGGGGGTTCCATCGAAGAAA
AAAGAGGGAGAAGTTCAGATGATTGGTTTCAATTCAAGACAACCATACCCTAATGCCGGAGTGCCACAATATCACTATCCACCTCCATATGTTTACCCTCAACCATATAT
CAATAATACGTCAGCCCAATACTCATCCCCTTACGTCCAAATCCTCGTCCTACTCAAGGCTACCAGCCTCGGAATCAACAGAACACTCCTTATGTCCTAG
Protein sequenceShow/hide protein sequence
MGIEDQATVRQWSENVQQIHGDSLVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQS
LLHMPVRTEVEAYSYDQELTMKRALSTLLGKILCGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWN
KLQNPISEFVQSGWSSSSPERSAWEAFFSELKIEDVIWRAPWMSTRPMIYKCVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAY
KTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHR
LRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDART
LSKVIAPTQPNSKNVLKFLGKLRISLDPAYNERPSNRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSS
SEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFL
KQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDMAGSSTAKKGVPSKK
KEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYVQILVLLKATSLGINRTLLMS