; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006743 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006743
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:45487015..45490660
RNA-Seq ExpressionLag0006743
SyntenyLag0006743
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOX99415.1 Uncharacterized protein TCM_008095 [Theobroma cacao]1.1e-15133.58Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQV+  +LKA+++ WDP+YRCF F  +DM PTI
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK
        EEY +LL +      + Y   Q+   +R L+ ++G I ++++++ ++ KG+N C+P  ++ +   K  + ++   ++AL I+ +V+FPK+ G++E  ++ 
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS
         F ++    +P   +LA+T RSLNYCR KG GRF+GCA LL IW++SH +C         HP +   R                        + +V   +
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS

Query:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANR
          +P      + G  P  PL G  G +       R RQ    QF+P TH L   EFAY +     +I++I +AWK  +++  G++ D+ T  Y+ WH  R
Subjt:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANR

Query:  AKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAA-------REKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRL
         K V+          V P+ +  +  +       + E A+        +KEC+++++  S   ++ +++Q    K E L  + SA  +E   L R +   
Subjt:  AKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAA-------REKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRL

Query:  RRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRV
         R+N+ ++  L++ ND    QV  QQ  I  L +  +ELE  ++   Q            +Y  LK+Q  R+ +     RQ +    ++ ++++ Q+R V
Subjt:  RRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRV

Query:  TRRADELAEDARTLSKVIAPTQPNSKNVLKFLGK--------------------LPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMD
          +A  +A     L   I P     + ++K+L +                     P  P   R + TR R RIME++  E    ME+ ++++ E   KM 
Subjt:  TRRADELAEDARTLSKVIAPTQPNSKNVLKFLGK--------------------LPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMD

Query:  AILVALERGK--------------------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRK
         ++++L +GK                                            +IP +                    + + P  +I +P+LDDP+ ++
Subjt:  AILVALERGK--------------------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRK

Query:  ELTG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTG
        +L       G+     +K ++LEERLRA+EG D FG +DAT+LCLVPDV++P KFKVP+FEKYDG  CP  H+ MYCRKMAA   +DKLLI+ FQDSLTG
Subjt:  ELTG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTG

Query:  PASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFS
         A+RWY+QLD   I +WK LA +F+ QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A  NF+
Subjt:  PASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFS

Query:  DIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        D++  GE IE  IK GK I+    +++KKG   KKKEG+VQ +  +S+Q + N   P Y Y PPY    P+I N +    +PYV
Subjt:  DIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

EOY09468.1 Uncharacterized protein TCM_024883 [Theobroma cacao]1.0e-13933.11Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQ++  +LKA+++ WDP+YRCF F  IDM    
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK
                              +   +R L+ ++G I ++++++ ++ KG+N C+P  ++ +   K  + ++   ++AL I+ +V+FPK+ G++E R++ 
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS
         F ++    +P   +LAET RSLNYCR KG GRF+GCA LL IW++SH +C         HP +   R                        + +V   +
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS

Query:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHAN
          +P      + G  P   L G  G++       R RQ    QF+P TH L   EFAY++ GF K RI++I +AWK  +++  G++ D+ T  Y+ WH  
Subjt:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHAN

Query:  RAKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQE-QLEKEISALDTEARDLNRRMH-RLRRDN
        R K V+          V P+ +  +  +       + E A+ ++  ++L+K    + +E    +  V K E + E       T   +L R +  R  R N
Subjt:  RAKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQE-QLEKEISALDTEARDLNRRMH-RLRRDN

Query:  EVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRA
        E     L++ ND    QV  QQ  I  L +  +ELE  ++   Q            +Y  LK+Q  R+ +     RQ +     + ++++ Q+R V  +A
Subjt:  EVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRA

Query:  DELAEDARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMDAILVALERGK--------------
          +A     L   I P     + ++K+L +               R RIMEE+  E    ME+ ++++ E   KM  ++++L +GK              
Subjt:  DELAEDARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMDAILVALERGK--------------

Query:  ------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEE
                                      ++P +                      V P   I VP+LDDP+ +++L        E     +K ++LEE
Subjt:  ------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEE

Query:  RLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSF
        RLRA+EG D FG +DAT+LCLVPDV++P KFKVPEFEKYD   CP  H+ M CRKMAA   +DKLLIH FQDSLTG A+RWY+QLD   I +WK+LA +F
Subjt:  RLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSF

Query:  LKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGS
        + QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A+ NF+D++  GE IE  IK GK I+    
Subjt:  LKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGS

Query:  STAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        +++KKG   +KKEG+VQ +  +S+Q +       Y+  PPY    P+I N +    +PYV
Subjt:  STAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]1.3e-16838.16Show/hide
Query:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V QWSE+ Q   G S    +E  + +   + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTI
Subjt:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK
        EEYQ+L+ +P     + Y Y ++LT++R+LS  +G I  S+++KQ+K K    C+P++Y++ L +      K L+L+ALCI+  V+FP+I GYVEE VVK
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQV----GVH----PPLKGALGS
        +F  IE GV+P+IP++AETFRSLN+CR +G G+F  CAP+L+IW+ SH++ +PN      NF  P I    P   + N +      H     P  GA G 
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQV----GVH----PPLKGALGS

Query:  LFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQT
            +    LRQ W +QF P    L+D+EF+Y+     D+I + V+AWK + K++S +  + TTE Y  W A+R   + ++P       ++  ++P+ + 
Subjt:  LFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQT

Query:  EQ-AAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLN---RRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCI
        ++   +EKE   LR+ N  L  ENE+L      +E+++K +       RDL+   R+        EV Q +++    Q+++ +S+   L + +   E  +
Subjt:  EQ-AAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLN---RRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCI

Query:  SLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLGKLP
              I   A + +  DY  L E+Y  +S DF   +  + TLR K        E    ++R++ R AD+ +  ARTL + + PT+   K +  FLG + 
Subjt:  SLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLGKLP

Query:  RSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGK-----------------IIPDIAQVPPKVT-----------------ITV
        +   +                       R+D+E L++++  IL  L  G+                 ++ D+   PP  T                    
Subjt:  RSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGK-----------------IIPDIAQVPPKVT-----------------ITV

Query:  PN----------LDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMA
        PN           ++P     + GG+K+S    S  +LE LEERLR +E                            +FEKY+G SCPK+HL+MYCRKM+
Subjt:  PN----------LDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMA

Query:  AYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFIN
        AY  +DKLLIHCFQDSL  PASRWYMQLD + +  WK+LADSFLKQYK+NIDMAPDRLDLQRMEKK+ E+FKEYAQRWR+ AAQVQPPL DKEL+ MFIN
Subjt:  AYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFIN

Query:  TLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSR
        TL++P+YD+M+GSASTNFSD++TIGERIE+G+K+G+I D A  S  ++ +  KKKEGEV  +    R
Subjt:  TLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSR

KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.1e-18140.66Show/hide
Query:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  KQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI
        K +K KG    +P DY++ + Q + +EDK LTLLALCI+  V+FPK  GYV+ +V+KLF ++E GV+PIIP+LAETFRSLNYCR KG         G+  
Subjt:  KQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI

Query:  GCAPLLYIWVLSHVKCHPNSNVQRSNFQVP---------------------------GISCKIPSQSSCNQVGVHP--------------PLKGALGSLF
         C PLLYIW+ SH+K        R +F  P                               K+ S++   +    P              PL G  G + 
Subjt:  GCAPLLYIWVLSHVKCHPNSNVQRSNFQVP---------------------------GISCKIPSQSSCNQVGVHP--------------PLKGALGSLF

Query:  LRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTE
          T    LRQ+W++QFIP TH LK      DKG                        ++  T  Y+ W ANR K ++ +S ++  + K  +   P+Q  E
Subjt:  LRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTE

Query:  QAAR--------EKECDELRKANSS-------LVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLH
        ++          E+E ++LRK  S        L  E E+ +  +  Q++LE ++  LD E R +N+    ++ +    QAT+      +  + +E +  +
Subjt:  QAAR--------EKECDELRKANSS-------LVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLH

Query:  ELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLG
        +++K   D +  +  T  ++++ R+ ++Y  L   Y ++  D+    ++   L  + +  +  +R V++RA+  AE                     + G
Subjt:  ELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLG

Query:  KLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAI--LVALERGKIIPDIAQ------------VPPKVTITVPNLDDPEIRKELTGGEK
        K      I   Y TRY+++IMEE+  +M+K R++I  L E++  I  L+++ +GK   D  Q             PP    T  +++ P ++   T   +
Subjt:  KLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAI--LVALERGKIIPDIAQ------------VPPKVTITVPNLDDPEIRKELTGGEK

Query:  VSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDST
        + S +KL+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSLT PASRWY+QLD+ 
Subjt:  VSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDST

Query:  HICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYG
        HI  WK+LAD+FLKQYK NIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ IGERIEYG
Subjt:  HICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYG

Query:  IKHGKIID-TAGSSTAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        IKHG++ + T      KKG  SKKKEGEV  IGF NS +     G+ +Y    P      YINN S    + YV
Subjt:  IKHGKIID-TAGSSTAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]2.2e-12641.06Show/hide
Query:  LRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVS-PKMKTKIKLNAKVIPDQQTEQAAR---
        LRQ+W++QFIP TH L+D +FAYD   C+ +  + V AWK + KI+    ++  T  Y+ W  NR K V+ +  ++  +++  +   PDQ  E+  +   
Subjt:  LRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVS-PKMKTKIKLNAKVIPDQQTEQAAR---

Query:  -----EKECDELRKANSS-------LVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVL-----------------
             E+E ++L+K  S        L +E E+ +  +  Q++LEK +  LD E R +N+    L+ +  + QAT++S+++ +                  
Subjt:  -----EKECDELRKANSS-------LVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVL-----------------

Query:  ----KQQSEIA--------------SLH-ELMKELEDCISLRNQ--------TITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQ
            K++++I               SLH ++++  ED   L+N         T  +++++R++++Y  LK  Y ++  D+   R++  TL  + +  +  
Subjt:  ----KQQSEIA--------------SLH-ELMKELEDCISLRNQ--------TITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQ

Query:  IRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKF-------LGKLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGKIIP
        +R V+RRA+  AE A  L       +P+S ++ +F       LG   +   +  T  +    +  ++       T + +   + +     VA+       
Subjt:  IRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKF-------LGKLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGKIIP

Query:  DIAQVPPKVTITVPNLDDPEIR---KELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRK
         +  VPP     VP+++  E +   +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVP +I+P KFKVP F+KYDG+SCP++HLIMYCRK
Subjt:  DIAQVPPKVTITVPNLDDPEIR---KELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRK

Query:  MAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMF
        MAA++ NDKLLIHCFQDSLTGPA+RWY+QLD+ HI  WK+LAD+FLKQYKHNIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE++ MF
Subjt:  MAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMF

Query:  INTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGS-STAKKGVPSKKKEGEVQMIGFNSRQPYPNAG
        +NTL++PFYD+MIG+A+TNFSDI+ IGERIEYGIKHG++I+T+      KKG   KKKEGEV  IGF      PN G
Subjt:  INTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGS-STAKKGVPSKKKEGEVQMIGFNSRQPYPNAG

TrEMBL top hitse value%identityAlignment
A0A061E378 G-patch domain-containing protein5.5e-15233.58Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQV+  +LKA+++ WDP+YRCF F  +DM PTI
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK
        EEY +LL +      + Y   Q+   +R L+ ++G I ++++++ ++ KG+N C+P  ++ +   K  + ++   ++AL I+ +V+FPK+ G++E  ++ 
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS
         F ++    +P   +LA+T RSLNYCR KG GRF+GCA LL IW++SH +C         HP +   R                        + +V   +
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS

Query:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANR
          +P      + G  P  PL G  G +       R RQ    QF+P TH L   EFAY +     +I++I +AWK  +++  G++ D+ T  Y+ WH  R
Subjt:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANR

Query:  AKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAA-------REKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRL
         K V+          V P+ +  +  +       + E A+        +KEC+++++  S   ++ +++Q    K E L  + SA  +E   L R +   
Subjt:  AKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAA-------REKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRL

Query:  RRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRV
         R+N+ ++  L++ ND    QV  QQ  I  L +  +ELE  ++   Q            +Y  LK+Q  R+ +     RQ +    ++ ++++ Q+R V
Subjt:  RRDNEVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRV

Query:  TRRADELAEDARTLSKVIAPTQPNSKNVLKFLGK--------------------LPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMD
          +A  +A     L   I P     + ++K+L +                     P  P   R + TR R RIME++  E    ME+ ++++ E   KM 
Subjt:  TRRADELAEDARTLSKVIAPTQPNSKNVLKFLGK--------------------LPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMD

Query:  AILVALERGK--------------------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRK
         ++++L +GK                                            +IP +                    + + P  +I +P+LDDP+ ++
Subjt:  AILVALERGK--------------------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRK

Query:  ELTG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTG
        +L       G+     +K ++LEERLRA+EG D FG +DAT+LCLVPDV++P KFKVP+FEKYDG  CP  H+ MYCRKMAA   +DKLLI+ FQDSLTG
Subjt:  ELTG-----GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTG

Query:  PASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFS
         A+RWY+QLD   I +WK LA +F+ QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A  NF+
Subjt:  PASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFS

Query:  DIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        D++  GE IE  IK GK I+    +++KKG   KKKEG+VQ +  +S+Q + N   P Y Y PPY    P+I N +    +PYV
Subjt:  DIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

A0A061EXR3 G-patch domain-containing protein4.9e-14033.11Show/hide
Query:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V Q   + QQ  GD L ++  S   D   +   ++    +   W+K     +A F  KYG++A+L+ VQ++  +LKA+++ WDP+YRCF F  IDM    
Subjt:  VRQWSENVQQIHGDSLVENVVSQFKD---VSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK
                              +   +R L+ ++G I ++++++ ++ KG+N C+P  ++ +   K  + ++   ++AL I+ +V+FPK+ G++E R++ 
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS
         F ++    +P   +LAET RSLNYCR KG GRF+GCA LL IW++SH +C         HP +   R                        + +V   +
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKC---------HPNSNVQRS-----------------------NFQVPGIS

Query:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHAN
          +P      + G  P   L G  G++       R RQ    QF+P TH L   EFAY++ GF K RI++I +AWK  +++  G++ D+ T  Y+ WH  
Subjt:  CKIPSQSSCNQVGVHP--PLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHAN

Query:  RAKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQE-QLEKEISALDTEARDLNRRMH-RLRRDN
        R K V+          V P+ +  +  +       + E A+ ++  ++L+K    + +E    +  V K E + E       T   +L R +  R  R N
Subjt:  RAKTVL----------VSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQE-QLEKEISALDTEARDLNRRMH-RLRRDN

Query:  EVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRA
        E     L++ ND    QV  QQ  I  L +  +ELE  ++   Q            +Y  LK+Q  R+ +     RQ +     + ++++ Q+R V  +A
Subjt:  EVSQATLKSRND----QVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRA

Query:  DELAEDARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMDAILVALERGK--------------
          +A     L   I P     + ++K+L +               R RIMEE+  E    ME+ ++++ E   KM  ++++L +GK              
Subjt:  DELAEDARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTE----MEKTRKDIEELREKMDAILVALERGK--------------

Query:  ------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEE
                                      ++P +                      V P   I VP+LDDP+ +++L        E     +K ++LEE
Subjt:  ------------------------------IIPDI--------------------AQVPPKVTITVPNLDDPEIRKELTGG-----EKVSSSEKLEVLEE

Query:  RLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSF
        RLRA+EG D FG +DAT+LCLVPDV++P KFKVPEFEKYD   CP  H+ M CRKMAA   +DKLLIH FQDSLTG A+RWY+QLD   I +WK+LA +F
Subjt:  RLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSF

Query:  LKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGS
        + QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ +FINTL++PFY+++IG+A+ NF+D++  GE IE  IK GK I+    
Subjt:  LKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGS

Query:  STAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        +++KKG   +KKEG+VQ +  +S+Q +       Y+  PPY    P+I N +    +PYV
Subjt:  STAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

A0A5A7SUT0 Reverse transcriptase6.5e-16938.16Show/hide
Query:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V QWSE+ Q   G S    +E  + +   + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTI
Subjt:  VRQWSENVQQIHGDS---LVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK
        EEYQ+L+ +P     + Y Y ++LT++R+LS  +G I  S+++KQ+K K    C+P++Y++ L +      K L+L+ALCI+  V+FP+I GYVEE VVK
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVK

Query:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQV----GVH----PPLKGALGS
        +F  IE GV+P+IP++AETFRSLN+CR +G G+F  CAP+L+IW+ SH++ +PN      NF  P I    P   + N +      H     P  GA G 
Subjt:  LFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQV----GVH----PPLKGALGS

Query:  LFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQT
            +    LRQ W +QF P    L+D+EF+Y+     D+I + V+AWK + K++S +  + TTE Y  W A+R   + ++P       ++  ++P+ + 
Subjt:  LFLRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQT

Query:  EQ-AAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLN---RRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCI
        ++   +EKE   LR+ N  L  ENE+L      +E+++K +       RDL+   R+        EV Q +++    Q+++ +S+   L + +   E  +
Subjt:  EQ-AAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLN---RRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCI

Query:  SLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLGKLP
              I   A + +  DY  L E+Y  +S DF   +  + TLR K        E    ++R++ R AD+ +  ARTL + + PT+   K +  FLG + 
Subjt:  SLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSK-------AEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLGKLP

Query:  RSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGK-----------------IIPDIAQVPPKVT-----------------ITV
        +   +                       R+D+E L++++  IL  L  G+                 ++ D+   PP  T                    
Subjt:  RSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGK-----------------IIPDIAQVPPKVT-----------------ITV

Query:  PN----------LDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMA
        PN           ++P     + GG+K+S    S  +LE LEERLR +E                            +FEKY+G SCPK+HL+MYCRKM+
Subjt:  PN----------LDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMA

Query:  AYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFIN
        AY  +DKLLIHCFQDSL  PASRWYMQLD + +  WK+LADSFLKQYK+NIDMAPDRLDLQRMEKK+ E+FKEYAQRWR+ AAQVQPPL DKEL+ MFIN
Subjt:  AYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFIN

Query:  TLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSR
        TL++P+YD+M+GSASTNFSD++TIGERIE+G+K+G+I D A  S  ++ +  KKKEGEV  +    R
Subjt:  TLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAGSSTAKKGVPSKKKEGEVQMIGFNSR

A0A5A7T1W2 Retrotrans_gag domain-containing protein5.1e-18240.66Show/hide
Query:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  KQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI
        K +K KG    +P DY++ + Q + +EDK LTLLALCI+  V+FPK  GYV+ +V+KLF ++E GV+PIIP+LAETFRSLNYCR KG         G+  
Subjt:  KQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKG--------TGRFI

Query:  GCAPLLYIWVLSHVKCHPNSNVQRSNFQVP---------------------------GISCKIPSQSSCNQVGVHP--------------PLKGALGSLF
         C PLLYIW+ SH+K        R +F  P                               K+ S++   +    P              PL G  G + 
Subjt:  GCAPLLYIWVLSHVKCHPNSNVQRSNFQVP---------------------------GISCKIPSQSSCNQVGVHP--------------PLKGALGSLF

Query:  LRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTE
          T    LRQ+W++QFIP TH LK      DKG                        ++  T  Y+ W ANR K ++ +S ++  + K  +   P+Q  E
Subjt:  LRTKDRRLRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTE

Query:  QAAR--------EKECDELRKANSS-------LVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLH
        ++          E+E ++LRK  S        L  E E+ +  +  Q++LE ++  LD E R +N+    ++ +    QAT+      +  + +E +  +
Subjt:  QAAR--------EKECDELRKANSS-------LVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLH

Query:  ELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLG
        +++K   D +  +  T  ++++ R+ ++Y  L   Y ++  D+    ++   L  + +  +  +R V++RA+  AE                     + G
Subjt:  ELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNVLKFLG

Query:  KLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAI--LVALERGKIIPDIAQ------------VPPKVTITVPNLDDPEIRKELTGGEK
        K      I   Y TRY+++IMEE+  +M+K R++I  L E++  I  L+++ +GK   D  Q             PP    T  +++ P ++   T   +
Subjt:  KLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAI--LVALERGKIIPDIAQ------------VPPKVTITVPNLDDPEIRKELTGGEK

Query:  VSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDST
        + S +KL+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSLT PASRWY+QLD+ 
Subjt:  VSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDST

Query:  HICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYG
        HI  WK+LAD+FLKQYK NIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ IGERIEYG
Subjt:  HICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYG

Query:  IKHGKIID-TAGSSTAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV
        IKHG++ + T      KKG  SKKKEGEV  IGF NS +     G+ +Y    P      YINN S    + YV
Subjt:  IKHGKIID-TAGSSTAKKGVPSKKKEGEVQMIGF-NSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYV

A0A5A7UL51 Girdin-like4.9e-12435.13Show/hide
Query:  EDQATVRQWSENVQQIHGDSLVE----NVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSI
        ++ + V +W+E +QQ  GD +      +V+S+ + +S  ++ L  +K  WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I  WDPAY CFTFGS 
Subjt:  EDQATVRQWSENVQQIHGDSLVE----NVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSI

Query:  DMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYV
        D+ PTIEEYQ++L MP +     Y ++ + T K           T++I+K +K+KG    +P DY++ + Q + +EDK LTLLALCI+  V+FPK  GYV
Subjt:  DMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYV

Query:  EERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQV-GVHPPLKGALGSL
        + +V+KLF ++E GV+PIIP+LA+TFRSLNYCR KG G+F  C PLLYIW+ SH+K        R +F  P    +      C  V     P K A  S 
Subjt:  EERVVKLFAKIEIGVDPIIPVLAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQV-GVHPPLKGALGSL

Query:  FLRTKDRR---------------------------------------LRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDD
        F +                                            LRQ+W++QFIP T  L++ +F+Y+   C+ + ++ V AWK I KI+     + 
Subjt:  FLRTKDRR---------------------------------------LRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGQFHDD

Query:  TTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNE
         T  Y+ W AN+ K ++                 D   E   R KE        +S  Q N+ ++ S+  +E+                           
Subjt:  TTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNE

Query:  VSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAE
                  +++L+Q++E     +L KE                T + M   TYL+                                           
Subjt:  VSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAE

Query:  DARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQVPPKVTITVPNLDDPE
                                                          E+EKT K IE            LE    I DI Q                
Subjt:  DARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQVPPKVTITVPNLDDPE

Query:  IRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPA
                E   + +KL+VLEERLRA+E TDV+GN DAT+LCLVP +I+P KFKVPEF+KYDG++CP+ HLIMYCRKMAA++ NDKLL+HCFQDSLTGPA
Subjt:  IRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPA

Query:  SRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYA
        SRWY+QLD+ HI  WK+LAD+FLKQY   IDMAPDRLDLQRMEKKS+ESFKEYA
Subjt:  SRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAGCAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGGAAGGCTAAATTTACAAGCAAATATGGCTATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACGAGCGACAT
TGAGAAACAAGTAAAGATAAAAGGAGAGAACACATGCCTACCCCTAGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAGTTAACTTTACTGGCAT
TGTGTATCTTTAATGTTGTTTTGTTTCCTAAAATATGTGGATATGTTGAGGAACGCGTGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCCTGTA
TTGGCAGAAACTTTTCGTTCATTGAATTATTGCAGAACAAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGCCA
CCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCAAAATCCCATCTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAG
CGCTTGGGAGCCTTTTTCTCCGAACTAAAGATAGAAGATTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGAT
AAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTCAGTTTCATGATGATACCACAGAGGCGTACAAAACATG
GCATGCGAACAGAGCTAAAACCGTGCTTGTGTCACCAAAAATGAAAACCAAGATAAAACTTAATGCAAAGGTGATACCAGATCAACAGACAGAACAAGCAGCACGCGAAA
AGGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGTTGCAATTGAGTGTTAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTA
GACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAATGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATC
TGAGATTGCCTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGAGGAACCAAACGATTACTGAGGATGCCACTGATAGATTAATGAAAGACTATACCT
ATTTAAAGGAGCAGTACGACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTC
ACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATGTGCTCAAGTTTCTGGGAAAACTTCCTCG
GTCACCTCGAATCCACCGCACCTACGTCACAAGATACAGGACAAGGATCATGGAAGAGCAGAGTACTGAGATGGAGAAAACAAGGAAAGATATTGAGGAGTTACGAGAAA
AAATGGATGCCATTCTTGTCGCCCTAGAAAGAGGCAAAATAATACCTGATATTGCTCAAGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATC
AGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCTAGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGC
GACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAAAACCATCTCATCATGTATT
GTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCAT
ATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGA
AAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAGCTGTCAACCATGTTTATTAATACTCTCAAATCTCCTT
TCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTGGAGAGAGAATCGAGTACGGAATTAAGCATGGAAAGATAATTGATACGGCTGGA
TCATCAACAGCGAAAAAGGGGGTTCCATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGGTTTCAATTCAAGACAACCATACCCTAATGCCGGAGTGCCACAATATCA
CTATCCACCTCCATATGTTTACCCTCAACCATATATCAATAATACGTCAGCCCAATACTCATCCCCTTACGTCCAAATCCTCGTCCTACTCAAGGCTACCAGCCTCGGAA
TCAACAGAACACTCCTTATGTCCAGGACACCAAAACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAGCAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGGAAGGCTAAATTTACAAGCAAATATGGCTATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACGAGCGACAT
TGAGAAACAAGTAAAGATAAAAGGAGAGAACACATGCCTACCCCTAGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAGTTAACTTTACTGGCAT
TGTGTATCTTTAATGTTGTTTTGTTTCCTAAAATATGTGGATATGTTGAGGAACGCGTGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCCTGTA
TTGGCAGAAACTTTTCGTTCATTGAATTATTGCAGAACAAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGCCA
CCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCAAAATCCCATCTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAG
CGCTTGGGAGCCTTTTTCTCCGAACTAAAGATAGAAGATTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGAT
AAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTCAGTTTCATGATGATACCACAGAGGCGTACAAAACATG
GCATGCGAACAGAGCTAAAACCGTGCTTGTGTCACCAAAAATGAAAACCAAGATAAAACTTAATGCAAAGGTGATACCAGATCAACAGACAGAACAAGCAGCACGCGAAA
AGGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGTTGCAATTGAGTGTTAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTA
GACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAATGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATC
TGAGATTGCCTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGAGGAACCAAACGATTACTGAGGATGCCACTGATAGATTAATGAAAGACTATACCT
ATTTAAAGGAGCAGTACGACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTC
ACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATGTGCTCAAGTTTCTGGGAAAACTTCCTCG
GTCACCTCGAATCCACCGCACCTACGTCACAAGATACAGGACAAGGATCATGGAAGAGCAGAGTACTGAGATGGAGAAAACAAGGAAAGATATTGAGGAGTTACGAGAAA
AAATGGATGCCATTCTTGTCGCCCTAGAAAGAGGCAAAATAATACCTGATATTGCTCAAGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATC
AGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCTAGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGC
GACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAAAACCATCTCATCATGTATT
GTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCAT
ATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGA
AAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAGCTGTCAACCATGTTTATTAATACTCTCAAATCTCCTT
TCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTGGAGAGAGAATCGAGTACGGAATTAAGCATGGAAAGATAATTGATACGGCTGGA
TCATCAACAGCGAAAAAGGGGGTTCCATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGGTTTCAATTCAAGACAACCATACCCTAATGCCGGAGTGCCACAATATCA
CTATCCACCTCCATATGTTTACCCTCAACCATATATCAATAATACGTCAGCCCAATACTCATCCCCTTACGTCCAAATCCTCGTCCTACTCAAGGCTACCAGCCTCGGAA
TCAACAGAACACTCCTTATGTCCAGGACACCAAAACAATAA
Protein sequenceShow/hide protein sequence
MGIEDQATVRQWSENVQQIHGDSLVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQS
LLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKICGYVEERVVKLFAKIEIGVDPIIPV
LAETFRSLNYCRTKGTGRFIGCAPLLYIWVLSHVKCHPNSNVQRSNFQVPGISCKIPSQSSCNQVGVHPPLKGALGSLFLRTKDRRLRQIWVRQFIPATHELKDFEFAYD
KGFCKDRIQKIVKAWKMITKIQSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTEQAAREKECDELRKANSSLVQENERLQLSVSKQEQLEKEISAL
DTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEDATDRLMKDYTYLKEQYDRLSDDFGFARQNHATLRSKAEHMLTQIRRV
TRRADELAEDARTLSKVIAPTQPNSKNVLKFLGKLPRSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQVPPKVTITVPNLDDPEI
RKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTH
ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYGIKHGKIIDTAG
SSTAKKGVPSKKKEGEVQMIGFNSRQPYPNAGVPQYHYPPPYVYPQPYINNTSAQYSSPYVQILVLLKATSLGINRTLLMSRTPKQ