; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036500 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036500
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr3:47496746..47498973
RNA-Seq ExpressionLag0036500
SyntenyLag0036500
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]6.7e-8638.09Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLH++AHDVKNRV++ +F RSGETVSRHFN VL AVLRL++ L+K+P  VT++C D RWK F+NCLGALDGTYIKVNV   DRP +RTRKGEIATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCD KGDF +VL GWEG                                    GF APY+G+RYHL EWRGA +APT  KE+FNMKHSSARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR---MASSSRDPKH-------------------------------------TWTKNKDAKLVETL---------------------------VS
        FG+LKGR   +   S  P                                       T T ++D + +ET                            + 
Subjt:  FGLLKGR---MASSSRDPKH-------------------------------------TWTKNKDAKLVETL---------------------------VS

Query:  LVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGT-------------MNLNASRQRKKHLTCG--
        LV   GW+S+NGTF+ GYL QL  +M EKL G  V A + ID R+ TLK+ +QAIAEMLG  CSG G                N   S    K L     
Subjt:  LVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGT-------------MNLNASRQRKKHLTCG--

Query:  ---SRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIV----T
             L     +  + G  +E   ++GSN      D     + +   P       ++ +D+ R +  S+       S GSKRKR        E +     
Subjt:  ---SRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIV----T

Query:  SFTEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLL
           EQL+ IAEW     A +   R +      E+PEL +  R  L   L   M     F  +    +  +C +LL
Subjt:  SFTEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLL

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]1.2e-10341.13Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI+AHDVK+RV++ +F RSGET+SRHFN VL AV+RLH+ LLKKP+ V   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCDTKGDF +VL GWEG                                    GF APYRG+RYHL EWRG  +AP+T KEFFNMKH SARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR-----------------------------------------------------------------------------MASSSRDPKHTWTKNK
        FG+LKGR                                                                             M SSSR PKHTWTK +
Subjt:  FGLLKGR-----------------------------------------------------------------------------MASSSRDPKHTWTKNK

Query:  DAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK-----------
        +A LVE LV LV+A GWRS+NGTF+ GYL QL  +M  K+ G ++ A S+IDSR+  +K+ + A+AEM G  CSG G           K           
Subjt:  DAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK-----------

Query:  -----KHLTCGSRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIA
             K       L     K  + G  +E   ++GSN        +A    DT  P    P  NMS D+   T  ++       S GSKRKR G  T   
Subjt:  -----KHLTCGSRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIA

Query:  EIVTSF----TEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH
        +IV +      EQL  IAEW   +R      R +IV H   +PEL    R +LM IL  N+   ++F  +   +K  YC L+L ++
Subjt:  EIVTSF----TEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH

KAA0033290.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.4e-8842Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI+AHD+KNR+++ +F RSGETVSRHFN VL +VLRLH+ LLKKP+LVT SC DPRWKWF+NCLGALD TYIKVNV   DRPRY TRKGE+A N
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCDTKGDF FVL GWEG                                    GF APYRGERYHLSEW G  +APTT +EFFNMKHSSARNVI+RA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG
        F LLKG  A           + K    V+     +  C                L NL+  ++   ++         +  L +     A   G   + + 
Subjt:  FGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG

Query:  GTMNLNASRQR-------KKHLTCGSRLGICLWKGSSHGEVSEVMGEMGS---NMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGR
         +   +  R +        K       L     K  +    SE   ++GS   NM + +V L  S + D  IP+      +MS DE  G    Q      
Subjt:  GTMNLNASRQR-------KKHLTCGSRLGICLWKGSSHGEVSEVMGEMGS---NMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGR

Query:  SSRGSKRKRSGSNTHIAEIVTSFTE----QLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLL
         S GSKRK+   +    E++ S  E    QLK+IA W KEKRA E + R +++    ++PEL +R R KL+ ILF +++A E F SI    KLEYC +LL
Subjt:  SSRGSKRKRSGSNTHIAEIVTSFTE----QLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLL

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]2.3e-10243.06Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLHD LLKKP+ V   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL V DTKGDF +VL GWEG                                    GF APYRG+RYHL EWRG  +AP+T KEFFNMKHSSARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR--------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLE
        FG+LKGR                                            M SSSR PKHTWTK ++A LVE    LV+A GWRS+NGTF+ GYL QL 
Subjt:  FGLLKGR--------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLE

Query:  NLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK----------------KHLTCGSRLGICLWKGSSHGEVSEVMGE
         +M  K+ G ++ A S+IDSR+  +K+ + A+AEM G  CSG G           K                K       L     K  + G  +E   +
Subjt:  NLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK----------------KHLTCGSRLGICLWKGSSHGEVSEVMGE

Query:  MGSNMSD--DEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIVTSF----TEQLKSIAEWLKEKRAIEADF
        +GSN     D     A P+ D   P++++   NMS D+   T  ++       S GSKRKR G  T   +IV +      EQL  IAEW   +R      
Subjt:  MGSNMSD--DEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIVTSF----TEQLKSIAEWLKEKRAIEADF

Query:  RDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH
        R +IV     +PEL    R +LM IL  N+   ++F  +   +K  YC ++L ++
Subjt:  RDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH

KAA0036474.1 retrotransposon protein [Cucumis melo var. makuwa]7.9e-8750Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI AHDVKNRV++ +F RSGETVSRHFN VL AVLRL++ L+K+P  VT++C D RWK F+NCLGALDGTYIKVNV   DRP +RTRKGEIATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCDTKGDF +VL GW+G                                    GF APYRG+RYHL EWRGA +APT  KE+FNMKHSSARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR------------------------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGW
        FG+LKGR                                                            M++S+R P+H WT+ ++  LVE L+ LV   GW
Subjt:  FGLLKGR------------------------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGW

Query:  RSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG
        +S+NGTF++GYL QL  +M EKL  Q V A + ID R+ TLK+ +QAIAEM G  CSG G
Subjt:  RSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG

TrEMBL top hitse value%identityAlignment
A0A5A7SQU2 Putative nuclease HARBI17.0e-8942Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI+AHD+KNR+++ +F RSGETVSRHFN VL +VLRLH+ LLKKP+LVT SC DPRWKWF+NCLGALD TYIKVNV   DRPRY TRKGE+A N
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCDTKGDF FVL GWEG                                    GF APYRGERYHLSEW G  +APTT +EFFNMKHSSARNVI+RA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG
        F LLKG  A           + K    V+     +  C                L NL+  ++   ++         +  L +     A   G   + + 
Subjt:  FGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG

Query:  GTMNLNASRQR-------KKHLTCGSRLGICLWKGSSHGEVSEVMGEMGS---NMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGR
         +   +  R +        K       L     K  +    SE   ++GS   NM + +V L  S + D  IP+      +MS DE  G    Q      
Subjt:  GTMNLNASRQR-------KKHLTCGSRLGICLWKGSSHGEVSEVMGEMGS---NMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGR

Query:  SSRGSKRKRSGSNTHIAEIVTSFTE----QLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLL
         S GSKRK+   +    E++ S  E    QLK+IA W KEKRA E + R +++    ++PEL +R R KL+ ILF +++A E F SI    KLEYC +LL
Subjt:  SSRGSKRKRSGSNTHIAEIVTSFTE----QLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLL

A0A5A7SWD8 Retrotransposon protein1.1e-10243.06Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLHD LLKKP+ V   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL V DTKGDF +VL GWEG                                    GF APYRG+RYHL EWRG  +AP+T KEFFNMKHSSARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR--------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLE
        FG+LKGR                                            M SSSR PKHTWTK ++A LVE    LV+A GWRS+NGTF+ GYL QL 
Subjt:  FGLLKGR--------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLE

Query:  NLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK----------------KHLTCGSRLGICLWKGSSHGEVSEVMGE
         +M  K+ G ++ A S+IDSR+  +K+ + A+AEM G  CSG G           K                K       L     K  + G  +E   +
Subjt:  NLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK----------------KHLTCGSRLGICLWKGSSHGEVSEVMGE

Query:  MGSNMSD--DEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIVTSF----TEQLKSIAEWLKEKRAIEADF
        +GSN     D     A P+ D   P++++   NMS D+   T  ++       S GSKRKR G  T   +IV +      EQL  IAEW   +R      
Subjt:  MGSNMSD--DEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIVTSF----TEQLKSIAEWLKEKRAIEADF

Query:  RDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH
        R +IV     +PEL    R +LM IL  N+   ++F  +   +K  YC ++L ++
Subjt:  RDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH

A0A5A7SYW1 Retrotransposon protein3.8e-8750Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI AHDVKNRV++ +F RSGETVSRHFN VL AVLRL++ L+K+P  VT++C D RWK F+NCLGALDGTYIKVNV   DRP +RTRKGEIATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCDTKGDF +VL GW+G                                    GF APYRG+RYHL EWRGA +APT  KE+FNMKHSSARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR------------------------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGW
        FG+LKGR                                                            M++S+R P+H WT+ ++  LVE L+ LV   GW
Subjt:  FGLLKGR------------------------------------------------------------MASSSRDPKHTWTKNKDAKLVETLVSLVHACGW

Query:  RSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG
        +S+NGTF++GYL QL  +M EKL  Q V A + ID R+ TLK+ +QAIAEM G  CSG G
Subjt:  RSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLG

A0A803QNC5 Uncharacterized protein1.5e-8639.66Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVA+FLHI+AHDVKNR+VR QFARSGETVSRHFN VL+A+L LHD+LLKKP  +   C D RWKWF+NCLGALDGTYIKVNV   +RPRYRTRK EIATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG-----------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAF
        VL V      F +VLPGWEG                                   GF  PYRG+RYHL++W      P +P+EFFNM+HSSARNV+ERAF
Subjt:  VLAVCDTKGDFTFVLPGWEG-----------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAF

Query:  GLLKGRM------------------------ASSSRDP----KHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSS
        GLLKGR                         A+S   P    KH WT  +D+KLVE LV + ++  W+++NGTFK GYL QLE +M +++    + AQ  
Subjt:  GLLKGRM------------------------ASSSRDP----KHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSS

Query:  IDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNA--------SRQRKKHLTCG----------SRLGICLWKGSSHGEVSEVMGEMGSNMSDDEV-DLSA
        IDSR+  LK+QY AI++MLG   SG G    L              K H T              L I   K  + G+     G MG + + DE+ +   
Subjt:  IDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNA--------SRQRKKHLTCG----------SRLGICLWKGSSHGEVSEVMGEMGSNMSDDEV-DLSA

Query:  SPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKR----------SGSNTHIAEIVTSFTEQLKSIAEWLKEKRAIEADFRDKIVTHSME
        +  ND   P   +   N +       P SQT      +R +KRK           S S    + +  S ++ +K +A+  + + A  A  R K+     +
Subjt:  SPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKR----------SGSNTHIAEIVTSFTEQLKSIAEWLKEKRAIEADFRDKIVTHSME

Query:  VPELDNRKRVKLMDILFHNMKATESFFSISAALKLEY
        V  L N +R+K+  +L  N    + FF++    KL++
Subjt:  VPELDNRKRVKLMDILFHNMKATESFFSISAALKLEY

E5GCB5 Retrotransposon protein5.9e-10441.13Show/hide
Query:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        MVAMFLHI+AHDVK+RV++ +F RSGET+SRHFN VL AV+RLH+ LLKKP+ V   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATN
Subjt:  MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VL VCDTKGDF +VL GWEG                                    GF APYRG+RYHL EWRG  +AP+T KEFFNMKH SARNVIERA
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR-----------------------------------------------------------------------------MASSSRDPKHTWTKNK
        FG+LKGR                                                                             M SSSR PKHTWTK +
Subjt:  FGLLKGR-----------------------------------------------------------------------------MASSSRDPKHTWTKNK

Query:  DAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK-----------
        +A LVE LV LV+A GWRS+NGTF+ GYL QL  +M  K+ G ++ A S+IDSR+  +K+ + A+AEM G  CSG G           K           
Subjt:  DAKLVETLVSLVHACGWRSNNGTFKAGYLGQLENLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRK-----------

Query:  -----KHLTCGSRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIA
             K       L     K  + G  +E   ++GSN        +A    DT  P    P  NMS D+   T  ++       S GSKRKR G  T   
Subjt:  -----KHLTCGSRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFTMPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIA

Query:  EIVTSF----TEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH
        +IV +      EQL  IAEW   +R      R +IV H   +PEL    R +LM IL  N+   ++F  +   +K  YC L+L ++
Subjt:  EIVTSF----TEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAALKLEYCELLLDKH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein7.0e-1730.94Show/hide
Query:  VAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKP------ELVTTSCTDPR-WKWFQNCLGALDGTYIKVNVGVVDRPRYRTRK
        VAMFL I  H+   R V  +F R+ ETV R F  VL A   L    ++ P       +      D R W +F   +GA+DGT++ V V    +  Y  R 
Subjt:  VAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKP------ELVTTSCTDPR-WKWFQNCLGALDGTYIKVNVGVVDRPRYRTRK

Query:  GEIATNVLAVCDTKGDFTFVLPGWEG-------------------------------GFP------APYRGE-----RYHLSEWRGAGSAPTTPKEFFNM
           + N++A+CD K  FT++  G  G                               G+P      APYR       RYH+S++   G  P    E FN 
Subjt:  GEIATNVLAVCDTKGDFTFVLPGWEG-------------------------------GFP------APYRGE-----RYHLSEWRGAGSAPTTPKEFFNM

Query:  KHSSARNVIERAFGLLKGRMASS
         H+S R+VIER F + K +M  S
Subjt:  KHSSARNVIERAFGLLKGRMASS

AT4G10890.1 unknown protein2.1e-0537.04Show/hide
Query:  GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAFGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHA
        G+  P+R   YHL ++ G G  P T +E FN KH   R+VI+R FG+ K +     R   HT  KN +  +  T  ++ HA
Subjt:  GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAFGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHA

AT5G28950.1 unknown protein8.3e-1043.1Show/hide
Query:  WKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEG
        + +F++C+GA+D T+I   V     P +R RKG+I+ N+LA C+   +F +VL GWEG
Subjt:  WKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEG

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.4e-1238.6Show/hide
Query:  FTFVLPGWEG--------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAFGLLKGRMASSSRDPKHTWT
        F +VL GWEG                           F AP+RG RYHL E+ G    P TP E FN++H S RNVIER FG+ K R A     P  ++ 
Subjt:  FTFVLPGWEG--------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAFGLLKGRMASSSRDPKHTWT

Query:  KNKDAKLVETLVSL
          K A LV T  +L
Subjt:  KNKDAKLVETLVSL

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)5.2e-2031.88Show/hide
Query:  VAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRL-HDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN
        +A+FL I+ H+++ R V+  F  SGET+SRHFN VL+AV+ +  D         T    DP   +F++C+G +D  +I V VGV ++  +R   G +  N
Subjt:  VAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRL-HDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATN

Query:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA
        VLA       F +VL GWEG                                    GF APY G   +  E           KE FN +H      I R 
Subjt:  VLAVCDTKGDFTFVLPGWEG------------------------------------GFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERA

Query:  FGLLKGR
        FG LK R
Subjt:  FGLLKGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGCCATGTTCTTGCACATTGTAGCCCACGACGTTAAGAACCGAGTAGTTCGTACGCAGTTCGCTAGGTCTGGGGAGACAGTTTCAAGGCATTTCAACACCGTCCT
TCATGCGGTGTTACGATTGCATGATGTTCTCTTAAAAAAACCTGAGCTAGTCACAACCTCTTGTACAGATCCAAGGTGGAAATGGTTTCAGAATTGCCTCGGTGCGTTAG
ATGGAACATACATCAAGGTCAATGTTGGTGTTGTCGATCGCCCGAGGTATAGAACAAGGAAAGGTGAAATAGCAACGAACGTGCTTGCTGTCTGTGATACAAAAGGAGAC
TTCACATTCGTCTTACCAGGGTGGGAAGGGGGGTTTCCGGCACCGTATAGAGGGGAACGTTACCACCTCTCTGAATGGCGTGGTGCGGGGAGTGCACCAACTACTCCAAA
AGAATTCTTTAACATGAAGCATTCATCTGCTAGGAACGTGATCGAGAGGGCATTCGGTTTGTTGAAAGGAAGAATGGCCAGTTCGTCAAGAGATCCAAAACACACTTGGA
CAAAGAACAAGGACGCGAAGCTGGTGGAGACCCTCGTGTCCTTAGTTCATGCATGTGGTTGGAGGTCCAATAATGGGACGTTCAAAGCTGGGTATCTGGGGCAGCTGGAG
AATTTGATGAGGGAGAAACTGCTTGGACAAGACGTTCCAGCACAGAGCAGCATCGACTCTAGGGTTCACACCTTAAAGAAACAATACCAAGCAATTGCAGAGATGTTGGG
TCATGGATGTAGTGGCTTGGGTGGAACGATGAATTTAAATGCATCAAGACAGAGAAAGAAACATTTGACTTGTGGGTCAAGACTTGGCATATGTCTTTGGAAAGGATCGA
GCCACGGGGAGGTGTCGGAGGTGATGGGCGAGATGGGATCCAACATGTCAGATGATGAGGTAGACCTCAGTGCATCCCCAGAGAACGACACCCACATCCCGCTGTTCACC
ATGCCTACTGCGAACATGTCAAAGGATGAACCTCGGGGTACGCCAGGCAGTCAAACTTGCCCAACAGGAAGGTCGTCACGTGGGAGCAAGAGGAAGAGGTCTGGGTCTAA
CACGCACATAGCAGAAATTGTCACCTCGTTTACAGAGCAACTGAAGTCAATTGCAGAGTGGCTGAAAGAAAAACGTGCCATAGAGGCTGACTTCCGAGATAAAATTGTCA
CCCACTCGATGGAGGTACCAGAATTGGATAACCGAAAGAGGGTGAAGCTCATGGATATCCTTTTCCATAACATGAAAGCAACAGAGAGCTTCTTCTCCATTTCGGCTGCT
CTGAAGTTGGAGTATTGTGAACTCCTCCTGGACAAACACGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGCCATGTTCTTGCACATTGTAGCCCACGACGTTAAGAACCGAGTAGTTCGTACGCAGTTCGCTAGGTCTGGGGAGACAGTTTCAAGGCATTTCAACACCGTCCT
TCATGCGGTGTTACGATTGCATGATGTTCTCTTAAAAAAACCTGAGCTAGTCACAACCTCTTGTACAGATCCAAGGTGGAAATGGTTTCAGAATTGCCTCGGTGCGTTAG
ATGGAACATACATCAAGGTCAATGTTGGTGTTGTCGATCGCCCGAGGTATAGAACAAGGAAAGGTGAAATAGCAACGAACGTGCTTGCTGTCTGTGATACAAAAGGAGAC
TTCACATTCGTCTTACCAGGGTGGGAAGGGGGGTTTCCGGCACCGTATAGAGGGGAACGTTACCACCTCTCTGAATGGCGTGGTGCGGGGAGTGCACCAACTACTCCAAA
AGAATTCTTTAACATGAAGCATTCATCTGCTAGGAACGTGATCGAGAGGGCATTCGGTTTGTTGAAAGGAAGAATGGCCAGTTCGTCAAGAGATCCAAAACACACTTGGA
CAAAGAACAAGGACGCGAAGCTGGTGGAGACCCTCGTGTCCTTAGTTCATGCATGTGGTTGGAGGTCCAATAATGGGACGTTCAAAGCTGGGTATCTGGGGCAGCTGGAG
AATTTGATGAGGGAGAAACTGCTTGGACAAGACGTTCCAGCACAGAGCAGCATCGACTCTAGGGTTCACACCTTAAAGAAACAATACCAAGCAATTGCAGAGATGTTGGG
TCATGGATGTAGTGGCTTGGGTGGAACGATGAATTTAAATGCATCAAGACAGAGAAAGAAACATTTGACTTGTGGGTCAAGACTTGGCATATGTCTTTGGAAAGGATCGA
GCCACGGGGAGGTGTCGGAGGTGATGGGCGAGATGGGATCCAACATGTCAGATGATGAGGTAGACCTCAGTGCATCCCCAGAGAACGACACCCACATCCCGCTGTTCACC
ATGCCTACTGCGAACATGTCAAAGGATGAACCTCGGGGTACGCCAGGCAGTCAAACTTGCCCAACAGGAAGGTCGTCACGTGGGAGCAAGAGGAAGAGGTCTGGGTCTAA
CACGCACATAGCAGAAATTGTCACCTCGTTTACAGAGCAACTGAAGTCAATTGCAGAGTGGCTGAAAGAAAAACGTGCCATAGAGGCTGACTTCCGAGATAAAATTGTCA
CCCACTCGATGGAGGTACCAGAATTGGATAACCGAAAGAGGGTGAAGCTCATGGATATCCTTTTCCATAACATGAAAGCAACAGAGAGCTTCTTCTCCATTTCGGCTGCT
CTGAAGTTGGAGTATTGTGAACTCCTCCTGGACAAACACGGCTGA
Protein sequenceShow/hide protein sequence
MVAMFLHIVAHDVKNRVVRTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPELVTTSCTDPRWKWFQNCLGALDGTYIKVNVGVVDRPRYRTRKGEIATNVLAVCDTKGD
FTFVLPGWEGGFPAPYRGERYHLSEWRGAGSAPTTPKEFFNMKHSSARNVIERAFGLLKGRMASSSRDPKHTWTKNKDAKLVETLVSLVHACGWRSNNGTFKAGYLGQLE
NLMREKLLGQDVPAQSSIDSRVHTLKKQYQAIAEMLGHGCSGLGGTMNLNASRQRKKHLTCGSRLGICLWKGSSHGEVSEVMGEMGSNMSDDEVDLSASPENDTHIPLFT
MPTANMSKDEPRGTPGSQTCPTGRSSRGSKRKRSGSNTHIAEIVTSFTEQLKSIAEWLKEKRAIEADFRDKIVTHSMEVPELDNRKRVKLMDILFHNMKATESFFSISAA
LKLEYCELLLDKHG