; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027216 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027216
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr10:45919065..45923395
RNA-Seq ExpressionLag0027216
SyntenyLag0027216
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0005488 - binding (molecular function)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017250.1 Heme-binding-like protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-9294.76Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM
        I+VETPK+ELIQST DYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGG EGKPVTM
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM

Query:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPV
        QFVLPSKYKKAEEAPKPAD SV IREEGERK+AVVRFSGIATEGVVAQKVENLKKSLEKDG+KVIGDFVLARYNPPWTLPSLRTNEVMIP+
Subjt:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPV

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.4e-9643.56Show/hide
Query:  PYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLT
        P P+L Q L++KL++TN LL K+QLLN ++ANGL  F+D    +PPK+LD    Q NPEF  W+R N+ +M WIYS L+   +G+IV  +TA +IW+SL 
Subjt:  PYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLT

Query:  RAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQN
          Y+S + A +M L +QL RI+K  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI NRSD P+L++V SLL  YE RL +++
Subjt:  RAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQN

Query:  SVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALICHHRTNLAYH
            LN  QAN             P +   NN I                                         PQCQICGK GH AL  +HRTNL YH
Subjt:  SVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALICHHRTNLAYH

Query:  TAP-PQALLSSSNVHLPSPDSISTFSTDSYHP--------DENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIY
            P A   + N    +   IS   T S  P        D +W++DSGATHH TP+   +T ++ Y  G+H  VG+ K I IS IG   LHS S KPI+
Subjt:  TAP-PQALLSSSNVHLPSPDSISTFSTDSYHP--------DENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIY

Query:  LDSVLYTPAITKKLLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSV-FLASVKAQH----------W
        L+ VL+TP I+K+L+SV RL  DN+AFVEFY +FFLVKD +TKQ++L+G LE GLY+L    A P    SS+  SSPS  F  +  AQ           W
Subjt:  LDSVLYTPAITKKLLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSV-FLASVKAQH----------W

Query:  HLRLG
        H RLG
Subjt:  HLRLG

XP_016899693.1 PREDICTED: heme-binding-like protein At3g10130, chloroplastic [Cucumis melo]5.6e-9393.75Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM
        I+VETPK+ELIQST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGSEGKPVTM
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM

Query:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE
        QFVLPSKYKKAEEAPKPADE VVI+EEGERKLAVVRFSGIATEGVVA+KVE LKKSLEKDG+KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]3.5e-13563.38Show/hide
Query:  PKRAPQFFQPPQLANPLPFTPNPYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYL
        P   P F   P    P PF+ NP+PTLPQPLNVKLND NFLLWKNQLLNAV+ANGL G+LDG++  PP+FLD HQ QPNP +  WERYNR +MCWIYS L
Subjt:  PKRAPQFFQPPQLANPLPFTPNPYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYL

Query:  SEEKMGEIVSLNTASEIWSSLTRAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSD
        SEEKMGE+VSL T  +IWSSLTR YDSKTTARIMGLKT+L  +RKDG +VSQYL++IK+IADKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+D
Subjt:  SEEKMGEIVSLNTASEIWSSLTRAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSD

Query:  CPALEDVRSLLLAYEARLEKQNSVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPF--NPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRP
         P+LEDVRSLLLAYEARL+KQN+VDQLN+AQAN   L+L +  +  P + S  N  +  F  +P SA        S SILGKPQS  + KWP K + ++ 
Subjt:  CPALEDVRSLLLAYEARLEKQNSVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPF--NPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRP

Query:  QCQICGKFGHTALICHHRTNLAYHTAPPQALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNI
        QCQICGK GH+A +C+HRTN+AYH A PQAL        PSP   S+   +  HPDE+WF+DSGATHHMTPD++ L +  PY GGE VTVG+G ++
Subjt:  QCQICGKFGHTALICHHRTNLAYHTAPPQALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNI

XP_038906231.1 heme-binding-like protein At3g10130, chloroplastic [Benincasa hispida]1.2e-9293.23Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM
        I+VETPK+ELIQST DYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIG+PQNIKSEKVAMTAPVITKSEKI MTAPVVT GGGG EGKP+TM
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM

Query:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE
        QFVLPSKYKKAEEAPKP DE+VVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDG+K+IGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE

TrEMBL top hitse value%identityAlignment
A0A1S4DUQ0 heme-binding-like protein At3g10130, chloroplastic2.7e-9393.75Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM
        I+VETPK+ELIQST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGSEGKPVTM
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM

Query:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE
        QFVLPSKYKKAEEAPKPADE VVI+EEGERKLAVVRFSGIATEGVVA+KVE LKKSLEKDG+KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE16.9e-9743.56Show/hide
Query:  PYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLT
        P P+L Q L++KL++TN LL K+QLLN ++ANGL  F+D    +PPK+LD    Q NPEF  W+R N+ +M WIYS L+   +G+IV  +TA +IW+SL 
Subjt:  PYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLT

Query:  RAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQN
          Y+S + A +M L +QL RI+K  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI NRSD P+L++V SLL  YE RL +++
Subjt:  RAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQN

Query:  SVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALICHHRTNLAYH
            LN  QAN             P +   NN I                                         PQCQICGK GH AL  +HRTNL YH
Subjt:  SVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALICHHRTNLAYH

Query:  TAP-PQALLSSSNVHLPSPDSISTFSTDSYHP--------DENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIY
            P A   + N    +   IS   T S  P        D +W++DSGATHH TP+   +T ++ Y  G+H  VG+ K I IS IG   LHS S KPI+
Subjt:  TAP-PQALLSSSNVHLPSPDSISTFSTDSYHP--------DENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIY

Query:  LDSVLYTPAITKKLLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSV-FLASVKAQH----------W
        L+ VL+TP I+K+L+SV RL  DN+AFVEFY +FFLVKD +TKQ++L+G LE GLY+L    A P    SS+  SSPS  F  +  AQ           W
Subjt:  LDSVLYTPAITKKLLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSV-FLASVKAQH----------W

Query:  HLRLG
        H RLG
Subjt:  HLRLG

A0A5A7TLQ2 Heme-binding-like protein2.7e-9393.75Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM
        I+VETPK+ELIQST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGSEGKPVTM
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM

Query:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE
        QFVLPSKYKKAEEAPKPADE VVI+EEGERKLAVVRFSGIATEGVVA+KVE LKKSLEKDG+KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE

A0A6J1DQX7 uncharacterized protein LOC1110223151.7e-13563.38Show/hide
Query:  PKRAPQFFQPPQLANPLPFTPNPYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYL
        P   P F   P    P PF+ NP+PTLPQPLNVKLND NFLLWKNQLLNAV+ANGL G+LDG++  PP+FLD HQ QPNP +  WERYNR +MCWIYS L
Subjt:  PKRAPQFFQPPQLANPLPFTPNPYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYL

Query:  SEEKMGEIVSLNTASEIWSSLTRAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSD
        SEEKMGE+VSL T  +IWSSLTR YDSKTTARIMGLKT+L  +RKDG +VSQYL++IK+IADKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+D
Subjt:  SEEKMGEIVSLNTASEIWSSLTRAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSD

Query:  CPALEDVRSLLLAYEARLEKQNSVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPF--NPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRP
         P+LEDVRSLLLAYEARL+KQN+VDQLN+AQAN   L+L +  +  P + S  N  +  F  +P SA        S SILGKPQS  + KWP K + ++ 
Subjt:  CPALEDVRSLLLAYEARLEKQNSVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPF--NPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRP

Query:  QCQICGKFGHTALICHHRTNLAYHTAPPQALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNI
        QCQICGK GH+A +C+HRTN+AYH A PQAL        PSP   S+   +  HPDE+WF+DSGATHHMTPD++ L +  PY GGE VTVG+G ++
Subjt:  QCQICGKFGHTALICHHRTNLAYHTAPPQALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNI

A0A6J1F979 heme-binding-like protein At3g10130, chloroplastic7.9e-9394.79Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM
        I+VETPK+ELIQST DYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGG EGKPVTM
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTM

Query:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE
        QFVLPSKYKKAEEAPKPAD SV IREEGERK+AVVRFSGIATEGVVAQKVE LKKSLEKDG KVIGDFVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  QFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-5330.61Show/hide
Query:  KLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFL-DEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLTRAYDSKTTAR
        KL  TN+L+W  Q+        L GFLDGS   PP  +  +   + NP++T W+R ++ I   +   +S      +    TA++IW +L + Y + +   
Subjt:  KLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFL-DEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLTRAYDSKTTAR

Query:  IMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQNSVDQLNL---
        +  L+TQL +  K   T+  Y+  +    D+ + +G+P+ + + +  +L+ L  EY   +  I  +   P L ++   LL +E+++   +S   + +   
Subjt:  IMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQNSVDQLNL---

Query:  --AQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRP---QCQICGKFGHTALICHHRTNLAYHTA
          +  N  T N NN G                 N  + +   NNN +S    KP  Q    +   +N ++P   +CQICG  GH+A  C          +
Subjt:  --AQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRP---QCQICGKFGHTALICHHRTNLAYHTA

Query:  PPQALLSSSNVHL-PSP----DSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIYLDSVLY
          Q  LSS N    PSP       +  +  S +   NW LDSGATHH+T D  +L+   PY GG+ V V DG  I IS  GS  L S  S+P+ L ++LY
Subjt:  PPQALLSSSNVHL-PSP----DSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIYLDSVLY

Query:  TPAITKKLLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSVFLASVKAQH--WHLRLG--TSSCFN--
         P I K L+SV RLC  N   VEF+ + F VKDL T   +L+G  +D LY     S+ PV     +  +SP     S KA H  WH RLG    S  N  
Subjt:  TPAITKKLLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSVFLASVKAQH--WHLRLG--TSSCFN--

Query:  --------FGPSFVFVSFVFCKVVVDEQKPQTPSQTTAT
                  PS  F+S   C +    + P + S   +T
Subjt:  --------FGPSFVFVSFVFCKVVVDEQKPQTPSQTTAT

Q9SR77 Heme-binding-like protein At3g10130, chloroplastic1.6e-2137.02Show/hide
Query:  AFLLNITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDG---GFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVT
        AF+    +ET  F ++  T+ YEIR+ EP  VAE T  P +   +  G    F VLA+Y+      +N   EK+ MT PV+T+      EK+ MT PV+T
Subjt:  AFLLNITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDG---GFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVT

Query:  AGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGD---FVLARYNPPWTLPSLRT
        +     +     M FV+PSKY      P P D SV I++   + +AVV FSG  T+  + ++   L+++L+ D    + D   F +A+YNPP+TLP +R 
Subjt:  AGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGD---FVLARYNPPWTLPSLRT

Query:  NEVMIPVE
        NEV + VE
Subjt:  NEVMIPVE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-4730.25Show/hide
Query:  KLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFL-DEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLTRAYDSKTTAR
        KL  TN+L+W  Q+        L GFLDGS P PP  +  +   + NP++T W R ++ I   I   +S      +    TA++IW +L + Y + +   
Subjt:  KLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFL-DEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEKMGEIVSLNTASEIWSSLTRAYDSKTTAR

Query:  IMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQNSVDQLNLAQA
        +    TQL  I +                D+ + +G+P+ + + +  +L+ L  +Y   +  I  +   P+L ++   L+  E++L   NS + + +  A
Subjt:  IMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQNSVDQLNLAQA

Query:  NFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALIC------HHRTNLAYHTAPP
        N  T    NT R   +R  + N               NNN + S   +P S   +   ++  P   +CQIC   GH+A  C         TN    T+P 
Subjt:  NFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALIC------HHRTNLAYHTAPP

Query:  QALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIYLDSVLYTPAITKK
              +N+ + SP           +   NW LDSGATHH+T D  +L+   PY GG+ V + DG  I I+  GS  L + SS+ + L+ VLY P I K 
Subjt:  QALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIYLDSVLYTPAITKK

Query:  LLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSVFLASVKAQH--WHLRLGTSS
        L+SV RLC  N+  VEF+ + F VKDL T   +L+G  +D LY     S+  V     +  +SP       KA H  WH RLG  S
Subjt:  LLSVARLCKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSVFLASVKAQH--WHLRLGTSS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.4e-1120.16Show/hide
Query:  PLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEK-MGEIVSLNTASEIWSSLTRAYDSK
        P+ + + ++N+  W+   L   L+  + G +DG++              N     W++ +  +   +Y  L+ ++  G  V+ +T+ +IW  +   + + 
Subjt:  PLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEEK-MGEIVSLNTASEIWSSLTRAYDSK

Query:  TTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQNSVDQLN
          AR + L ++L       + V+ Y  ++K +AD    +  P++ R+ + ++L+GL  +++  +  I++R   P+ +D  ++L   E RL++    +  +
Subjt:  TTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLAYEARLEKQNSVDQLN

Query:  LAQANFATL----------NLNNTG---RCTPSRASSNNIIRPPFNPFSAF--PTSNN
        +  ++ +T+          N   +G        R   NNI R     FS +  PT N+
Subjt:  LAQANFATL----------NLNNTG---RCTPSRASSNNIIRPPFNPFSAF--PTSNN

AT2G37970.1 SOUL heme-binding family protein1.5e-6762.98Show/hide
Query:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK---------------SEKISMTAPV
        I VETPK+ + +S + YEIR+Y P+V AEVTYD ++F+G++DGGF +LAKYIG  G+P+N K EK+AMTAPVITK               SEKI MT+PV
Subjt:  ITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK---------------SEKISMTAPV

Query:  VT-AGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRT
        VT  GGG    K VTMQF+LPS YKKAEEAP+P DE VVI+EEG RK  V++FSGIA+E VV++KV+ L   LEKDG+K+ GDFVLARYNPPWTLP  RT
Subjt:  VT-AGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRT

Query:  NEVMIPVE
        NEVMIPVE
Subjt:  NEVMIPVE

AT3G10130.1 SOUL heme-binding family protein1.1e-2237.02Show/hide
Query:  AFLLNITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDG---GFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVT
        AF+    +ET  F ++  T+ YEIR+ EP  VAE T  P +   +  G    F VLA+Y+      +N   EK+ MT PV+T+      EK+ MT PV+T
Subjt:  AFLLNITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDG---GFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVT

Query:  AGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGD---FVLARYNPPWTLPSLRT
        +     +     M FV+PSKY      P P D SV I++   + +AVV FSG  T+  + ++   L+++L+ D    + D   F +A+YNPP+TLP +R 
Subjt:  AGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGD---FVLARYNPPWTLPSLRT

Query:  NEVMIPVE
        NEV + VE
Subjt:  NEVMIPVE

AT5G20140.1 SOUL heme-binding family protein1.8e-1734.03Show/hide
Query:  VETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTMQF
        +ETPK+++++ T +YE+R YEP +V E   D    + +   GF  +A YI      +N   EK+ MT PV T++    +++              V++Q 
Subjt:  VETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTMQF

Query:  VLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPW-TLPSLRTNEVMIPVE
        V+PS  K     P P +E V +++      A V+FSG  TE VV  K   L+ SL KDG +     +LARYN P  T   +  NEV+I +E
Subjt:  VLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPW-TLPSLRTNEVMIPVE

AT5G20140.2 SOUL heme-binding family protein2.0e-1633.91Show/hide
Query:  VETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTMQF
        +ETPK+++++ T +YE+R YEP +V E   D    + +   GF  +A YI      +N   EK+ MT PV T++    +++              V++Q 
Subjt:  VETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTAGGGGSEGKPVTMQF

Query:  VLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPP
        V+PS  K     P P +E V +++      A V+FSG  TE VV  K   L+ SL KDG +     +LARYN P
Subjt:  VLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCCCATTAAACCCAAACGGGCCCCTCAATTTTTTCAACCTCCTCAGCTTGCGAATCCGCTGCCCTTTACTCCTAATCCCTACCCGACACTACCTCAACC
TCTGAATGTCAAATTGAATGATACGAACTTCCTATTGTGGAAGAACCAATTGCTCAATGCTGTCCTCGCCAATGGTCTCCATGGGTTTCTCGATGGCTCTGTTCCAGCGC
CTCCAAAGTTTCTTGATGAACATCAGTCACAACCGAATCCTGAATTCACTACATGGGAAAGGTACAATCGCTTCATTATGTGTTGGATTTATTCATACCTTTCCGAAGAG
AAGATGGGCGAAATTGTAAGTCTGAATACTGCTTCTGAAATCTGGTCTTCTTTAACTCGTGCGTATGATTCTAAAACTACAGCTCGAATTATGGGTCTTAAAACTCAACT
CCACCGTATTAGAAAAGATGGTCTCACTGTTAGTCAGTATCTATCTCAGATTAAGGATATTGCGGATAAATTTTCAGCAATAGGGGAACCGATTTCTTATAGGGATCACT
TAGCTCACATATTAGATGGCCTTGGGAGTGAATATAATGCTTTTGTGACATCTATACAAAATCGTTCTGATTGTCCTGCTTTAGAAGATGTGCGTAGTCTTTTGTTGGCA
TATGAAGCTCGGTTAGAAAAGCAAAATAGCGTAGACCAACTTAACCTCGCTCAGGCTAATTTTGCTACTCTTAATCTCAATAACACTGGCCGCTGCACCCCTTCTCGTGC
TTCCTCTAATAACATAATTCGACCTCCCTTCAATCCATTCTCGGCTTTCCCTACCTCAAACAACAATGTTTCTTCTAGCATCCTAGGAAAACCTCAGTCACAACCCCTCC
AAAAATGGCCTCAAAAATCAAATCCTAATCGCCCACAGTGCCAAATATGTGGCAAATTTGGGCACACAGCCCTTATTTGCCACCATCGCACCAATTTGGCTTACCACACA
GCTCCTCCACAAGCCTTACTTTCCTCATCCAATGTCCATCTTCCTTCTCCTGACTCCATCTCCACTTTTTCCACGGATTCATATCACCCAGACGAAAATTGGTTCTTAGA
CTCCGGAGCCACGCATCATATGACCCCGGATGCTACATCTCTCACTCATTCCATACCTTATGTTGGTGGTGAACATGTTACTGTTGGGGATGGTAAGAATATTTCTATCT
CTCTCATTGGTTCTCAATATTTACATTCTTTCTCTTCTAAACCAATTTATCTTGATTCCGTTCTTTATACTCCTGCTATTACTAAGAAATTATTGAGTGTTGCTCGGCTT
TGCAAAGATAACCAAGCATTTGTTGAATTTTACTCTTCTTTTTTCCTTGTTAAGGACCTTCGAACCAAGCAAATTATGCTCAAGGGAATCCTTGAGGATGGTCTATATCG
GTTGTCTGCTGTCTCTGCTTCTCCTGTTCAGCCACCTTCCTCCGCTGCTCTTTCATCTCCTTCCGTTTTTTTGGCATCTGTTAAGGCTCAACATTGGCATTTGAGGTTGG
GGACATCCAGCTGCTTCAACTTTGGGCCAAGTTTTGTCTTTGTTTCGTTTGTCTTCTGCAAAGTCGTCGTGGATGAACAAAAGCCACAAACTCCTTCTCAAACCACCGCC
ACTTATTCCATGGCTGAACTTCCCAATTGCAAATCCAACCCCCTTCTTCAAGCAACCGCCTTTCTTCTCAACATCACTGTCGAAACCCCCAAATTTGAGCTGATTCAATC
CACCAACGACTACGAAATCCGCAAATACGAGCCATCGGTGGTGGCCGAAGTCACCTACGATCCGACCCAGTTCAGAGGCAACAGAGACGGTGGCTTCACTGTATTGGCCA
AGTACATTGGCGCCATCGGCGAGCCACAGAACATCAAGTCCGAGAAAGTGGCCATGACGGCGCCGGTGATCACCAAATCGGAGAAGATTTCGATGACAGCTCCGGTGGTA
ACGGCTGGCGGCGGCGGCAGCGAGGGGAAGCCAGTGACGATGCAGTTTGTGCTGCCGAGCAAGTACAAGAAGGCAGAGGAAGCTCCAAAGCCGGCTGATGAAAGTGTTGT
GATAAGGGAAGAAGGGGAGAGGAAACTCGCCGTCGTGAGATTCAGTGGAATTGCGACGGAGGGAGTGGTGGCGCAGAAGGTGGAGAATCTGAAGAAAAGCTTGGAGAAAG
ATGGGTATAAAGTGATTGGGGATTTTGTGTTGGCTAGATATAACCCACCATGGACATTGCCTTCTTTGAGAACCAATGAAGTCATGATACCAGTAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCCCATTAAACCCAAACGGGCCCCTCAATTTTTTCAACCTCCTCAGCTTGCGAATCCGCTGCCCTTTACTCCTAATCCCTACCCGACACTACCTCAACC
TCTGAATGTCAAATTGAATGATACGAACTTCCTATTGTGGAAGAACCAATTGCTCAATGCTGTCCTCGCCAATGGTCTCCATGGGTTTCTCGATGGCTCTGTTCCAGCGC
CTCCAAAGTTTCTTGATGAACATCAGTCACAACCGAATCCTGAATTCACTACATGGGAAAGGTACAATCGCTTCATTATGTGTTGGATTTATTCATACCTTTCCGAAGAG
AAGATGGGCGAAATTGTAAGTCTGAATACTGCTTCTGAAATCTGGTCTTCTTTAACTCGTGCGTATGATTCTAAAACTACAGCTCGAATTATGGGTCTTAAAACTCAACT
CCACCGTATTAGAAAAGATGGTCTCACTGTTAGTCAGTATCTATCTCAGATTAAGGATATTGCGGATAAATTTTCAGCAATAGGGGAACCGATTTCTTATAGGGATCACT
TAGCTCACATATTAGATGGCCTTGGGAGTGAATATAATGCTTTTGTGACATCTATACAAAATCGTTCTGATTGTCCTGCTTTAGAAGATGTGCGTAGTCTTTTGTTGGCA
TATGAAGCTCGGTTAGAAAAGCAAAATAGCGTAGACCAACTTAACCTCGCTCAGGCTAATTTTGCTACTCTTAATCTCAATAACACTGGCCGCTGCACCCCTTCTCGTGC
TTCCTCTAATAACATAATTCGACCTCCCTTCAATCCATTCTCGGCTTTCCCTACCTCAAACAACAATGTTTCTTCTAGCATCCTAGGAAAACCTCAGTCACAACCCCTCC
AAAAATGGCCTCAAAAATCAAATCCTAATCGCCCACAGTGCCAAATATGTGGCAAATTTGGGCACACAGCCCTTATTTGCCACCATCGCACCAATTTGGCTTACCACACA
GCTCCTCCACAAGCCTTACTTTCCTCATCCAATGTCCATCTTCCTTCTCCTGACTCCATCTCCACTTTTTCCACGGATTCATATCACCCAGACGAAAATTGGTTCTTAGA
CTCCGGAGCCACGCATCATATGACCCCGGATGCTACATCTCTCACTCATTCCATACCTTATGTTGGTGGTGAACATGTTACTGTTGGGGATGGTAAGAATATTTCTATCT
CTCTCATTGGTTCTCAATATTTACATTCTTTCTCTTCTAAACCAATTTATCTTGATTCCGTTCTTTATACTCCTGCTATTACTAAGAAATTATTGAGTGTTGCTCGGCTT
TGCAAAGATAACCAAGCATTTGTTGAATTTTACTCTTCTTTTTTCCTTGTTAAGGACCTTCGAACCAAGCAAATTATGCTCAAGGGAATCCTTGAGGATGGTCTATATCG
GTTGTCTGCTGTCTCTGCTTCTCCTGTTCAGCCACCTTCCTCCGCTGCTCTTTCATCTCCTTCCGTTTTTTTGGCATCTGTTAAGGCTCAACATTGGCATTTGAGGTTGG
GGACATCCAGCTGCTTCAACTTTGGGCCAAGTTTTGTCTTTGTTTCGTTTGTCTTCTGCAAAGTCGTCGTGGATGAACAAAAGCCACAAACTCCTTCTCAAACCACCGCC
ACTTATTCCATGGCTGAACTTCCCAATTGCAAATCCAACCCCCTTCTTCAAGCAACCGCCTTTCTTCTCAACATCACTGTCGAAACCCCCAAATTTGAGCTGATTCAATC
CACCAACGACTACGAAATCCGCAAATACGAGCCATCGGTGGTGGCCGAAGTCACCTACGATCCGACCCAGTTCAGAGGCAACAGAGACGGTGGCTTCACTGTATTGGCCA
AGTACATTGGCGCCATCGGCGAGCCACAGAACATCAAGTCCGAGAAAGTGGCCATGACGGCGCCGGTGATCACCAAATCGGAGAAGATTTCGATGACAGCTCCGGTGGTA
ACGGCTGGCGGCGGCGGCAGCGAGGGGAAGCCAGTGACGATGCAGTTTGTGCTGCCGAGCAAGTACAAGAAGGCAGAGGAAGCTCCAAAGCCGGCTGATGAAAGTGTTGT
GATAAGGGAAGAAGGGGAGAGGAAACTCGCCGTCGTGAGATTCAGTGGAATTGCGACGGAGGGAGTGGTGGCGCAGAAGGTGGAGAATCTGAAGAAAAGCTTGGAGAAAG
ATGGGTATAAAGTGATTGGGGATTTTGTGTTGGCTAGATATAACCCACCATGGACATTGCCTTCTTTGAGAACCAATGAAGTCATGATACCAGTAGAATGA
Protein sequenceShow/hide protein sequence
MVSEPIKPKRAPQFFQPPQLANPLPFTPNPYPTLPQPLNVKLNDTNFLLWKNQLLNAVLANGLHGFLDGSVPAPPKFLDEHQSQPNPEFTTWERYNRFIMCWIYSYLSEE
KMGEIVSLNTASEIWSSLTRAYDSKTTARIMGLKTQLHRIRKDGLTVSQYLSQIKDIADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDCPALEDVRSLLLA
YEARLEKQNSVDQLNLAQANFATLNLNNTGRCTPSRASSNNIIRPPFNPFSAFPTSNNNVSSSILGKPQSQPLQKWPQKSNPNRPQCQICGKFGHTALICHHRTNLAYHT
APPQALLSSSNVHLPSPDSISTFSTDSYHPDENWFLDSGATHHMTPDATSLTHSIPYVGGEHVTVGDGKNISISLIGSQYLHSFSSKPIYLDSVLYTPAITKKLLSVARL
CKDNQAFVEFYSSFFLVKDLRTKQIMLKGILEDGLYRLSAVSASPVQPPSSAALSSPSVFLASVKAQHWHLRLGTSSCFNFGPSFVFVSFVFCKVVVDEQKPQTPSQTTA
TYSMAELPNCKSNPLLQATAFLLNITVETPKFELIQSTNDYEIRKYEPSVVAEVTYDPTQFRGNRDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVV
TAGGGGSEGKPVTMQFVLPSKYKKAEEAPKPADESVVIREEGERKLAVVRFSGIATEGVVAQKVENLKKSLEKDGYKVIGDFVLARYNPPWTLPSLRTNEVMIPVE