; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041535 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041535
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr13:19839541..19842301
RNA-Seq ExpressionLag0041535
SyntenyLag0041535
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]1.5e-9551.07Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKLFGF+DG+ P P +S  TT        S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  +N N  RG+G+     G+  G GR  F    R    GH +    KP  D 
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        +++   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+H+TSDMN+ + + ++ G++QV VG+GQ+ PI+
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSGCGILPTPTSSLKLFDLLC
        HSG   L   + S  +    C
Subjt:  HSGCGILPTPTSSLKLFDLLC

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]4.0e-9350.36Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  ++ N  RG+G+     G+  G GR  F    RG       G++P+ Q   
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        + +   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+ +TSDMN+ + + ++ G++QV +G+GQ+ P++
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSGCGILPTPTSSLKLFDLLC
        HSG       + S  +  L C
Subjt:  HSGCGILPTPTSSLKLFDLLC

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]4.0e-9351.86Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  ++ N  RG+G+     G+  G GR  F    RG       G++P+ Q   
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        + +   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+ +TSDMN+ + + ++ G++QV +G+GQ+ P++
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSG
        HSG
Subjt:  HSG

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]8.5e-9652.61Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKLFGF+DG+ P P +S  TT        S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  +N N  RG+G+     G+  G GR  F    R    GH +    KP  D 
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        +++   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+H+TSDMN+ + + ++ G++QV VG+GQ+ PI+
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSG
        HSG
Subjt:  HSG

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]2.2e-9952.67Show/hide
Query:  STARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI
        +T +DL+SPIFLLSNICNL                      AHKLFGFIDGS   P    ASS  T +  ++T S   +   NP +EDWIAKD AL TLI
Subjt:  STARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI

Query:  NATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL
        NATLS  ALAY+V   T++Q+WE LEKHYSS SRTNVVNLKS+LQ+I KK+ ESI  Y++RIKE+KDK ANVS+TI+ E L+IY ++GL  EYNT  TS+
Subjt:  NATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL

Query:  RTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQT--SSNRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQP
        RTR+QSV+FE+LHV + SEESA+EKQ+KR++   QP  L A +  S NR+  ++PN     G  G + GRG+    P F N GRGR  G+   +      
Subjt:  RTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQT--SSNRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQP

Query:  DQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQN--YNVVANTSVSHPSPWLANSGCNSHVTSDMNHF---TNSSDFTGDDQVTVGS
         Q  +R  CQIC + GH+A+DCYNRMN +FQGRHPP QLAAMVA QN  Y  V N+S   P+ WLA+S CN+H+T+D+++    + +SD+ G++ ++VGS
Subjt:  DQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQN--YNVVANTSVSHPSPWLANSGCNSHVTSDMNHF---TNSSDFTGDDQVTVGS

Query:  GQSLPITHSGCG
        GQS PITH GCG
Subjt:  GQSLPITHSGCG

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.9e-9350.36Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  ++ N  RG+G+     G+  G GR  F    RG       G++P+ Q   
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        + +   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+ +TSDMN+ + + ++ G++QV +G+GQ+ P++
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSGCGILPTPTSSLKLFDLLC
        HSG       + S  +  L C
Subjt:  HSGCGILPTPTSSLKLFDLLC

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.9e-9351.86Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  ++ N  RG+G+     G+  G GR  F    RG       G++P+ Q   
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        + +   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+ +TSDMN+ + + ++ G++QV +G+GQ+ P++
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSG
        HSG
Subjt:  HSG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.9e-9350.36Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  ++ N  RG+G+     G+  G GR  F    RG       G++P+ Q   
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        + +   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+ +TSDMN+ + + ++ G++QV +G+GQ+ P++
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HSGCGILPTPTSSLKLFDLLC
        HSG       + S  +  L C
Subjt:  HSGCGILPTPTSSLKLFDLLC

A0A5D3CLI6 T4.59.5e-9351.74Show/hide
Query:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL                      AHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ
        RSQ VTFE+LHVLL +EESAL KQ K D+++ QPTVLL+ + S  + +  ++ N  RG+G+     G+  G GR  F    RG       G++P+ Q   
Subjt:  RSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSS--NRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQ

Query:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT
        + +   CQIC R GH+A+DC+NRMN NFQGRHPP QLAAMVA+QN   ++  + S     L +SGCN+ +TSDMN+ + + ++ G++QV +G+GQ+ P++
Subjt:  YSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPIT

Query:  HS
        HS
Subjt:  HS

A0A6J1D9L6 uncharacterized protein LOC1110188921.0e-9952.67Show/hide
Query:  STARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI
        +T +DL+SPIFLLSNICNL                      AHKLFGFIDGS   P    ASS  T +  ++T S   +   NP +EDWIAKD AL TLI
Subjt:  STARDLNSPIFLLSNICNL----------------------AHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI

Query:  NATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL
        NATLS  ALAY+V   T++Q+WE LEKHYSS SRTNVVNLKS+LQ+I KK+ ESI  Y++RIKE+KDK ANVS+TI+ E L+IY ++GL  EYNT  TS+
Subjt:  NATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL

Query:  RTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQT--SSNRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQP
        RTR+QSV+FE+LHV + SEESA+EKQ+KR++   QP  L A +  S NR+  ++PN     G  G + GRG+    P F N GRGR  G+   +      
Subjt:  RTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQT--SSNRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQP

Query:  DQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQN--YNVVANTSVSHPSPWLANSGCNSHVTSDMNHF---TNSSDFTGDDQVTVGS
         Q  +R  CQIC + GH+A+DCYNRMN +FQGRHPP QLAAMVA QN  Y  V N+S   P+ WLA+S CN+H+T+D+++    + +SD+ G++ ++VGS
Subjt:  DQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQN--YNVVANTSVSHPSPWLANSGCNSHVTSDMNHF---TNSSDFTGDDQVTVGS

Query:  GQSLPITHSGCG
        GQS PITH GCG
Subjt:  GQSLPITHSGCG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-0824.23Show/hide
Query:  EDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTM
        EDW   D    + I   LS   +  I+  +TA+ +W  LE  Y S + TN + LK +L  +    G +   ++     L  +LAN+ V I+ ED  I  +
Subjt:  EDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTM

Query:  SGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSNRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIP
        + LP+ Y+   T++     ++  + +   L+  E   +K   + +A           +  R ++Y     R   N G S  RG+   R            
Subjt:  SGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSNRSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIP

Query:  GHPVGNTPKPQPDQYSSRIQ-CQICQRAGHSAIDCYN--RMNDNFQGRHPPAQLAAMVATQNYNVVANTS-------VSHP-SPWLANSGCNSHVTSDMN
                        SR++ C  C + GH   DC N  +      G+      AAMV   N NVV   +       +S P S W+ ++  + H T   +
Subjt:  GHPVGNTPKPQPDQYSSRIQ-CQICQRAGHSAIDCYN--RMNDNFQGRHPPAQLAAMVATQNYNVVANTS-------VSHP-SPWLANSGCNSHVTSDMN

Query:  HFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTS---SLKLFDLLCVPHISSNLLS
         F     +   D  TV  G +     +G G +   T+   +L L D+  VP +  NL+S
Subjt:  HFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTS---SLKLFDLLCVPHISSNLLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.6e-3128.51Show/hide
Query:  HKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQT
        ++L GF+DGS          TT   +T  + A  + NP Y  W  +D  + + +   +S +    +    TA Q+WETL K Y++ S  +V  L+++L+ 
Subjt:  HKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQT

Query:  IAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSN
          K + ++I  Y+Q +    D+LA +   +D ++ V   +  LP EY      +  +    T  ++H  L++ ES   K +    A   P    A +  N
Subjt:  IAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSN

Query:  RSQNYNPNIPRGHGNSGNS-RGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQYSSRIQCQICQRAGHSAIDCYNRMN--DNFQGRHPPAQLAAMVAT
         +   N N    +GN  N    R       P+  S       HP  N  KP         +CQIC   GHSA  C    +   +   + PP+        
Subjt:  RSQNYNPNIPRGHGNSGNS-RGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQYSSRIQCQICQRAGHSAIDCYNRMN--DNFQGRHPPAQLAAMVAT

Query:  QNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTSSLKLFDLLCVPHISSNLLSVNRLCADNNCVC
         N   +A  S    + WL +SG   H+TSD N+ +    +TG D V V  G ++PI+H+G   L T +  L L ++L VP+I  NL+SV RLC  N    
Subjt:  QNYNVVANTSVSHPSPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTSSLKLFDLLCVPHISSNLLSVNRLCADNNCVC

Query:  VFDSDFFLIQDKSSGKILFQGPSVNGLYPINPAGCCWAISAS
         F    F ++D ++G  L QG + + LY        W I++S
Subjt:  VFDSDFFLIQDKSSGKILFQGPSVNGLYPINPAGCCWAISAS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-3228.28Show/hide
Query:  HKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQT
        ++L GF+DGS P P +++ T           AV + NP Y  W  +D  + + I   +S +    +    TA Q+WETL K Y++ S  +V  L+     
Subjt:  HKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQT

Query:  IAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSN
                   +I R     D+LA +   +D ++ V   +  LP +Y      +  +    +  ++H  L++ ES   K +  + A   P      T  N
Subjt:  IAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSN

Query:  RSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNY
         + N N N    + N  N+  R   +             P      +   QP  Y  R  CQIC   GHSA  C       FQ      Q  +      +
Subjt:  RSQNYNPNIPRGHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNY

Query:  NVVANTSVSHP---SPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTSSLKLFDLLCVPHISSNLLSVNRLCADNNCVC
           AN +V+ P   + WL +SG   H+TSD N+ +    +TG D V +  G ++PITH+G   LPT + SL L  +L VP+I  NL+SV RLC  N    
Subjt:  NVVANTSVSHP---SPWLANSGCNSHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTSSLKLFDLLCVPHISSNLLSVNRLCADNNCVC

Query:  VFDSDFFLIQDKSSGKILFQGPSVNGLYPINPAGCCWAISAS
         F    F ++D ++G  L QG + + LY        W I++S
Subjt:  VFDSDFFLIQDKSSGKILFQGPSVNGLYPINPAGCCWAISAS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGAGGACCCATGGATGCCTAAAGTCTCCACTTTTAAGCCTGTTAGTATTAATGGAATCATATCTCAGGAGAGAGTTGCTGATTTCATAACACATTCTGGGGAGTG
GGACTTGAGGAAATTAAGGGATTCGGTGATGAGTGAAGACTATGAAATAATTAGAAGAATTCCAATAAACATAAGGCTTAAGGATAAATTAGTTTGGCATTTTGATAAAA
CTGGAATCTACTCTGCAAAGAGTGGCTACAAGCTTTTTATGCAAGGAAAAATAAATGAGGCCTCTTCTAAGATTTGGAGGCTTACCATTAATCGAGTTTTCTTAGAGGAA
GATTTCAACGGGAGCTTCATTGACAGATGGATGAAGATTAATTCACGGTTGGACATTCAGGAGTTGGAGCTGGTAGCTGTTACCTGTTGGTTGATTTGGAATGATAGGAA
CAAATTCATTAATGGTTTTTTGAAAATCAATGTGGACGCCGACTGGATCCCTTCTCCTCCGTCGACAAGGGTGGGTATTGTTTGCAGAAATGATGTTGGTGGTGCAGTGC
CTACTATTTTTGGTGACTCACGTCTCGATCCTTCCACGCTTCTGCATTTCTTAGCAATTTTCTTACCAATGGCTGATTCTACTGCCAGAGATCTCAATTCTCCTATTTTT
CTTCTTTCAAATATTTGTAATCTGGCGCACAAATTGTTTGGATTCATTGATGGATCCTTTCCGAAACCGGCTTCATCTGTACGTACTACCACTTCTTCTTCGTCTACTGA
TTCTTCATCTGCTGTTTCTCAGCCAAATCCACAATATGAGGACTGGATTGCCAAAGATCACGCACTACAAACTCTGATTAATGCAACTCTTTCTCCCACTGCCTTGGCCT
ATATTGTTGGTTGTGAGACTGCACAACAAATGTGGGAAACTCTAGAGAAACACTACTCTTCTACTTCGCGAACAAATGTCGTTAACCTAAAATCAGAGTTGCAAACTATT
GCGAAGAAATCTGGTGAAAGCATCACACAATACATCCAACGCATCAAGGAGTTGAAAGATAAATTGGCCAATGTTTCCGTCACGATTGATGTAGAGGATCTGGTGATCTA
CACGATGAGTGGCTTACCTGCTGAATACAACACTTTCAAGACGTCTCTGCGTACTAGATCTCAGTCGGTCACCTTCGAAAAACTTCATGTTCTTCTTGTTTCTGAAGAGT
CTGCCCTTGAGAAGCAGATTAAACGTGATGAGGCCTTTGCTCAGCCTACTGTTTTACTTGCACAAACTTCTTCAAATCGCAGTCAGAACTACAATCCAAATATACCTCGA
GGTCATGGAAATTCTGGAAATTCTCGAGGCCGTGGAAGAGGTTTTGGTCGTCCTCCTTTTTGTAACTCTGGTCGTGGACGTATACCTGGACATCCTGTTGGTAATACTCC
GAAACCTCAACCTGATCAGTACTCTTCTCGTATTCAATGTCAGATTTGTCAGCGAGCAGGTCATAGTGCGATTGATTGTTATAATCGCATGAATGACAATTTTCAAGGTC
GTCATCCACCTGCTCAGTTAGCCGCTATGGTTGCTACTCAAAATTATAATGTTGTGGCCAACACTTCTGTTTCTCATCCATCACCTTGGCTTGCTAATTCTGGTTGCAAC
TCTCATGTTACCTCTGATATGAATCATTTTACTAATTCTTCTGATTTTACTGGTGATGATCAGGTAACTGTTGGCAGTGGACAGTCCCTTCCTATAACACATTCAGGTTG
TGGCATTCTTCCTACTCCTACTTCCTCTCTTAAACTCTTTGATTTACTTTGTGTTCCTCATATTTCTTCAAATCTTTTATCTGTTAACCGTCTTTGTGCCGATAATAATT
GTGTGTGCGTCTTCGATTCTGACTTTTTTCTCATTCAGGACAAGTCTTCGGGCAAGATCCTTTTTCAAGGACCTAGTGTCAATGGGTTGTATCCTATCAATCCCGCTGGC
TGTTGTTGGGCGATATCCGCAAGCGCACGGGCCACAAGTAATATAAAACGGAAAGTATTCCGAGTATCGAATCCACAGGGAGGCGAAGCGAAACTTAAGCAATTAATTAA
AATTCTCTTGAGCACTGAAAATACTGAGGCGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGAGGACCCATGGATGCCTAAAGTCTCCACTTTTAAGCCTGTTAGTATTAATGGAATCATATCTCAGGAGAGAGTTGCTGATTTCATAACACATTCTGGGGAGTG
GGACTTGAGGAAATTAAGGGATTCGGTGATGAGTGAAGACTATGAAATAATTAGAAGAATTCCAATAAACATAAGGCTTAAGGATAAATTAGTTTGGCATTTTGATAAAA
CTGGAATCTACTCTGCAAAGAGTGGCTACAAGCTTTTTATGCAAGGAAAAATAAATGAGGCCTCTTCTAAGATTTGGAGGCTTACCATTAATCGAGTTTTCTTAGAGGAA
GATTTCAACGGGAGCTTCATTGACAGATGGATGAAGATTAATTCACGGTTGGACATTCAGGAGTTGGAGCTGGTAGCTGTTACCTGTTGGTTGATTTGGAATGATAGGAA
CAAATTCATTAATGGTTTTTTGAAAATCAATGTGGACGCCGACTGGATCCCTTCTCCTCCGTCGACAAGGGTGGGTATTGTTTGCAGAAATGATGTTGGTGGTGCAGTGC
CTACTATTTTTGGTGACTCACGTCTCGATCCTTCCACGCTTCTGCATTTCTTAGCAATTTTCTTACCAATGGCTGATTCTACTGCCAGAGATCTCAATTCTCCTATTTTT
CTTCTTTCAAATATTTGTAATCTGGCGCACAAATTGTTTGGATTCATTGATGGATCCTTTCCGAAACCGGCTTCATCTGTACGTACTACCACTTCTTCTTCGTCTACTGA
TTCTTCATCTGCTGTTTCTCAGCCAAATCCACAATATGAGGACTGGATTGCCAAAGATCACGCACTACAAACTCTGATTAATGCAACTCTTTCTCCCACTGCCTTGGCCT
ATATTGTTGGTTGTGAGACTGCACAACAAATGTGGGAAACTCTAGAGAAACACTACTCTTCTACTTCGCGAACAAATGTCGTTAACCTAAAATCAGAGTTGCAAACTATT
GCGAAGAAATCTGGTGAAAGCATCACACAATACATCCAACGCATCAAGGAGTTGAAAGATAAATTGGCCAATGTTTCCGTCACGATTGATGTAGAGGATCTGGTGATCTA
CACGATGAGTGGCTTACCTGCTGAATACAACACTTTCAAGACGTCTCTGCGTACTAGATCTCAGTCGGTCACCTTCGAAAAACTTCATGTTCTTCTTGTTTCTGAAGAGT
CTGCCCTTGAGAAGCAGATTAAACGTGATGAGGCCTTTGCTCAGCCTACTGTTTTACTTGCACAAACTTCTTCAAATCGCAGTCAGAACTACAATCCAAATATACCTCGA
GGTCATGGAAATTCTGGAAATTCTCGAGGCCGTGGAAGAGGTTTTGGTCGTCCTCCTTTTTGTAACTCTGGTCGTGGACGTATACCTGGACATCCTGTTGGTAATACTCC
GAAACCTCAACCTGATCAGTACTCTTCTCGTATTCAATGTCAGATTTGTCAGCGAGCAGGTCATAGTGCGATTGATTGTTATAATCGCATGAATGACAATTTTCAAGGTC
GTCATCCACCTGCTCAGTTAGCCGCTATGGTTGCTACTCAAAATTATAATGTTGTGGCCAACACTTCTGTTTCTCATCCATCACCTTGGCTTGCTAATTCTGGTTGCAAC
TCTCATGTTACCTCTGATATGAATCATTTTACTAATTCTTCTGATTTTACTGGTGATGATCAGGTAACTGTTGGCAGTGGACAGTCCCTTCCTATAACACATTCAGGTTG
TGGCATTCTTCCTACTCCTACTTCCTCTCTTAAACTCTTTGATTTACTTTGTGTTCCTCATATTTCTTCAAATCTTTTATCTGTTAACCGTCTTTGTGCCGATAATAATT
GTGTGTGCGTCTTCGATTCTGACTTTTTTCTCATTCAGGACAAGTCTTCGGGCAAGATCCTTTTTCAAGGACCTAGTGTCAATGGGTTGTATCCTATCAATCCCGCTGGC
TGTTGTTGGGCGATATCCGCAAGCGCACGGGCCACAAGTAATATAAAACGGAAAGTATTCCGAGTATCGAATCCACAGGGAGGCGAAGCGAAACTTAAGCAATTAATTAA
AATTCTCTTGAGCACTGAAAATACTGAGGCGAAATAA
Protein sequenceShow/hide protein sequence
MYEDPWMPKVSTFKPVSINGIISQERVADFITHSGEWDLRKLRDSVMSEDYEIIRRIPINIRLKDKLVWHFDKTGIYSAKSGYKLFMQGKINEASSKIWRLTINRVFLEE
DFNGSFIDRWMKINSRLDIQELELVAVTCWLIWNDRNKFINGFLKINVDADWIPSPPSTRVGIVCRNDVGGAVPTIFGDSRLDPSTLLHFLAIFLPMADSTARDLNSPIF
LLSNICNLAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTI
AKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEKLHVLLVSEESALEKQIKRDEAFAQPTVLLAQTSSNRSQNYNPNIPR
GHGNSGNSRGRGRGFGRPPFCNSGRGRIPGHPVGNTPKPQPDQYSSRIQCQICQRAGHSAIDCYNRMNDNFQGRHPPAQLAAMVATQNYNVVANTSVSHPSPWLANSGCN
SHVTSDMNHFTNSSDFTGDDQVTVGSGQSLPITHSGCGILPTPTSSLKLFDLLCVPHISSNLLSVNRLCADNNCVCVFDSDFFLIQDKSSGKILFQGPSVNGLYPINPAG
CCWAISASARATSNIKRKVFRVSNPQGGEAKLKQLIKILLSTENTEAK