CuGenDBv2

Lag0022878 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022878
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr7:40019617..40021479
RNA-Seq Expression: Lag0022878
Synteny: Lag0022878
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
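
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:40019617..40021479")
print(chrom, span_length(start, end))  # chr7 1863
```

Under that assumption the gene spans 1863 bp on chr7.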


Homology
GenBank top hits | e value | %identity | Alignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa] | 9.0e-58 | 47.92
Query:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNPYPTLP---QPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW
        S PPAP     PP +NP P     NP P     +P P +P   QPLAVKL+D N+++WK QL+N+V+ANGL  FLDGS   PPRFLD QQQQ NP+F +W
Subjt:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNPYPTLP---QPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG
        +RYNR +M WIY+S++E  +G+IV   +A++IW +L++ Y + + A +  L++ LQ I+K+GLT   Y+ + + + +  A+IGEP++Y DHL + L GLG
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG

Query:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVE
         +YN FVTSIQ+++  P++E+V SLL +Y+ARLE+Q+  +
Subjt:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVE

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] | 2.8e-59 | 39.29
Query:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNP---YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW
        S PPAP     PP +NP P      P P     +P    P++ QPLAVKL+D N+++WK QL+N+V+ANGL  FLDGS   PPRFLD QQQQ NP+F +W
Subjt:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNP---YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG
        +RYNR +M WIY+S++E  +G+IV   +A++IW +L++ Y + + A +  L++ LQ I+K+GLT   Y+ + + + +  A+IGEP++Y DHL + L GLG
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG

Query:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSIL
         +YN FVTSIQ+++  P++E+  S                                        P+SL  +P F   + N F P  +S+S P        
Subjt:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSIL

Query:  GRPQSQSFQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSP-----QAMVT-----NISSGQ------SLPDNV-SMISNDSYH--PDEN
        G+ ++ S+   PS P   RP+CQIC K GHTA  C+H TNL YQ P P      A +T     + SS Q      SLPD+   M S  S+H  PD N
Subjt:  GRPQSQSFQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSP-----QAMVT-----NISSGQ------SLPDNV-SMISNDSYH--PDEN

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale] | 1.5e-57 | 44.44
Query:  SNFSRPPAPFPSF-FPPQNNPNPQLQPNNPYPLPFTPNP-YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFI
        S+ + P    P+   PP N PN Q Q     P P  P P  P++ QP  +KL+  N+L+WKNQL+NV++ANGL  F+DGS P PPRF D  +Q  N ++I
Subjt:  SNFSRPPAPFPSF-FPPQNNPNPQLQPNNPYPLPFTPNP-YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFI

Query:  TWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDG
         W+R+NR IM WIY+SL++  MG+IV   +A EIW +L + Y S + A+I  L+++LQ +RKDGLT  +Y+ + K + +  AA+GEP+S +DHL ++  G
Subjt:  TWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDG

Query:  LGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPF-NPT---NLNFFSPQHSSFSQP--
        L  EYNAFVTSI  R DN  LE++ SLL +YE RLE QN   QL+  QANL+ L ++             P RP F NP      NF +      S P  
Subjt:  LGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPF-NPT---NLNFFSPQHSSFSQP--

Query:  -SSFTPSILGRPQSQ
         + F PSILG+PQ +
Subjt:  -SSFTPSILGRPQSQ

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 9.9e-57 | 39.3
Query:  SFFPPQNNPNPQLQPN---NPYP-LPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFI
        S FPP    N     N   NP P +     P P+L Q L++KL++TN LL K+QL+NV++ANGL  F+D    +PP++LD   +Q NP+F+ W+R N+ +
Subjt:  SFFPPQNNPNPQLQPN---NPYP-LPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFI

Query:  MCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFV
        M WIYSSL+   +G+IV   TA +IW SL   Y+S + A +M L SQLQ+I+K  + +S+YLS++K V D+FA IGEP+SYRD L  IL+GL  EY+ FV
Subjt:  MCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFV

Query:  TSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSILGRPQSQS
        TSI NRSD P+L++V SLL  YE RL ++++ + LN  QAN                    P +P +N                                
Subjt:  TSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSILGRPQSQS

Query:  FQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTP-SPQAMVTNISSGQSLPDNVSMISNDSYHP
                  + PQCQICGK GH AL  +HRTNL Y  P  P A   N +        +S +   S  P
Subjt:  FQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTP-SPQAMVTNISSGQSLPDNVSMISNDSYHP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | 1.8e-114 | 60
Query:  FPPQNNPNPQLQPNNPYPLPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYS
        FPP   PN   QP    P PF+ NP+PTLPQPL VKLND NFLLWKNQL+N V+ANGL G+LDG++  PP+FLD  Q QPNP +  WERYNR +MCWIYS
Subjt:  FPPQNNPNPQLQPNNPYPLPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYS

Query:  SLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNR
        SLSEEKMGE+VSLET  +IW+SL + YDSKTTARIMGLK++LQ +RKDG +VSQYL++IKE+ADKFAA+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR
Subjt:  SLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNR

Query:  SDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSS-----FTPSILGRPQSQSF
        +D+P+LEDVRSLL AYEARL+KQN V+QLN+AQANL +L L +NS+R           PP       F  P H   S P+S      + SILG+P  QS 
Subjt:  SDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSS-----FTPSILGRPQSQSF

Query:  QKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSPQAMVTNISSGQSLPDNVSMISNDSYHPDEN
         KWP +P+ ++ QCQICGK GH+A +C+HRTN+AY   SPQA+  ++    + P +     ++  HPDE+
Subjt:  QKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSPQAMVTNISSGQSLPDNVSMISNDSYHPDEN

TrEMBL top hits | e value | %identity | Alignment
A0A2P5BGF8 Uncharacterized protein | 7.4e-58 | 44.44
Query:  SNFSRPPAPFPSF-FPPQNNPNPQLQPNNPYPLPFTPNP-YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFI
        S+ + P    P+   PP N PN Q Q     P P  P P  P++ QP  +KL+  N+L+WKNQL+NV++ANGL  F+DGS P PPRF D  +Q  N ++I
Subjt:  SNFSRPPAPFPSF-FPPQNNPNPQLQPNNPYPLPFTPNP-YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFI

Query:  TWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDG
         W+R+NR IM WIY+SL++  MG+IV   +A EIW +L + Y S + A+I  L+++LQ +RKDGLT  +Y+ + K + +  AA+GEP+S +DHL ++  G
Subjt:  TWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDG

Query:  LGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPF-NPT---NLNFFSPQHSSFSQP--
        L  EYNAFVTSI  R DN  LE++ SLL +YE RLE QN   QL+  QANL+ L ++             P RP F NP      NF +      S P  
Subjt:  LGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPF-NPT---NLNFFSPQHSSFSQP--

Query:  -SSFTPSILGRPQSQ
         + F PSILG+PQ +
Subjt:  -SSFTPSILGRPQSQ

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 | 4.8e-57 | 39.3
Query:  SFFPPQNNPNPQLQPN---NPYP-LPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFI
        S FPP    N     N   NP P +     P P+L Q L++KL++TN LL K+QL+NV++ANGL  F+D    +PP++LD   +Q NP+F+ W+R N+ +
Subjt:  SFFPPQNNPNPQLQPN---NPYP-LPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFI

Query:  MCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFV
        M WIYSSL+   +G+IV   TA +IW SL   Y+S + A +M L SQLQ+I+K  + +S+YLS++K V D+FA IGEP+SYRD L  IL+GL  EY+ FV
Subjt:  MCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFV

Query:  TSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSILGRPQSQS
        TSI NRSD P+L++V SLL  YE RL ++++ + LN  QAN                    P +P +N                                
Subjt:  TSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSILGRPQSQS

Query:  FQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTP-SPQAMVTNISSGQSLPDNVSMISNDSYHP
                  + PQCQICGK GH AL  +HRTNL Y  P  P A   N +        +S +   S  P
Subjt:  FQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTP-SPQAMVTNISSGQSLPDNVSMISNDSYHP

A0A6J1DQX7 uncharacterized protein LOC111022315 | 8.6e-115 | 60
Query:  FPPQNNPNPQLQPNNPYPLPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYS
        FPP   PN   QP    P PF+ NP+PTLPQPL VKLND NFLLWKNQL+N V+ANGL G+LDG++  PP+FLD  Q QPNP +  WERYNR +MCWIYS
Subjt:  FPPQNNPNPQLQPNNPYPLPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYS

Query:  SLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNR
        SLSEEKMGE+VSLET  +IW+SL + YDSKTTARIMGLK++LQ +RKDG +VSQYL++IKE+ADKFAA+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR
Subjt:  SLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNR

Query:  SDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSS-----FTPSILGRPQSQSF
        +D+P+LEDVRSLL AYEARL+KQN V+QLN+AQANL +L L +NS+R           PP       F  P H   S P+S      + SILG+P  QS 
Subjt:  SDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSS-----FTPSILGRPQSQSF

Query:  QKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSPQAMVTNISSGQSLPDNVSMISNDSYHPDEN
         KWP +P+ ++ QCQICGK GH+A +C+HRTN+AY   SPQA+  ++    + P +     ++  HPDE+
Subjt:  QKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSPQAMVTNISSGQSLPDNVSMISNDSYHPDEN

A0A7J0EGI5 Uncharacterized protein | 4.3e-58 | 47.92
Query:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNPYPTLP---QPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW
        S PPAP     PP +NP P     NP P     +P P +P   QPLAVKL+D N+++WK QL+N+V+ANGL  FLDGS   PPRFLD QQQQ NP+F +W
Subjt:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNPYPTLP---QPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG
        +RYNR +M WIY+S++E  +G+IV   +A++IW +L++ Y + + A +  L++ LQ I+K+GLT   Y+ + + + +  A+IGEP++Y DHL + L GLG
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG

Query:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVE
         +YN FVTSIQ+++  P++E+V SLL +Y+ARLE+Q+  +
Subjt:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVE

A0A7J0GPN0 UBX domain-containing protein | 1.4e-59 | 39.29
Query:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNP---YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW
        S PPAP     PP +NP P      P P     +P    P++ QPLAVKL+D N+++WK QL+N+V+ANGL  FLDGS   PPRFLD QQQQ NP+F +W
Subjt:  SRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNP---YPTLPQPLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG
        +RYNR +M WIY+S++E  +G+IV   +A++IW +L++ Y + + A +  L++ LQ I+K+GLT   Y+ + + + +  A+IGEP++Y DHL + L GLG
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLG

Query:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSIL
         +YN FVTSIQ+++  P++E+  S                                        P+SL  +P F   + N F P  +S+S P        
Subjt:  SEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSIL

Query:  GRPQSQSFQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSP-----QAMVT-----NISSGQ------SLPDNV-SMISNDSYH--PDEN
        G+ ++ S+   PS P   RP+CQIC K GHTA  C+H TNL YQ P P      A +T     + SS Q      SLPD+   M S  S+H  PD N
Subjt:  GRPQSQSFQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSP-----QAMVT-----NISSGQ------SLPDNV-SMISNDSYH--PDEN

SwissProt top hits | e value | %identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.5e-23 | 24.56
Query:  KLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFL-DDQQQQPNPDFITWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTAR
        KL  TN+L+W  Q+  +     L GFLDGS   PP  +  D   + NPD+  W+R ++ I   +  ++S      +    TAA+IW +L+K Y + +   
Subjt:  KLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFL-DDQQQQPNPDFITWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTAR

Query:  IMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQA
        +  L++QL++  K   T+  Y+  +    D+ A +G+P+ + + +  +L+ L  EY   +  I  +   P L ++   L  +E+++        L V+ A
Subjt:  IMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQA

Query:  NLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSILGRPQSQSFQKWPSRPNLNRP---QCQICGKFGHTALIC----HHRTN
         +  +  +  S R+T+             TN N    +++ +   ++   S   +P  QS   +    N ++P   +CQICG  GH+A  C    H  ++
Subjt:  NLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFSQPSSFTPSILGRPQSQSFQKWPSRPNLNRP---QCQICGKFGHTALIC----HHRTN

Query:  LAYQTP----SPQAMVTNISSGQSLPDNVSMISNDSYH
        +  Q P    +P     N++ G     N  ++ + + H
Subjt:  LAYQTP----SPQAMVTNISSGQSLPDNVSMISNDSYH

Arabidopsis top hits | e value | %identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.1e-13 | 20.83
Query:  PLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYSSLSEEK-MGEIVSLETAAEIWNSLKKSYDSK
        P+ + + ++N+  W+   +   L+  + G +DG++              N + + W++ +  +   +Y +L+ ++  G  V+  T+ +IW  +K  + + 
Subjt:  PLAVKLNDTNFLLWKNQLMNVVLANGLHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYSSLSEEK-MGEIVSLETAAEIWNSLKKSYDSK

Query:  TTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEK
          AR + L S+L+      + V+ Y  ++K++AD    +  P++ R+ + ++L+GL  +++  +  I++R   P+ +D  ++L   E RL++
Subjt:  TTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFAAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEK
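
In the alignment blocks above, the unlabeled middle row marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below counts those marks for one displayed row. The function name `midline_stats` is hypothetical, and the result covers only the rows passed in, so it need not equal the %identity column, which is computed by the search tool over the full alignment.

```python
def midline_stats(query_row: str, midline: str) -> dict:
    """Count identities and positives in one alignment row.

    query_row: the aligned query residues (may contain '-' gaps).
    midline:   the match row; trailing spaces may have been stripped,
               so it is padded back to the length of query_row.
    """
    length = len(query_row)
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length if length else 0.0,
    }


# Synthetic five-column example (not taken from the record).
stats = midline_stats("ACDEF", "A+D F")
print(stats["identities"], stats["positives"], stats["pct_identity"])  # 3 4 60.0
```

To apply it to a block from this page, strip the `Query:` prefix and the matching leading indentation from the middle row before calling the function.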


Sequences
CDS sequence
ATGAATCAAAATATAGAGCCTTTCAAGTATATGGGCCATACTGAAGAAGCTAAATCGTTTGCTTTCTATGTCTTGGCAAGTCCAAAATCCCCTGCATTAGCGTCATGCAA
TAACGAACTGAGTCACCAGAACAGTTCAAAAGTGTATCCTAAATATACTAGCTCTGTGTTTTCTTCAGCTATGGCGTCAGATGCTTCTTCTTCTATTACCTCTACTGGAC
CCATTTCTAATCCTATCGACAATATCCTTACACCAGACACCACACCTGTCGTCACCCCTGTTACCCAACGAGGAAAAGCTCCAATTCCTCCTTCTCAACCAAACTTTGTT
CCCCCTTCTCCGAGACCAAATTTTTCGATGAATCCAACCTCCTCAACCCCACCATTTCAACCATACCAACCTCAACTTTTTTATCCAACCCAGCCCTCTTACTTTCAACC
TTATTATCCATCTAATTTTTCCCGCCCGCCTGCTCCTTTTCCTTCGTTCTTTCCCCCACAAAATAATCCCAATCCTCAGCTTCAACCTAACAATCCCTATCCTCTTCCGT
TTACTCCCAACCCTTACCCTACTTTGCCCCAACCCCTTGCAGTTAAGCTGAATGATACTAATTTTCTGTTGTGGAAAAACCAGTTGATGAATGTTGTTCTTGCTAATGGA
CTTCACGGCTTCCTTGATGGATCAGTACCTGCTCCTCCTCGTTTTCTTGATGATCAACAACAACAACCGAACCCGGATTTTATCACTTGGGAAAGGTATAATCGATTTAT
CATGTGTTGGATCTATTCGTCTCTTTCTGAAGAAAAGATGGGAGAAATTGTTAGTTTAGAAACCGCTGCTGAAATTTGGAATTCGTTAAAGAAATCTTATGATTCCAAAA
CCACTGCTAGGATTATGGGTCTTAAGTCTCAACTACAGAAAATTAGAAAGGATGGTCTAACCGTCAGTCAGTATTTATCTCAAATTAAAGAAGTAGCCGATAAATTCGCA
GCCATAGGAGAACCTATCTCTTATCGAGATCACCTAGCTCACATCCTCGATGGCCTTGGGAGTGAGTACAACGCCTTTGTCACCTCTATACAAAATAGATCGGATAATCC
GGCTCTCGAAGATGTCCGTAGTTTATTGTTCGCCTATGAAGCACGGCTGGAAAAACAAAATGTTGTAGAACAGCTTAATGTTGCTCAAGCAAATTTAAGTAGTCTTCAGT
TGCACAATAACAGCCGTCGTCACACTTCCCGCCCATCTTCTCTACCTTCCAGACCTCCCTTCAATCCAACCAACCTTAACTTTTTTTCTCCTCAGCACTCATCATTTTCC
CAACCTTCATCCTTCACCCCTAGTATTTTGGGTCGTCCTCAATCTCAATCTTTTCAGAAATGGCCATCTCGACCCAATCTCAATCGTCCACAGTGTCAAATATGTGGCAA
GTTTGGGCACACCGCTCTTATTTGCCATCATCGAACGAACCTAGCTTATCAAACTCCTTCTCCCCAAGCAATGGTTACTAATATTTCCTCGGGTCAATCCTTGCCTGATA
ATGTCTCCATGATATCCAATGACTCCTACCATCCAGATGAGAATTAG
mRNA sequence
ATGAATCAAAATATAGAGCCTTTCAAGTATATGGGCCATACTGAAGAAGCTAAATCGTTTGCTTTCTATGTCTTGGCAAGTCCAAAATCCCCTGCATTAGCGTCATGCAA
TAACGAACTGAGTCACCAGAACAGTTCAAAAGTGTATCCTAAATATACTAGCTCTGTGTTTTCTTCAGCTATGGCGTCAGATGCTTCTTCTTCTATTACCTCTACTGGAC
CCATTTCTAATCCTATCGACAATATCCTTACACCAGACACCACACCTGTCGTCACCCCTGTTACCCAACGAGGAAAAGCTCCAATTCCTCCTTCTCAACCAAACTTTGTT
CCCCCTTCTCCGAGACCAAATTTTTCGATGAATCCAACCTCCTCAACCCCACCATTTCAACCATACCAACCTCAACTTTTTTATCCAACCCAGCCCTCTTACTTTCAACC
TTATTATCCATCTAATTTTTCCCGCCCGCCTGCTCCTTTTCCTTCGTTCTTTCCCCCACAAAATAATCCCAATCCTCAGCTTCAACCTAACAATCCCTATCCTCTTCCGT
TTACTCCCAACCCTTACCCTACTTTGCCCCAACCCCTTGCAGTTAAGCTGAATGATACTAATTTTCTGTTGTGGAAAAACCAGTTGATGAATGTTGTTCTTGCTAATGGA
CTTCACGGCTTCCTTGATGGATCAGTACCTGCTCCTCCTCGTTTTCTTGATGATCAACAACAACAACCGAACCCGGATTTTATCACTTGGGAAAGGTATAATCGATTTAT
CATGTGTTGGATCTATTCGTCTCTTTCTGAAGAAAAGATGGGAGAAATTGTTAGTTTAGAAACCGCTGCTGAAATTTGGAATTCGTTAAAGAAATCTTATGATTCCAAAA
CCACTGCTAGGATTATGGGTCTTAAGTCTCAACTACAGAAAATTAGAAAGGATGGTCTAACCGTCAGTCAGTATTTATCTCAAATTAAAGAAGTAGCCGATAAATTCGCA
GCCATAGGAGAACCTATCTCTTATCGAGATCACCTAGCTCACATCCTCGATGGCCTTGGGAGTGAGTACAACGCCTTTGTCACCTCTATACAAAATAGATCGGATAATCC
GGCTCTCGAAGATGTCCGTAGTTTATTGTTCGCCTATGAAGCACGGCTGGAAAAACAAAATGTTGTAGAACAGCTTAATGTTGCTCAAGCAAATTTAAGTAGTCTTCAGT
TGCACAATAACAGCCGTCGTCACACTTCCCGCCCATCTTCTCTACCTTCCAGACCTCCCTTCAATCCAACCAACCTTAACTTTTTTTCTCCTCAGCACTCATCATTTTCC
CAACCTTCATCCTTCACCCCTAGTATTTTGGGTCGTCCTCAATCTCAATCTTTTCAGAAATGGCCATCTCGACCCAATCTCAATCGTCCACAGTGTCAAATATGTGGCAA
GTTTGGGCACACCGCTCTTATTTGCCATCATCGAACGAACCTAGCTTATCAAACTCCTTCTCCCCAAGCAATGGTTACTAATATTTCCTCGGGTCAATCCTTGCCTGATA
ATGTCTCCATGATATCCAATGACTCCTACCATCCAGATGAGAATTAG
Protein sequence
MNQNIEPFKYMGHTEEAKSFAFYVLASPKSPALASCNNELSHQNSSKVYPKYTSSVFSSAMASDASSSITSTGPISNPIDNILTPDTTPVVTPVTQRGKAPIPPSQPNFV
PPSPRPNFSMNPTSSTPPFQPYQPQLFYPTQPSYFQPYYPSNFSRPPAPFPSFFPPQNNPNPQLQPNNPYPLPFTPNPYPTLPQPLAVKLNDTNFLLWKNQLMNVVLANG
LHGFLDGSVPAPPRFLDDQQQQPNPDFITWERYNRFIMCWIYSSLSEEKMGEIVSLETAAEIWNSLKKSYDSKTTARIMGLKSQLQKIRKDGLTVSQYLSQIKEVADKFA
AIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEDVRSLLFAYEARLEKQNVVEQLNVAQANLSSLQLHNNSRRHTSRPSSLPSRPPFNPTNLNFFSPQHSSFS
QPSSFTPSILGRPQSQSFQKWPSRPNLNRPQCQICGKFGHTALICHHRTNLAYQTPSPQAMVTNISSGQSLPDNVSMISNDSYHPDEN
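
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code, which the page does not state explicitly, and the function name `translate` is hypothetical. It is demonstrated only on the first six and last four codons of the CDS listed above.

```python
# Standard genetic code, built from the conventional TCAG ordering (assumed).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAATCAAAATATAGAG"))  # MNQNIE (start of the protein)
print(translate("GATGAGAATTAG"))        # DEN (end of the protein, before the stop)
```

Both fragments agree with the protein sequence, which begins with MNQNIE and ends with DEN. Joining the CDS lines and passing the full string to `translate` would extend the same check to the whole record.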