; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001486 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001486
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr4:31898615..31900850
RNA-Seq ExpressionLag0001486
SyntenyLag0001486
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]5.6e-15077.07Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]2.3e-15177.01Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVIN
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKLFGF+DGT PCP ++  S+ ST PPQ+NPLY+DWIAKDQALMTVIN
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVIN

Query:  ATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMR
        ATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+MR
Subjt:  ATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMR

Query:  TRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQIC
        TR QPVTFEELHVLL++EESALAKQSKCDD +NQP  LL+ SQSL S APTFNNN ++G G G + GHG FSFD Q  G G SQ+Q+  V D H +CQIC
Subjt:  TRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQIC

Query:  LRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
         RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  LRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]5.6e-15077.07Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]5.6e-15077.07Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]2.3e-15177.01Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVIN
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKLFGF+DGT PCP ++  S+ ST PPQ+NPLY+DWIAKDQALMTVIN
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVIN

Query:  ATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMR
        ATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+MR
Subjt:  ATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMR

Query:  TRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQIC
        TR QPVTFEELHVLL++EESALAKQSKCDD +NQP  LL+ SQSL S APTFNNN ++G G G + GHG FSFD Q  G G SQ+Q+  V D H +CQIC
Subjt:  TRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQIC

Query:  LRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
         RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  LRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.7e-15077.07Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.7e-15077.07Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X18.7e-14971.33Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQDKFSGKILFQEPSINGLYPIVSKAT
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ    G    +  S +G +    K  
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQDKFSGKILFQEPSINGLYPIVSKAT

Query:  AASSSASTSSSCSTVAHVAAKG
         AS SAS +       H   KG
Subjt:  AASSSASTSSSCSTVAHVAAKG

A0A5D3CLI6 T4.52.7e-15077.07Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI
        M+ S    SSSAEKD LSPIFLL+NICNLIS++LDSTNFVLWKFQ+TAILKAHKL+GFIDGT PC P +N +SS ST PPQSNP Y+DWIAKDQALMTVI
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPC-PASNVASSISTGPPQSNPLYDDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM
        NATLSPEALAYVVGSTSSKQVWDVLA+LYSS SR NVVNLKS+LQTI KKPDESIDAYIKRIKEIKDKLA+VST INEEDLLIYALNGLP E+NTFRT+M
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAM

Query:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI
        RTR QPVTFEELHVLL++EESALAKQSK DD +NQP  LL+ SQSL S APTF+NN ++G G G   GHG FSFD Q  G GSS +Q+S V D H +CQI
Subjt:  RTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQI

Query:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ
        C RRGHTALDCFNRMNY+FQG HPP QLAAMVASQNNAFLSI NSSS         IT D++Y+SLA +YNGEEQ
Subjt:  CLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSSS--------PITKDLHYLSLASKYNGEEQ

A0A6J1D9L6 uncharacterized protein LOC1110188922.2e-10758.25Show/hide
Query:  SIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASS--------ISTGPPQSNPLYDDWIAKDQAL
        ++ SSS++ +KDL SPIFLL+NICNL+SI+LDST+F+LWKFQ+TAILKAHKLFGFIDG+   P+  +ASS         +T  P  NP ++DWIAKDQAL
Subjt:  SIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASS--------ISTGPPQSNPLYDDWIAKDQAL

Query:  MTVINATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTF
        MT+INATLS EALAYVV S +SKQVW+VL + YSSNSR NVVNLKS+LQ+I KK +ESIDAY+KRIKEIKDK A+VS  IN+E LLIYALNGL TE+NT 
Subjt:  MTVINATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTF

Query:  RTAMRTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSF----SFDTQGHGLGSSQQQQSVVP
         T+MRTR Q V+FEELHV +KSEESA+ KQ K +DL  QP AL   S    +    F+ N     G G +NG G      +F  QG G  S     S   
Subjt:  RTAMRTRPQPVTFEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSF----SFDTQGHGLGSSQQQQSVVP

Query:  DKHPSCQICLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSS-----------SPITKDLHYL---SLASKYNGEE
        D    CQIC + GHTALDC+NRMN+ FQG HPP QLAAMVA QNN++L++ NSS           + +T DL  L   S+AS YNGEE
Subjt:  DKHPSCQICLRRGHTALDCFNRMNYSFQGCHPPHQLAAMVASQNNAFLSIANSS-----------SPITKDLHYL---SLASKYNGEE

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-2124.2Show/hide
Query:  SSSAEKDLLSPIFLL-TNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEA
        ++ AE+ +L+   +L  N+ N+   KL STN+++W  Q+ A+   ++L GF+DG+   P + + +  +   P+ NP Y  W  +D+ + + +   +S   
Subjt:  SSSAEKDLLSPIFLL-TNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEA

Query:  LAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVT
           V  +T++ Q+W+ L ++Y++ S  +V  L+++L+  + K  ++ID Y++ +    D+LA +   ++ ++ +   L  LP E+      +  +  P T
Subjt:  LAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVT

Query:  FEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVV---PDKHPS------CQ
          E+H  L + ES +   S           ++  + +  SH  T   N+        +NG+ +  +D + +   S   QQS     P+ + S      CQ
Subjt:  FEELHVLLKSEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVV---PDKHPS------CQ

Query:  ICLRRGHTALDCFNRMNY-----------SFQGCHPPHQLAAMVASQNNAFLSIANSSSPITKDLHYLSLASKYNG
        IC  +GH+A  C    ++            F    P   LA      +N +L  + ++  IT D + LSL   Y G
Subjt:  ICLRRGHTALDCFNRMNY-----------SFQGCHPPHQLAAMVASQNNAFLSIANSSSPITKDLHYLSLASKYNG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.6e-1422.85Show/hide
Query:  IFLLTNICNLIS---IKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEALAYVVGSTS
        + + TNI N+      KL STN+++W  Q+ A+   ++L GF+DG+ P P + + +      P+ NP Y  W  +D+ + + I   +S      V  +T+
Subjt:  IFLLTNICNLIS---IKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEALAYVVGSTS

Query:  SKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVTFEELHVLLK
        + Q+W+ L ++Y++ S  +V  L+                +I R     D+LA +   ++ ++ +   L  LP ++      +  +  P +  E+H  L 
Subjt:  SKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVTFEELHVLLK

Query:  SEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCG---TSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQICLRRGHTALDC---
        + ES L   +  +        ++  + ++ +H  T  N +    G      +N + S S+     G  S  +Q      +   CQIC  +GH+A  C   
Subjt:  SEESALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCG---TSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQICLRRGHTALDC---

Query:  --FNRMNYSFQGCHP--PHQLAAMVASQN----NAFLSIANSSSPITKDLHYLSLASKYNGEEQDKFSGKILFQEPSINGLYPIVSKATAASSSASTSSS
          F       Q   P  P Q  A +A  +    N +L  + ++  IT D + LS    Y G +       ++  + S     PI    +A+  ++S S  
Subjt:  --FNRMNYSFQGCHP--PHQLAAMVASQN----NAFLSIANSSSPITKDLHYLSLASKYNGEEQDKFSGKILFQEPSINGLYPIVSKATAASSSASTSSS

Query:  CSTVAHV
         + V +V
Subjt:  CSTVAHV

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.3e-0722.91Show/hide
Query:  MNPSIDSSSSSAEKDLLSPIFLLTNI-----CNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQAL
        M  +I S S +++ D  SP +L  +I      ++  +  D  N+V WK +  + L+  K FGFIDGT P             P   +PLY  W   +  +
Subjt:  MNPSIDSSSSSAEKDLLSPIFLLTNI-----CNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQAL

Query:  MTVINATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTV
        M  +  +++ + L  V+ + ++ ++W+ L R++       +  L+  L T+ ++  +S++ Y  ++ ++  +L++ + +
Subjt:  MTVINATLSPEALAYVVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTV

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.2e-1024.06Show/hide
Query:  IFLLTNICNLISIKLD--STNFVLWKFQMTAILKAHKLFGFIDGT-RPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEAL-AYVVGST
        I+ ++NI + I + LD   +N+  W+        +  + G IDGT  P  A++V                +W  +D  +   +  TL+P+      V S+
Subjt:  IFLLTNICNLISIKLD--STNFVLWKFQMTAILKAHKLFGFIDGT-RPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEAL-AYVVGST

Query:  SSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVTFEELHVLL
        +S+ +W  +   + +N     + L SEL+T     D  +  Y +++K++ D L +V   + + +L++Y LNGL  +F+     ++ R    +F++   +L
Subjt:  SSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVTFEELHVLL

Query:  KSEESALAKQSK
        + EE  L +  K
Subjt:  KSEESALAKQSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCCTCTATTGACTCCTCTTCTTCTTCAGCTGAGAAAGACTTACTTTCACCAATTTTTCTGCTGACCAACATCTGCAACCTGATTTCAATCAAACTTGACTCTAC
AAATTTTGTCCTATGGAAATTCCAGATGACAGCGATTTTGAAAGCTCATAAGCTTTTTGGCTTTATCGACGGTACTCGTCCATGTCCTGCTTCGAATGTTGCATCTTCTA
TCTCTACTGGTCCACCTCAATCAAATCCTTTGTATGATGACTGGATTGCTAAAGATCAAGCACTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATAT
GTTGTTGGCAGCACTTCCTCCAAACAGGTTTGGGATGTTCTAGCTAGGCTGTATTCTTCTAATTCTCGGTATAATGTGGTTAATTTGAAGTCTGAATTACAAACTATTTC
CAAGAAGCCTGATGAATCGATCGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTGATGTTTCTACTGTTATCAATGAGGAGGATCTTCTTATCTATG
CTCTAAATGGCCTTCCAACTGAGTTTAATACTTTTCGAACGGCTATGCGTACGCGTCCTCAGCCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAAATCTGAGGAATCA
GCTCTAGCAAAACAGTCTAAATGTGATGATTTGTTTAATCAGCCAATTGCTTTGCTAACTTTTTCTCAGTCTCTTCCATCTCATGCTCCTACTTTCAATAATAACTCTAT
TCAAGGCTGTGGATGTGGTACAAGTAATGGACATGGAAGTTTCTCTTTTGATACTCAAGGCCATGGTCTTGGTTCTTCCCAACAGCAGCAGTCTGTTGTTCCTGATAAGC
ATCCATCTTGTCAGATTTGTTTACGTCGTGGCCATACTGCACTTGATTGTTTCAATCGAATGAACTATAGTTTTCAAGGATGTCACCCTCCACATCAGCTTGCTGCAATG
GTTGCATCACAGAATAATGCTTTTCTATCTATTGCTAATTCTTCTTCTCCTATTACTAAAGATCTACATTATCTTTCTCTTGCATCTAAATATAATGGTGAAGAACAGGA
CAAGTTTTCGGGCAAAATTTTGTTCCAAGAACCTAGCATCAATGGTCTATATCCGATCGTTTCTAAAGCTACGGCTGCTTCCAGTTCAGCCTCCACCAGCAGTAGTTGTT
CTACTGTTGCTCATGTTGCTGCCAAGGGGGAGCATCGGTCGGTCGAGATCGGTTTTGGCCCCAAACCGACGCCGAACCGACTAACAAGCAAAAGAAAAACGGATGGACGG
ACGGCCGACGAGAACGAAGAAGAAGACGAAGAACGCGACTGCGGAGTGCGGAGGGGCGGCGAACCGACGACGACTGCGGAGTGGAGGCGACGCGAGCGACGGCGACTGGG
GTCTGCGGTGGTGGTGTGGTGCGTGGCTGCTGCGAGTGCGAGGGTTTTTCTGGCGAAGAAGAAGAAGATAACTTTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCCTCTATTGACTCCTCTTCTTCTTCAGCTGAGAAAGACTTACTTTCACCAATTTTTCTGCTGACCAACATCTGCAACCTGATTTCAATCAAACTTGACTCTAC
AAATTTTGTCCTATGGAAATTCCAGATGACAGCGATTTTGAAAGCTCATAAGCTTTTTGGCTTTATCGACGGTACTCGTCCATGTCCTGCTTCGAATGTTGCATCTTCTA
TCTCTACTGGTCCACCTCAATCAAATCCTTTGTATGATGACTGGATTGCTAAAGATCAAGCACTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATAT
GTTGTTGGCAGCACTTCCTCCAAACAGGTTTGGGATGTTCTAGCTAGGCTGTATTCTTCTAATTCTCGGTATAATGTGGTTAATTTGAAGTCTGAATTACAAACTATTTC
CAAGAAGCCTGATGAATCGATCGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTGATGTTTCTACTGTTATCAATGAGGAGGATCTTCTTATCTATG
CTCTAAATGGCCTTCCAACTGAGTTTAATACTTTTCGAACGGCTATGCGTACGCGTCCTCAGCCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAAATCTGAGGAATCA
GCTCTAGCAAAACAGTCTAAATGTGATGATTTGTTTAATCAGCCAATTGCTTTGCTAACTTTTTCTCAGTCTCTTCCATCTCATGCTCCTACTTTCAATAATAACTCTAT
TCAAGGCTGTGGATGTGGTACAAGTAATGGACATGGAAGTTTCTCTTTTGATACTCAAGGCCATGGTCTTGGTTCTTCCCAACAGCAGCAGTCTGTTGTTCCTGATAAGC
ATCCATCTTGTCAGATTTGTTTACGTCGTGGCCATACTGCACTTGATTGTTTCAATCGAATGAACTATAGTTTTCAAGGATGTCACCCTCCACATCAGCTTGCTGCAATG
GTTGCATCACAGAATAATGCTTTTCTATCTATTGCTAATTCTTCTTCTCCTATTACTAAAGATCTACATTATCTTTCTCTTGCATCTAAATATAATGGTGAAGAACAGGA
CAAGTTTTCGGGCAAAATTTTGTTCCAAGAACCTAGCATCAATGGTCTATATCCGATCGTTTCTAAAGCTACGGCTGCTTCCAGTTCAGCCTCCACCAGCAGTAGTTGTT
CTACTGTTGCTCATGTTGCTGCCAAGGGGGAGCATCGGTCGGTCGAGATCGGTTTTGGCCCCAAACCGACGCCGAACCGACTAACAAGCAAAAGAAAAACGGATGGACGG
ACGGCCGACGAGAACGAAGAAGAAGACGAAGAACGCGACTGCGGAGTGCGGAGGGGCGGCGAACCGACGACGACTGCGGAGTGGAGGCGACGCGAGCGACGGCGACTGGG
GTCTGCGGTGGTGGTGTGGTGCGTGGCTGCTGCGAGTGCGAGGGTTTTTCTGGCGAAGAAGAAGAAGATAACTTTTTTTTAA
Protein sequenceShow/hide protein sequence
MNPSIDSSSSSAEKDLLSPIFLLTNICNLISIKLDSTNFVLWKFQMTAILKAHKLFGFIDGTRPCPASNVASSISTGPPQSNPLYDDWIAKDQALMTVINATLSPEALAY
VVGSTSSKQVWDVLARLYSSNSRYNVVNLKSELQTISKKPDESIDAYIKRIKEIKDKLADVSTVINEEDLLIYALNGLPTEFNTFRTAMRTRPQPVTFEELHVLLKSEES
ALAKQSKCDDLFNQPIALLTFSQSLPSHAPTFNNNSIQGCGCGTSNGHGSFSFDTQGHGLGSSQQQQSVVPDKHPSCQICLRRGHTALDCFNRMNYSFQGCHPPHQLAAM
VASQNNAFLSIANSSSPITKDLHYLSLASKYNGEEQDKFSGKILFQEPSINGLYPIVSKATAASSSASTSSSCSTVAHVAAKGEHRSVEIGFGPKPTPNRLTSKRKTDGR
TADENEEEDEERDCGVRRGGEPTTTAEWRRRERRRLGSAVVVWCVAAASARVFLAKKKKITFF