; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039770 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039770
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr2:49607430..49608554
RNA-Seq ExpressionLag0039770
SyntenyLag0039770
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]6.3e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]6.3e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]6.3e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]6.3e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]1.9e-9457.58Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSS-SSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSIT
        F F+DGS  AP + L++ S T S  ++ +S P +NP +EDW+AKDQALMTLINATLS EALAY+V   +S++ WE LEKHYSS++R+N+VNLK+DLQSI 
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSS-SSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSIT

Query:  KKSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFC
        KK++ES+D+Y+KRIKEIKDK ANVS  +NDE LLIY LNGL  EYNT  +SMR R+QSV+F EL V +K+EESAIEKQ KREDLV QP  + ASS     
Subjt:  KKSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFC

Query:  PPPSSSSQSSF-----RGRGRGGRNSVRGRN--SSTFTSPRRGRGFPNSSPS-SVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNH
         P S +  S+F       RGR G+N+ RG+   + TFT+  RGR   N   S   D  + CQIC + GH ALDCYNRMN++FQGRHPPPQLAAMVA QN+
Subjt:  PPPSSSSQSSF-----RGRGRGGRNSVRGRN--SSTFTSPRRGRGFPNSSPS-SVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNH

Query:  QYISTQHNQFSPTSAVGSPWLADSGCNTHVTSDLSNL---AISSEYNDEENVAVGN
         Y++  ++  SPT+     WLADS CNTH+T+DLSNL   +I+S+YN EEN++VG+
Subjt:  QYISTQHNQFSPTSAVGSPWLADSGCNTHVTSDLSNL---AISSEYNDEENVAVGN

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X23.0e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X33.0e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X13.0e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

A0A5D3CLI6 T4.53.0e-9055.52Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK
        + F+DG+N  PP+T        SSS+ +  P  NP YEDW+AKDQALMT+INATLS EALAY+VG  SS++ W+ L K YSS +RSN+VNLK+DLQ+I K
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITK

Query:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP
        K DES+D+YIKRIKEIKDKLANVS+ +N+EDLLIY LNGLP EYNTF +SMR RSQ VTF EL VLL+AEESA+ KQ+K +D   QPT +L+SS +    
Subjt:  KSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCP

Query:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ
         P+  + +  RG G G      G    +F +  RG G      S  D    CQIC R GH ALDC+NRMNYNFQGRHPP QLAAMVA+QN+ ++      
Subjt:  PPSSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQ

Query:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN
            S V S  L DSGCNT +TSD++ ++++ EYN EE V +GN
Subjt:  FSPTSAVGSPWLADSGCNTHVTSDLSNLAISSEYNDEENVAVGN

A0A6J1D9L6 uncharacterized protein LOC1110188929.2e-9557.58Show/hide
Query:  FWFVDGSNKAPPKTLSAGSSTTSS-SSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSIT
        F F+DGS  AP + L++ S T S  ++ +S P +NP +EDW+AKDQALMTLINATLS EALAY+V   +S++ WE LEKHYSS++R+N+VNLK+DLQSI 
Subjt:  FWFVDGSNKAPPKTLSAGSSTTSS-SSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSIT

Query:  KKSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFC
        KK++ES+D+Y+KRIKEIKDK ANVS  +NDE LLIY LNGL  EYNT  +SMR R+QSV+F EL V +K+EESAIEKQ KREDLV QP  + ASS     
Subjt:  KKSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFC

Query:  PPPSSSSQSSF-----RGRGRGGRNSVRGRN--SSTFTSPRRGRGFPNSSPS-SVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNH
         P S +  S+F       RGR G+N+ RG+   + TFT+  RGR   N   S   D  + CQIC + GH ALDCYNRMN++FQGRHPPPQLAAMVA QN+
Subjt:  PPPSSSSQSSF-----RGRGRGGRNSVRGRN--SSTFTSPRRGRGFPNSSPS-SVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNH

Query:  QYISTQHNQFSPTSAVGSPWLADSGCNTHVTSDLSNL---AISSEYNDEENVAVGN
         Y++  ++  SPT+     WLADS CNTH+T+DLSNL   +I+S+YN EEN++VG+
Subjt:  QYISTQHNQFSPTSAVGSPWLADSGCNTHVTSDLSNL---AISSEYNDEENVAVGN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.0e-1924.36Show/hide
Query:  FVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITKKS
        F+DGS   PP T+             +AP +NP+Y  W  +D+ + + +   +S      +    ++ + WE L K Y++ +  ++  L+T L+  T K 
Subjt:  FVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITKKS

Query:  DESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPPP
         +++D Y++ +    D+LA +   ++ ++ +   L  LP EY      +  +    T  E+   L   ES I        L V   T++  +AN      
Subjt:  DESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPPP

Query:  SSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQFS
        ++++ ++  G  R  R   R  N+++    +    F  ++  S  +   CQIC   GH A  C               QL   +++ N Q   +    + 
Subjt:  SSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQFS

Query:  PTS--AVGSP-----WLADSGCNTHVTSDLSNLAISSEYNDEENVAVGNCCTI
        P +  A+GSP     WL DSG   H+TSD +NL++   Y   ++V V +  TI
Subjt:  PTS--AVGSP-----WLADSGCNTHVTSDLSNLAISSEYNDEENVAVGNCCTI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-1121.53Show/hide
Query:  FVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITKKS
        F+DGS   PP T+             + P +NP+Y  W  +D+ + + I   +S      +    ++ + WE L K Y++ +  ++  L+          
Subjt:  FVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITKKS

Query:  DESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPPP
              +I R     D+LA +   ++ ++ +   L  LP +Y      +  +         P L +  E  I +++K   L +    ++  +AN      
Subjt:  DESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPPP

Query:  SSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQFS
        ++++++      RG   +    N+ + +      G  + +     +   CQIC   GH A  C              PQL    +T N Q  ++    + 
Subjt:  SSSSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQFS

Query:  PTS--AVGSP-----WLADSGCNTHVTSDLSNLAISSEYNDEENVAVGNCCTI
        P +  AV SP     WL DSG   H+TSD +NL+    Y   ++V + +  TI
Subjt:  PTS--AVGSP-----WLADSGCNTHVTSDLSNLAISSEYNDEENVAVGNCCTI

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.9e-0825.22Show/hide
Query:  SSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITKKSDESVDSYIKRIKEIKDKLANVSSVV
        ++A D+N +  D + K    ++L       +     V  ++SR+ W  ++  + ++  +  + L ++L++     D  V  Y +++K++ D L NV   V
Subjt:  SSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTDLQSITKKSDESVDSYIKRIKEIKDKLANVSSVV

Query:  NDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPP-----PSSSSQSSFRGRGRGGRNSVR
         D +L++Y LNGL  +++   + ++ R    +F +   +L+ EE  +++  K     V  ++     A    PP      S  +Q  +RGRGR G N  R
Subjt:  NDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPP-----PSSSSQSSFRGRGRGGRNSVR

Query:  GR-------NSSTFTSPRRGRGFPNS
        GR       N  TF S  R   + NS
Subjt:  GR-------NSSTFTSPRRGRGFPNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTGGAAGTTTCAATTTTCATCAATGTTGCGTGCACACAAACTTTTTGGTTTGTTGATGGCTCTAACAAGGCTCCGCCGAAGACTCTTTCTGCTGGATCATCTAC
TACATCGTCTTCCTCTGAATCTTCTGCTCCTGATCTTAATCCTGAGTATGAAGATTGGCTTGCTAAGGATCAAGCTTTGATGACTTTGATCAATGCTACACTGTCTACGG
AAGCCCTAGCCTACATCGTTGGTTGTAATTCTTCAAGAGAGAAGTGGGAAGCCCTAGAAAAACACTATTCTTCTTCGACTCGGTCGAATATTGTCAATCTTAAGACTGAT
TTGCAGTCTATCACGAAGAAATCGGATGAATCTGTTGATTCATACATCAAGCGCATCAAAGAAATAAAGGACAAACTTGCAAATGTTTCGTCAGTGGTCAATGATGAGGA
TCTTCTCATCTATACACTCAATGGACTTCCTGTTGAGTACAATACATTTTGCAGTTCTATGCGAATCAGGTCTCAATCCGTTACGTTCGCTGAATTACCTGTCTTACTCA
AAGCAGAGGAATCAGCGATTGAAAAACAAGCCAAACGAGAGGATCTGGTTGTTCAACCTACAACGATGCTTGCCTCTTCAGCCAATCAGTTTTGTCCGCCTCCATCATCC
TCTTCTCAGTCTTCCTTTCGTGGTCGTGGTCGAGGAGGACGTAATTCAGTACGAGGTCGGAATTCTAGCACTTTTACTTCTCCTAGACGAGGTCGGGGTTTTCCTAATTC
TTCGCCATCATCTGTTGATTTTCCAACGGCTTGTCAGATCTGTCACCGACCTGGTCATATGGCTCTCGATTGTTACAATCGAATGAACTACAATTTTCAAGGCAGGCATC
CTCCTCCTCAGTTAGCTGCAATGGTAGCCACACAGAATCATCAGTATATATCTACTCAGCATAATCAATTTTCTCCCACCTCTGCGGTTGGATCTCCATGGCTTGCTGAT
TCTGGATGCAACACTCATGTTACATCAGACCTGTCGAATTTGGCCATCTCCTCTGAGTACAACGATGAAGAAAATGTTGCTGTTGGTAACTGTTGCACGATTTTATACCG
CAAGCGTACGGGTCGTCACAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTGGAAGTTTCAATTTTCATCAATGTTGCGTGCACACAAACTTTTTGGTTTGTTGATGGCTCTAACAAGGCTCCGCCGAAGACTCTTTCTGCTGGATCATCTAC
TACATCGTCTTCCTCTGAATCTTCTGCTCCTGATCTTAATCCTGAGTATGAAGATTGGCTTGCTAAGGATCAAGCTTTGATGACTTTGATCAATGCTACACTGTCTACGG
AAGCCCTAGCCTACATCGTTGGTTGTAATTCTTCAAGAGAGAAGTGGGAAGCCCTAGAAAAACACTATTCTTCTTCGACTCGGTCGAATATTGTCAATCTTAAGACTGAT
TTGCAGTCTATCACGAAGAAATCGGATGAATCTGTTGATTCATACATCAAGCGCATCAAAGAAATAAAGGACAAACTTGCAAATGTTTCGTCAGTGGTCAATGATGAGGA
TCTTCTCATCTATACACTCAATGGACTTCCTGTTGAGTACAATACATTTTGCAGTTCTATGCGAATCAGGTCTCAATCCGTTACGTTCGCTGAATTACCTGTCTTACTCA
AAGCAGAGGAATCAGCGATTGAAAAACAAGCCAAACGAGAGGATCTGGTTGTTCAACCTACAACGATGCTTGCCTCTTCAGCCAATCAGTTTTGTCCGCCTCCATCATCC
TCTTCTCAGTCTTCCTTTCGTGGTCGTGGTCGAGGAGGACGTAATTCAGTACGAGGTCGGAATTCTAGCACTTTTACTTCTCCTAGACGAGGTCGGGGTTTTCCTAATTC
TTCGCCATCATCTGTTGATTTTCCAACGGCTTGTCAGATCTGTCACCGACCTGGTCATATGGCTCTCGATTGTTACAATCGAATGAACTACAATTTTCAAGGCAGGCATC
CTCCTCCTCAGTTAGCTGCAATGGTAGCCACACAGAATCATCAGTATATATCTACTCAGCATAATCAATTTTCTCCCACCTCTGCGGTTGGATCTCCATGGCTTGCTGAT
TCTGGATGCAACACTCATGTTACATCAGACCTGTCGAATTTGGCCATCTCCTCTGAGTACAACGATGAAGAAAATGTTGCTGTTGGTAACTGTTGCACGATTTTATACCG
CAAGCGTACGGGTCGTCACAAGTAA
Protein sequenceShow/hide protein sequence
MFVEVSIFINVACTQTFWFVDGSNKAPPKTLSAGSSTTSSSSESSAPDLNPEYEDWLAKDQALMTLINATLSTEALAYIVGCNSSREKWEALEKHYSSSTRSNIVNLKTD
LQSITKKSDESVDSYIKRIKEIKDKLANVSSVVNDEDLLIYTLNGLPVEYNTFCSSMRIRSQSVTFAELPVLLKAEESAIEKQAKREDLVVQPTTMLASSANQFCPPPSS
SSQSSFRGRGRGGRNSVRGRNSSTFTSPRRGRGFPNSSPSSVDFPTACQICHRPGHMALDCYNRMNYNFQGRHPPPQLAAMVATQNHQYISTQHNQFSPTSAVGSPWLAD
SGCNTHVTSDLSNLAISSEYNDEENVAVGNCCTILYRKRTGRHK