; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000436 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000436
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionU-box domain-containing protein 4
Genome locationchr4:7300840..7306202
RNA-Seq ExpressionLag0000436
SyntenyLag0000436
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR016024 - Armadillo-type fold
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592081.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]2.0e-15669.27Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        MGHSQI R S KP+LDW+AALK+YENVMA+ESEAVKVKAT+KLA LS  AP NILNSTIPIIAKHLE N   NSS  ++ AAAYCL+ IS QGDGTLAT 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        V HSGAL+S+LR LP + G F KILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERACQAIGL+AI++RGRHML ELGV+  LIELF EGD VTKLV+GN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+ I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE
        EANAV+I DHLVRILKEGDD  K A ADVLWDLS YKYS  +      +P     ++   G ++ +                             LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        L+E+KDN+ EALINFSED LYC+RVSEA+S  AFQN+ ERM HI ATER
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

XP_022139563.1 uncharacterized protein LOC111010437 [Momordica charantia]7.6e-15670.73Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQG-DGTLAT
        M  S+I++ES KP+L+W+ AL++Y+NVMASESEAVKVKATVKLA+LS+ APENILNSTIPIIAKHL  N   NSS  ++ AAAYCLR ISC+G DGTLA 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQG-DGTLAT

Query:  TVGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARC
         VG+SGAL+SLLRLLP+S G F KILVKCVWS+VTF KTS VIIARNGGLEVIIGM DSVID +RRYLLEILSA+ALLREVRKALISLRGLPFL++AARC
Subjt:  TVGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARC

Query:  GCMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFFLLV
        GC+ASRERACQAIGLLAI+KRGR MLFELGVV  LIELFR GDC TKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+  IGKEIAEDVF LL 
Subjt:  GCMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFFLLV

Query:  VAEANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLC--VVRVPFQFLWIYYWTGMMKPR-----------------------------LLQ
        VAE NAVVISDHLVRILKEGDD AK A ADVLWDLSGYKYS         +P     +      ++ +                             LLQ
Subjt:  VAEANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLC--VVRVPFQFLWIYYWTGMMKPR-----------------------------LLQ

Query:  DELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        DELEELKDN+ EALINFSED  YC RVSEAVST AF+NMQERM HI  TE+
Subjt:  DELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

XP_022936629.1 uncharacterized protein LOC111443172 [Cucurbita moschata]9.0e-15769.49Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        MGHSQI R S KP+LDW+AALK+YENVMA+ESEAVKVKAT+KLA LS  AP NILNSTIPIIAKHLE N   NSS  ++ AAAYCL+ IS QGDGTLAT 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        V HSGAL+S+LR LP + G F KILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERACQAIGL+AI++RGRHML ELGV+  LIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+ I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE
        EANAV+I DHLVRILKEGDD  K A ADVLWDLS YKYS  +      +P     ++   G ++ +                             LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        L+E+KDN+ EALINFSED LYC+RVSEA+S  AFQN+ ERM HI ATER
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

XP_022976328.1 uncharacterized protein LOC111476761 [Cucurbita maxima]3.4e-15669.04Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        M HS+I+R S KP+LDW+AALK+YENVMA+ESEAVKVKAT+KLA LS  AP NILNSTIPIIAKHLE N   NSS  ++ AAAYCL+ IS QGDGTLAT 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        + HSGAL+S+LR LP + G F KILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERACQAIGL+AI++RGRHML ELGV+  LIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ PD I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE
        EANAV+I DHLVRILKEGDD  K A ADVLWDLS YKYS  +      +P     ++   G ++ +                             LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        L+E+KDN+ EALINFSED LYC+RVSEA+S  AFQN+ ERM HI ATER
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

XP_038897634.1 protein spotted leaf 11 [Benincasa hispida]1.2e-16673.56Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        MGHSQI R S KP+LDW+AAL +YENVMASESEAVKVKAT+KLAHLSK APENILNS IP IAKHLE N   NSS  ++ AAAYCLRCISCQGDGTLA  
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        VGHSGAL+SLLR LPHS G F KILVKCVWSIVTFD+TS VIIARNGGLEVIIGMFDSV DGTRRYLLEILSAMALLREVRKALISLRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERACQAIGLLAI+KRGR ML ELGVV  LIEL REGD VTKLVAGN+LG+VSAHV YIRP+AQAG IPLF +LLQ PD IGKEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVVRVPFQFLWIYYWTGMMKPR-------------------------------LLQDE
        EANAVVISDHLVRILKEGD G K A ADVLWDLS YKYS  +         L      G  + R                               LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVVRVPFQFLWIYYWTGMMKPR-------------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATERK
        LEE+KDN  EAL+NFSEDPLYC RVSEAVST AFQN+QERM HI A ER+
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATERK

TrEMBL top hitse value%identityAlignment
A0A0A0LGL6 Uncharacterized protein3.8e-14566.29Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        MG  +I R S KP+LDW+A+LK+YENVMASESEA+KVKAT+KLA LSK APENIL S IPIIAK LE N  +N+S  ++ AAAYCLRCISC+GDGTLAT 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        VGHSGAL+SL++ L HS G FCKILVKC+WSIVTFD++S VIIARNGGLEVII M   VIDGTRRYLLEILSAMALLREVRKALI  RGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERAC+AIGL+AI+KRGR+ L ELGVV  LIEL REGD VTKLVAGN LGIVSAH+AYIRPVAQAG IPLF DLLQ  D I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVVRVPFQFLWIYYWTGMMKPR-------------------------------LLQDE
        E NAVVI DHLV++LKEGDD  K A ADVL  LS YKYS  +         L      G  + R                               LLQD+
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVVRVPFQFLWIYYWTGMMKPR-------------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATE
        LE++K N  EA+ +FS+DPLYC RV+EA+ST AFQN+QER+ HI A E
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATE

A0A5A7U359 Vacuolar protein 83.8e-14566.96Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        MG ++I R S  P+LDW+A+LK+YENVMASESEA+KVKAT+KLA LSK APENIL S IPIIAK LE    +N+S  ++ AAAYCLRCISCQGDGTLA  
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        VGHSGAL+SLLR L HS G FCKILVKC+WS VTFDK+S VI+ RNGGLEVII M   V DGTRRYLLEILSAMALLREVRKAL+ LRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
         MASRERAC+AIGL+AI+KRGR+ L ELGVV+ LIEL REGD VTKLVAGNALGIVSAH+AYIRPVAQAG IPLF DLLQ PD I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLC----VVRVPFQFL-----------------WIYYWTGMMKPR----------LLQDE
        E NAVVISDHLV++LKEGDD  K A ADVL  LS YKYS  +      + V    L                   YY T  +             LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLC----VVRVPFQFL-----------------WIYYWTGMMKPR----------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATE
        LE++K N  EAL +FS+DPLY  R+SE VST AFQN+QER+ HI ATE
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATE

A0A6J1CFX3 uncharacterized protein LOC1110104373.7e-15670.73Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQG-DGTLAT
        M  S+I++ES KP+L+W+ AL++Y+NVMASESEAVKVKATVKLA+LS+ APENILNSTIPIIAKHL  N   NSS  ++ AAAYCLR ISC+G DGTLA 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQG-DGTLAT

Query:  TVGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARC
         VG+SGAL+SLLRLLP+S G F KILVKCVWS+VTF KTS VIIARNGGLEVIIGM DSVID +RRYLLEILSA+ALLREVRKALISLRGLPFL++AARC
Subjt:  TVGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARC

Query:  GCMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFFLLV
        GC+ASRERACQAIGLLAI+KRGR MLFELGVV  LIELFR GDC TKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+  IGKEIAEDVF LL 
Subjt:  GCMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFFLLV

Query:  VAEANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLC--VVRVPFQFLWIYYWTGMMKPR-----------------------------LLQ
        VAE NAVVISDHLVRILKEGDD AK A ADVLWDLSGYKYS         +P     +      ++ +                             LLQ
Subjt:  VAEANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLC--VVRVPFQFLWIYYWTGMMKPR-----------------------------LLQ

Query:  DELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        DELEELKDN+ EALINFSED  YC RVSEAVST AF+NMQERM HI  TE+
Subjt:  DELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

A0A6J1F8U2 uncharacterized protein LOC1114431724.3e-15769.49Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        MGHSQI R S KP+LDW+AALK+YENVMA+ESEAVKVKAT+KLA LS  AP NILNSTIPIIAKHLE N   NSS  ++ AAAYCL+ IS QGDGTLAT 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        V HSGAL+S+LR LP + G F KILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERACQAIGL+AI++RGRHML ELGV+  LIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+ I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE
        EANAV+I DHLVRILKEGDD  K A ADVLWDLS YKYS  +      +P     ++   G ++ +                             LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        L+E+KDN+ EALINFSED LYC+RVSEA+S  AFQN+ ERM HI ATER
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

A0A6J1ILS8 uncharacterized protein LOC1114767611.7e-15669.04Show/hide
Query:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT
        M HS+I+R S KP+LDW+AALK+YENVMA+ESEAVKVKAT+KLA LS  AP NILNSTIPIIAKHLE N   NSS  ++ AAAYCL+ IS QGDGTLAT 
Subjt:  MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATT

Query:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG
        + HSGAL+S+LR LP + G F KILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGLPFL+QAAR G
Subjt:  VGHSGALQSLLRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA
        CMASRERACQAIGL+AI++RGRHML ELGV+  LIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ PD I KEIAEDVF LL VA
Subjt:  CMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVA

Query:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE
        EANAV+I DHLVRILKEGDD  K A ADVLWDLS YKYS  +      +P     ++   G ++ +                             LLQDE
Subjt:  EANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVV--RVPFQFLWIYYWTGMMKPR-----------------------------LLQDE

Query:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER
        L+E+KDN+ EALINFSED LYC+RVSEA+S  AFQN+ ERM HI ATER
Subjt:  LEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATER

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.8e-0943.94Show/hide
Query:  WHEAMQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY
        W EA+  E +A   N+TW++     N  ++  +WV+ +K    G+  RYKARLVARGF Q Y IDY
Subjt:  WHEAMQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-0923.48Show/hide
Query:  IDVSCSSVVSEFCCQLQFLNPYQVLFQKSPDYTLLRAFGCACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSRHVIFNETFFSCSK
        +  +C  +       L F  P +V   K   Y+ L+ FGC  F+ +      KL+ +S  CIF+GY     GY   DPV  ++  SR V+F E+      
Subjt:  IDVSCSSVVSEFCCQLQFLNPYQVLFQKSPDYTLLRAFGCACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSRHVIFNETFFSCSK

Query:  TDFYFGHVKSSAPDVFQFSASPFPIIPSVPVSSVAPLSLPPSLSHPPGFTPSP-------------VNHVHHPHSPSAPPSSPVNLSFSPPVSSQPEATT
               V+++A    +      P   ++P +S  P S   +          P             V  V HP +       P+  S  P V S+     
Subjt:  TDFYFGHVKSSAPDVFQFSASPFPIIPSVPVSSVAPLSLPPSLSHPPGFTPSP-------------VNHVHHPHSPSAPPSSPVNLSFSPPVSSQPEATT

Query:  SPTPVVPFFPPLSSVFTSPNNPSATINPSILSTSPTSQQSPSVPPSTAPFSNIPATNIHHMVTRAKAGINKPKCWLSVASLPSKPSNFKAAIKCVEWHEA
                +P    V  S +                                                  +P+    V S P K    K          A
Subjt:  SPTPVVPFFPPLSSVFTSPNNPSATINPSILSTSPTSQQSPSVPPSTAPFSNIPATNIHHMVTRAKAGINKPKCWLSVASLPSKPSNFKAAIKCVEWHEA

Query:  MQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY
        MQ+E ++L +N T+ LV      + + CKWV+KLK   +  + RYKARLV +GF+Q  GID+
Subjt:  MQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY

P92520 Uncharacterized mitochondrial protein AtMg008203.7e-2050.49Show/hide
Query:  MVTRAKAGINK--PKCWLSV-ASLPSKPSNFKAAIKCVEWHEAMQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQA
        M+TR+KAGINK  PK  L++  ++  +P +   A+K   W +AMQ+E DAL RN TW LVPP  N  ++ CKWV+K KL  +G ++R KARLVA+GF Q 
Subjt:  MVTRAKAGINK--PKCWLSV-ASLPSKPSNFKAAIKCVEWHEAMQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQA

Query:  YGI
         GI
Subjt:  YGI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.4e-6525.61Show/hide
Query:  KLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQS-QPNPDFITWERYNRFIMCWMYSSFSEKKMGEIVTLETASDIWNYLKHAYDSKTTAR
        KLT TN+L+W  Q+        L GFLDGS   PP  +    + + NPD+  W+R ++ I   +  + S      +    TA+ IW  L+  Y + +   
Subjt:  KLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQS-QPNPDFITWERYNRFIMCWMYSSFSEKKMGEIVTLETASDIWNYLKHAYDSKTTAR

Query:  LMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLLLACEARLEKQNSVDQL-----
        +  L+TQL++  K   T+  Y+  +    D+ + +G+P+ + + +  +L+ L  EY   +  I  +   P L ++H  LL  E+++   +S   +     
Subjt:  LMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLLLACEARLEKQNSVDQL-----

Query:  SLAQANLASLNIHNNSRRSISRSSPSHFSRTPFNPLPFSSSSTFSIVVSPSLLGKPQSQPLQKWPSSTRSNSNRPQCQICGKFGHTALIC----HHRTNL
        +++  N  + N +NN  R+    + ++ +    N  P+  SST                     P++ +S     +CQICG  GH+A  C    H  +++
Subjt:  SLAQANLASLNIHNNSRRSISRSSPSHFSRTPFNPLPFSSSSTFSIVVSPSLLGKPQSQPLQKWPSSTRSNSNRPQCQICGKFGHTALIC----HHRTNL

Query:  AYQTPPPQALLTTTTAQLPLLDSVSTVSMDSYHPDENWFLDSGATHHMTPDASTLTHSMPYLGGEHVTVLLRGTLE-DGLYRVSVAPKS--------IFI
          Q PP         A L         ++ S +   NW LDSGATHH+T D + L+   PY GG+ V V    T+        S++ KS        +++
Subjt:  AYQTPPPQALLTTTTAQLPLLDSVSTVSMDSYHPDENWFLDSGATHHMTPDASTLTHSMPYLGGEHVTVLLRGTLE-DGLYRVSVAPKS--------IFI

Query:  PS----------------------GANFQ---------------------------KPAAVFL---SSVVVPPWHLRLGHPSTAFLRQVLSNI-------
        P+                       A+FQ                           +P ++F    S      WH RLGHP+ + L  V+SN        
Subjt:  PS----------------------GANFQ---------------------------KPAAVFL---SSVVVPPWHLRLGHPSTAFLRQVLSNI-------

Query:  ------------------------------------DVSCSSVVS-------------------------------------------------------
                                            DV  S ++S                                                       
Subjt:  ------------------------------------DVSCSSVVS-------------------------------------------------------

Query:  --------EFCCQ--------------------------------------------------------------LQFLNPYQVLFQKSPDYTLLRAFGC
                E+  Q                                                              LQ  +P+Q LF  SP+Y  LR FGC
Subjt:  --------EFCCQ--------------------------------------------------------------LQFLNPYQVLFQKSPDYTLLRAFGC

Query:  ACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSRHVIFNETFFSCSKTDFYFGHVKSSAPDVFQFSASPFPIIPSVPVSSVAPLSLP
        AC+ +LRPYN HKL+ +S++C+FLGYS     YLCL   + RLY SRHV F+E  F  S       ++ + +P   Q   S     P   + +  P+   
Subjt:  ACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSRHVIFNETFFSCSKTDFYFGHVKSSAPDVFQFSASPFPIIPSVPVSSVAPLSLP

Query:  PSLSHPPGFTPSPVNHVHH----PHSPSAP------PSSPVNLSFSPPVSSQPEATT------SPT--PVVPFFPPLSSVFTSPNNP-------------
        PS S P           HH    P SPSAP       SS ++ SFS    S PE T        PT  P        SS  TS NNP             
Subjt:  PSLSHPPGFTPSPVNHVHH----PHSPSAP------PSSPVNLSFSPPVSSQPEATT------SPT--PVVPFFPPLSSVFTSPNNP-------------

Query:  ----SATINPS---ILSTSPTSQQSPSV----PPSTAPF---SNIPATNIHHMVTRAKAGINKPKCWLSVA---SLPSKPSNFKAAIKCVEWHEAMQKEF
            S++ +PS     S+S TS   PS+    PP  A     +N    N H M TRAKAGI KP    S+A   +  S+P     A+K   W  AM  E 
Subjt:  ----SATINPS---ILSTSPTSQQSPSV----PPSTAPF---SNIPATNIHHMVTRAKAGINKPKCWLSVA---SLPSKPSNFKAAIKCVEWHEAMQKEF

Query:  DALVRNHTWSLVPPSPNY-KVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY
        +A + NHTW LVPP P++  ++ C+W++  K   +G + RYKARLVA+G++Q  G+DY
Subjt:  DALVRNHTWSLVPPSPNY-KVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-5924.41Show/hide
Query:  KLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQ-SQPNPDFITWERYNRFIMCWMYSSFSEKKMGEIVTLETASDIWNYLKHAYDSKTTAR
        KLT TN+L+W  Q+        L GFLDGS P PP  +      + NPD+  W R ++ I   +  + S      +    TA+ IW  L+  Y + +   
Subjt:  KLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQ-SQPNPDFITWERYNRFIMCWMYSSFSEKKMGEIVTLETASDIWNYLKHAYDSKTTAR

Query:  LMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLLLACEARLEKQNSVDQLSLAQA
          G  TQL+ I +                D+ + +G+P+ + + +  +L+ L  +Y   +  I  +   P L ++H  L+  E++L   NS + + +  A
Subjt:  LMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLLLACEARLEKQNSVDQLSLAQA

Query:  NLASLNIHNNSRRSISRSSPSHFSRTPFNPLPFSSSSTFSIVVSPSLLGKPQSQPLQKWPSSTRSNSNRP-----QCQICGKFGHTALICHHRTNLAYQT
        N+ +    N +R   +R    +++                           +S   Q   S +RS++ +P     +CQIC   GH+A  C         T
Subjt:  NLASLNIHNNSRRSISRSSPSHFSRTPFNPLPFSSSSTFSIVVSPSLLGKPQSQPLQKWPSSTRSNSNRP-----QCQICGKFGHTALICHHRTNLAYQT

Query:  PPPQALLTTTTAQLPLLDSVSTVSMDSYHPDENWFLDSGATHHMTPDASTLTHSMPYLGGEHVTV-----------------------------------
           Q+    T  Q       + ++++S +   NW LDSGATHH+T D + L+   PY GG+ V +                                   
Subjt:  PPPQALLTTTTAQLPLLDSVSTVSMDSYHPDENWFLDSGATHHMTPDASTLTHSMPYLGGEHVTV-----------------------------------

Query:  ----------------------------------LLRGTLEDGLYRVSVAPKSIFIPSGANFQKPAAVFLSSVVVPPWHLRLGHPSTAFLRQVLSNID--
                                          LL+G  +D LY   +A       + + F  P     S      WH RLGHPS A L  V+SN    
Subjt:  ----------------------------------LLRGTLEDGLYRVSVAPKSIFIPSGANFQKPAAVFLSSVVVPPWHLRLGHPSTAFLRQVLSNID--

Query:  --------VSCSS---------------------------------------------------------------------------------------
                +SCS                                                                                        
Subjt:  --------VSCSS---------------------------------------------------------------------------------------

Query:  ---------VVSEFCCQ--------------------------------------------------------------LQFLNPYQVLFQKSPDYTLLR
                 V+ ++  Q                                                              LQ  +P+Q LF + P+Y  L+
Subjt:  ---------VVSEFCCQ--------------------------------------------------------------LQFLNPYQVLFQKSPDYTLLR

Query:  AFGCACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSRHVIFNETFFSCSKTDFYFGHVKSSAPDVFQFSASPFP---IIPSVPVSS
         FGCAC+ +LRPYN HKL  +SK+C F+GYS     YLCL   + RLY SRHV F+E  F  S T+F     +    D    SA  +P    +P+ P+  
Subjt:  AFGCACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSRHVIFNETFFSCSKTDFYFGHVKSSAPDVFQFSASPFP---IIPSVPVSS

Query:  VAPLSLPPSL--SHPPGFTPSPV--NHVHHPHSPSAPPSSPVNLSFSPPVSSQPEATTSP---------TPVVPFFPPLSSVFTSPNN----PSATINPS
         AP  L P L  S  P  +PSP+    V   + PS+  SSP +   + P  + P+ T  P         +P++    P S    SPN     P + I+  
Subjt:  VAPLSLPPSL--SHPPGFTPSPV--NHVHHPHSPSAPPSSPVNLSFSPPVSSQPEATTSP---------TPVVPFFPPLSSVFTSPNN----PSATINPS

Query:  ILSTSPTSQQSPSVPPSTA-------------PFSNIPA---TNIHHMVTRAKAGINKPKCWLSVA---SLPSKPSNFKAAIKCVEWHEAMQKEFDALVR
         + T  TS   P+ P S++             P   + A    N H M TRAK GI KP    S A   +  S+P     A+K   W +AM  E +A + 
Subjt:  ILSTSPTSQQSPSVPPSTA-------------PFSNIPA---TNIHHMVTRAKAGINKPKCWLSVA---SLPSKPSNFKAAIKCVEWHEAMQKEFDALVR

Query:  NHTWSLV-PPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY
        NHTW LV PP P+  ++ C+W++  K   +G + RYKARLVA+G++Q  G+DY
Subjt:  NHTWSLV-PPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY

Arabidopsis top hitse value%identityAlignment
AT3G20170.1 ARM repeat superfamily protein7.7e-9043.48Show/hide
Query:  MGHSQIQRES-AKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQG--DGTL
        MG S+ + ES    + DW+    R+EN ++S S +++V++ +KL+ L+ + PE+ ++  IPI+A  L   ++D+S+  V+ AAA+CL+CI+C G  +   
Subjt:  MGHSQIQRES-AKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQG--DGTL

Query:  ATTVGHSGALQSLLRLLPHSR---GSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDS-VIDGTRRYLLEILSAMALLREVRKALISLRGLPFL
        A T+G  G + SLL LL  +      F +I VKC+WS+VTF  +  V +AR GGLE++I   ++   DG+R YLLEILSA+  +RE R+ L+   GL FL
Subjt:  ATTVGHSGALQSLLRLLPHSR---GSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDS-VIDGTRRYLLEILSAMALLREVRKALISLRGLPFL

Query:  MQAARCGCMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDV
        ++AA+ G +ASRERAC AIGL+ +++R R +L E GV+ AL++L+R+GD   KL+AGNALGI+SA   YIRPV +AG IPL+V+LL   D +GK+IAEDV
Subjt:  MQAARCGCMASRERACQAIGLLAISKRGRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDV

Query:  FFLLVVAEANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVVR----VPFQFLWIYYWTGMMKPRL------------------------
        F +L VAE NAV+I++ LVRIL+ GD+ AK+A +DVLWDL+GY++S  + V+R    +P     +   +   + R+                        
Subjt:  FFLLVVAEANAVVISDHLVRILKEGDDGAKVATADVLWDLSGYKYSYQLCVVR----VPFQFLWIYYWTGMMKPRL------------------------

Query:  -----LQDELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATERKM
             L DE EEL+DN+ EALINFSED  +  RV EA+    FQ+MQ R+  I A+   M
Subjt:  -----LQDELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATERKM

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.6e-1635.26Show/hide
Query:  TSQQSPSVPPSTAPFSNIPATNIH--HMVTRAKAGINKPKCWLSVASL---------------------------PSKPSNFKAAIKCVEWHEAMQKEFD
        TS  S  + PS    +++P  ++H  H  TR  A +    C  SVASL                             +PS +  A + + W  AM  E  
Subjt:  TSQQSPSVPPSTAPFSNIPATNIH--HMVTRAKAGINKPKCWLSVASL---------------------------PSKPSNFKAAIKCVEWHEAMQKEFD

Query:  ALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY
        A+   HTW +    PN K I CKWVYK+K   +G IERYKARLVA+G+ Q  GID+
Subjt:  ALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.6e-1525.24Show/hide
Query:  LSVKLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQSQPNP-DFITWERYNRFIMCWMYSSFSEKKMGEIVTLE-TASDIWNYLKHAYDSK
        +++ L   N+ +W+       L+ G+ G +DGS            S P P     W+  +  +  W+Y + ++  +  I+ +  TA D+W  L++ +   
Subjt:  LSVKLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQSQPNP-DFITWERYNRFIMCWMYSSFSEKKMGEIVTLE-TASDIWNYLKHAYDSK

Query:  TTARLMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLLLACEARLEKQNSVDQLS
          AR +  + +L+    D L+V +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  P   +  S+LL  E+RL  ++   + S
Subjt:  TTARLMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLLLACEARLEKQNSVDQLS

Query:  LAQANLASLN
        L+  N  SL+
Subjt:  LAQANLASLN

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.6e-2150.49Show/hide
Query:  MVTRAKAGINK--PKCWLSV-ASLPSKPSNFKAAIKCVEWHEAMQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQA
        M+TR+KAGINK  PK  L++  ++  +P +   A+K   W +AMQ+E DAL RN TW LVPP  N  ++ CKWV+K KL  +G ++R KARLVA+GF Q 
Subjt:  MVTRAKAGINK--PKCWLSV-ASLPSKPSNFKAAIKCVEWHEAMQKEFDALVRNHTWSLVPPSPNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQA

Query:  YGI
         GI
Subjt:  YGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCACTCACAGATTCAGAGAGAATCTGCAAAGCCGAGTCTAGATTGGAAAGCAGCGCTGAAGCGGTATGAAAACGTTATGGCTTCAGAATCCGAGGCTGTTAAAGT
GAAAGCCACAGTCAAATTGGCCCATCTCTCCAAAAAGGCCCCCGAGAATATTTTGAACAGCACCATACCCATTATTGCTAAGCATCTTGAGGGTAACCTCAACGATAATT
CAAGCCATTACGTGAAAGAAGCTGCTGCATATTGTCTAAGATGTATATCTTGTCAAGGTGATGGTACTTTGGCGACGACGGTAGGTCATTCAGGCGCTCTACAGTCGTTG
CTGAGGTTATTGCCACATTCTAGAGGTTCTTTTTGTAAGATTTTAGTTAAATGTGTTTGGAGTATTGTCACTTTTGATAAAACTAGTCATGTGATTATAGCAAGAAATGG
GGGTTTGGAAGTTATTATTGGCATGTTTGATTCTGTAATTGATGGCACAAGGAGATATTTATTGGAGATTTTGAGTGCAATGGCACTATTGAGAGAGGTTAGGAAGGCCC
TTATCAGTTTAAGAGGACTCCCTTTTCTTATGCAAGCTGCTAGATGTGGCTGCATGGCCTCCAGAGAAAGAGCTTGCCAAGCAATTGGGTTGCTTGCAATTTCCAAGCGT
GGAAGACACATGCTTTTTGAGTTGGGAGTGGTTTCAGCGCTTATTGAACTATTTCGTGAAGGAGATTGTGTGACAAAACTGGTAGCTGGTAATGCTCTTGGAATAGTTTC
AGCTCATGTAGCCTATATTAGGCCCGTTGCACAAGCTGGGGTCATTCCTTTATTTGTTGATCTTCTTCAGTGCCCTGACCTTATTGGTAAGGAGATTGCAGAAGATGTCT
TCTTTCTCTTAGTTGTTGCTGAAGCGAATGCAGTAGTCATTTCTGATCATCTAGTGAGAATTCTTAAAGAAGGTGATGATGGAGCAAAGGTTGCAACTGCTGATGTTTTA
TGGGATCTTTCAGGCTATAAGTACTCATATCAGTTGTGCGTAGTTCGGGTGCCATTCCAGTTTTTGTGGATCTATTACTGGACGGGAATGATGAAGCCAAGGCTGTTGCA
AGATGAGTTAGAAGAACTGAAAGATAATTCCGTTGAGGCCCTTATAAATTTTTCTGAAGACCCCTTATACTGTGATAGAGTATCTGAAGCAGTTAGCACTCATGCTTTCC
AGAATATGCAGGAAAGAATGAACCATATTCATGCTACTGAAAGGAAAATGTCATCAGAAGCTTTTAATTCGTCTTCATCTTCTTCGTCTGCACCAATAATGCCACCAGTG
GTTACTCCTTCTACTCCAACTACGACTGCCCAGACCCCTCCATCGCAAGTTCCTCAGCCTGTAAATCCGAACCCAACCCCATTTTCCCCCAACCCGTATCCCACTTTACC
TCAGCCCCTTTCGGTTAAGCTCACTGATACGAATTTTCTCCTCTGGAAGAATCAACTGCTGAATGCTGTGTTAGCCAATGGTCTTCCAGGTTTTCTCGATGGCTCCATTC
CAGCCCCACCGCGGTTTCTTGATGCTCAACAGTCTCAGCCAAATCCGGATTTTATCACTTGGGAAAGGTACAATCGTTTCATTATGTGTTGGATGTATTCCTCCTTTTCT
GAAAAGAAGATGGGTGAGATAGTCACTTTAGAAACTGCCTCTGATATTTGGAACTATTTGAAACATGCTTATGATTCTAAGACTACTGCTAGGTTAATGGGTCTTAAAAC
TCAGCTTCAAAGAATTAAGAAGGATGGTCTTACTGTGAGTCAATATTTGTCTCAGATTAAAGATATTACAGATAAATTTTCTGCTATAGGTGAGCCTATCTCATATAGAG
ATCATTTGGCTCATATCCTTGATGGCCTTGGAAGTGAATACAACGCTTTTGTTACCTCAATTCAAAATCGTTCTGACAGTCCACCTTTAGAGGATGTTCACAGTCTCCTT
TTAGCTTGTGAGGCAAGATTAGAAAAGCAAAACAGTGTTGATCAACTTTCCTTGGCTCAAGCGAATCTTGCTAGTCTCAACATTCATAATAATAGTCGCCGCTCTATTTC
TCGTTCCTCTCCATCACATTTTTCTAGAACTCCCTTCAATCCATTACCCTTTTCATCATCTTCTACTTTTTCCATTGTTGTATCTCCTAGTCTTTTGGGTAAACCACAGT
CACAACCTCTCCAAAAATGGCCATCCTCAACTAGATCTAACTCTAATAGACCTCAATGTCAAATTTGTGGTAAGTTTGGGCACACTGCTTTAATATGTCATCATCGTACC
AACTTAGCCTATCAAACTCCACCTCCTCAAGCTTTGTTAACTACTACCACTGCCCAACTTCCTCTTTTGGATTCAGTCTCCACTGTTTCTATGGATTCCTATCACCCCGA
TGAGAATTGGTTCCTTGATTCCGGAGCTACTCATCATATGACACCGGATGCTTCGACCCTCACTCATTCTATGCCTTACCTTGGTGGTGAACATGTCACGGTACTTCTCA
GGGGCACTCTTGAGGACGGTCTTTATCGTGTTTCCGTTGCTCCTAAGTCCATTTTTATTCCCTCTGGTGCCAATTTTCAGAAGCCTGCTGCTGTTTTCTTATCTTCAGTT
GTTGTCCCTCCTTGGCATCTTCGTTTAGGCCACCCGTCTACTGCTTTTTTGCGTCAAGTCCTATCAAATATCGATGTTTCATGTTCTTCTGTTGTTTCTGAATTCTGTTG
CCAACTCCAGTTCTTAAATCCTTATCAAGTCCTGTTTCAGAAATCCCCTGATTACACTCTTCTTCGTGCATTCGGTTGTGCTTGTTTCTCTTTTCTTCGGCCCTATAATT
CTCATAAACTTAATTTTCGTTCTAAAGAGTGTATTTTTCTGGGTTATAGTTCTAATCATAAGGGATATTTATGTCTTGATCCCGTTTCCGACCGTCTCTATTTCTCCCGT
CATGTTATATTCAATGAAACTTTCTTTTCGTGCTCCAAAACCGATTTTTATTTTGGTCATGTTAAGTCATCTGCTCCTGATGTTTTTCAATTTTCTGCTTCTCCTTTTCC
CATCATTCCATCTGTTCCTGTGTCGTCTGTTGCCCCTTTATCTCTGCCCCCCTCTTTGTCTCATCCCCCTGGCTTTACTCCCTCACCGGTTAATCATGTTCATCACCCTC
ATTCACCGTCTGCTCCTCCATCTTCGCCCGTTAACCTCTCTTTCTCTCCCCCCGTTTCTTCTCAACCCGAGGCTACAACATCTCCCACACCTGTTGTACCTTTTTTCCCT
CCTTTATCTTCGGTTTTTACATCTCCCAACAACCCTTCAGCAACCATCAATCCATCTATTCTGTCTACTAGCCCAACATCCCAGCAATCTCCATCTGTGCCACCTTCAAC
TGCTCCCTTCTCTAATATTCCAGCAACTAATATTCACCATATGGTTACCCGTGCTAAGGCTGGCATTAATAAGCCTAAGTGTTGGTTATCCGTTGCTTCTCTCCCCTCTA
AACCCTCTAATTTTAAGGCGGCTATAAAGTGTGTTGAGTGGCATGAGGCTATGCAAAAGGAGTTTGATGCCTTGGTTAGAAATCACACTTGGTCCCTTGTTCCACCATCA
CCAAATTACAAAGTAATCAGTTGTAAATGGGTTTATAAGCTTAAGCTTAAGCCAAATGGTGATATTGAAAGGTATAAAGCTCGTTTAGTTGCTCGTGGTTTTGATCAAGC
TTATGGGATAGACTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCACTCACAGATTCAGAGAGAATCTGCAAAGCCGAGTCTAGATTGGAAAGCAGCGCTGAAGCGGTATGAAAACGTTATGGCTTCAGAATCCGAGGCTGTTAAAGT
GAAAGCCACAGTCAAATTGGCCCATCTCTCCAAAAAGGCCCCCGAGAATATTTTGAACAGCACCATACCCATTATTGCTAAGCATCTTGAGGGTAACCTCAACGATAATT
CAAGCCATTACGTGAAAGAAGCTGCTGCATATTGTCTAAGATGTATATCTTGTCAAGGTGATGGTACTTTGGCGACGACGGTAGGTCATTCAGGCGCTCTACAGTCGTTG
CTGAGGTTATTGCCACATTCTAGAGGTTCTTTTTGTAAGATTTTAGTTAAATGTGTTTGGAGTATTGTCACTTTTGATAAAACTAGTCATGTGATTATAGCAAGAAATGG
GGGTTTGGAAGTTATTATTGGCATGTTTGATTCTGTAATTGATGGCACAAGGAGATATTTATTGGAGATTTTGAGTGCAATGGCACTATTGAGAGAGGTTAGGAAGGCCC
TTATCAGTTTAAGAGGACTCCCTTTTCTTATGCAAGCTGCTAGATGTGGCTGCATGGCCTCCAGAGAAAGAGCTTGCCAAGCAATTGGGTTGCTTGCAATTTCCAAGCGT
GGAAGACACATGCTTTTTGAGTTGGGAGTGGTTTCAGCGCTTATTGAACTATTTCGTGAAGGAGATTGTGTGACAAAACTGGTAGCTGGTAATGCTCTTGGAATAGTTTC
AGCTCATGTAGCCTATATTAGGCCCGTTGCACAAGCTGGGGTCATTCCTTTATTTGTTGATCTTCTTCAGTGCCCTGACCTTATTGGTAAGGAGATTGCAGAAGATGTCT
TCTTTCTCTTAGTTGTTGCTGAAGCGAATGCAGTAGTCATTTCTGATCATCTAGTGAGAATTCTTAAAGAAGGTGATGATGGAGCAAAGGTTGCAACTGCTGATGTTTTA
TGGGATCTTTCAGGCTATAAGTACTCATATCAGTTGTGCGTAGTTCGGGTGCCATTCCAGTTTTTGTGGATCTATTACTGGACGGGAATGATGAAGCCAAGGCTGTTGCA
AGATGAGTTAGAAGAACTGAAAGATAATTCCGTTGAGGCCCTTATAAATTTTTCTGAAGACCCCTTATACTGTGATAGAGTATCTGAAGCAGTTAGCACTCATGCTTTCC
AGAATATGCAGGAAAGAATGAACCATATTCATGCTACTGAAAGGAAAATGTCATCAGAAGCTTTTAATTCGTCTTCATCTTCTTCGTCTGCACCAATAATGCCACCAGTG
GTTACTCCTTCTACTCCAACTACGACTGCCCAGACCCCTCCATCGCAAGTTCCTCAGCCTGTAAATCCGAACCCAACCCCATTTTCCCCCAACCCGTATCCCACTTTACC
TCAGCCCCTTTCGGTTAAGCTCACTGATACGAATTTTCTCCTCTGGAAGAATCAACTGCTGAATGCTGTGTTAGCCAATGGTCTTCCAGGTTTTCTCGATGGCTCCATTC
CAGCCCCACCGCGGTTTCTTGATGCTCAACAGTCTCAGCCAAATCCGGATTTTATCACTTGGGAAAGGTACAATCGTTTCATTATGTGTTGGATGTATTCCTCCTTTTCT
GAAAAGAAGATGGGTGAGATAGTCACTTTAGAAACTGCCTCTGATATTTGGAACTATTTGAAACATGCTTATGATTCTAAGACTACTGCTAGGTTAATGGGTCTTAAAAC
TCAGCTTCAAAGAATTAAGAAGGATGGTCTTACTGTGAGTCAATATTTGTCTCAGATTAAAGATATTACAGATAAATTTTCTGCTATAGGTGAGCCTATCTCATATAGAG
ATCATTTGGCTCATATCCTTGATGGCCTTGGAAGTGAATACAACGCTTTTGTTACCTCAATTCAAAATCGTTCTGACAGTCCACCTTTAGAGGATGTTCACAGTCTCCTT
TTAGCTTGTGAGGCAAGATTAGAAAAGCAAAACAGTGTTGATCAACTTTCCTTGGCTCAAGCGAATCTTGCTAGTCTCAACATTCATAATAATAGTCGCCGCTCTATTTC
TCGTTCCTCTCCATCACATTTTTCTAGAACTCCCTTCAATCCATTACCCTTTTCATCATCTTCTACTTTTTCCATTGTTGTATCTCCTAGTCTTTTGGGTAAACCACAGT
CACAACCTCTCCAAAAATGGCCATCCTCAACTAGATCTAACTCTAATAGACCTCAATGTCAAATTTGTGGTAAGTTTGGGCACACTGCTTTAATATGTCATCATCGTACC
AACTTAGCCTATCAAACTCCACCTCCTCAAGCTTTGTTAACTACTACCACTGCCCAACTTCCTCTTTTGGATTCAGTCTCCACTGTTTCTATGGATTCCTATCACCCCGA
TGAGAATTGGTTCCTTGATTCCGGAGCTACTCATCATATGACACCGGATGCTTCGACCCTCACTCATTCTATGCCTTACCTTGGTGGTGAACATGTCACGGTACTTCTCA
GGGGCACTCTTGAGGACGGTCTTTATCGTGTTTCCGTTGCTCCTAAGTCCATTTTTATTCCCTCTGGTGCCAATTTTCAGAAGCCTGCTGCTGTTTTCTTATCTTCAGTT
GTTGTCCCTCCTTGGCATCTTCGTTTAGGCCACCCGTCTACTGCTTTTTTGCGTCAAGTCCTATCAAATATCGATGTTTCATGTTCTTCTGTTGTTTCTGAATTCTGTTG
CCAACTCCAGTTCTTAAATCCTTATCAAGTCCTGTTTCAGAAATCCCCTGATTACACTCTTCTTCGTGCATTCGGTTGTGCTTGTTTCTCTTTTCTTCGGCCCTATAATT
CTCATAAACTTAATTTTCGTTCTAAAGAGTGTATTTTTCTGGGTTATAGTTCTAATCATAAGGGATATTTATGTCTTGATCCCGTTTCCGACCGTCTCTATTTCTCCCGT
CATGTTATATTCAATGAAACTTTCTTTTCGTGCTCCAAAACCGATTTTTATTTTGGTCATGTTAAGTCATCTGCTCCTGATGTTTTTCAATTTTCTGCTTCTCCTTTTCC
CATCATTCCATCTGTTCCTGTGTCGTCTGTTGCCCCTTTATCTCTGCCCCCCTCTTTGTCTCATCCCCCTGGCTTTACTCCCTCACCGGTTAATCATGTTCATCACCCTC
ATTCACCGTCTGCTCCTCCATCTTCGCCCGTTAACCTCTCTTTCTCTCCCCCCGTTTCTTCTCAACCCGAGGCTACAACATCTCCCACACCTGTTGTACCTTTTTTCCCT
CCTTTATCTTCGGTTTTTACATCTCCCAACAACCCTTCAGCAACCATCAATCCATCTATTCTGTCTACTAGCCCAACATCCCAGCAATCTCCATCTGTGCCACCTTCAAC
TGCTCCCTTCTCTAATATTCCAGCAACTAATATTCACCATATGGTTACCCGTGCTAAGGCTGGCATTAATAAGCCTAAGTGTTGGTTATCCGTTGCTTCTCTCCCCTCTA
AACCCTCTAATTTTAAGGCGGCTATAAAGTGTGTTGAGTGGCATGAGGCTATGCAAAAGGAGTTTGATGCCTTGGTTAGAAATCACACTTGGTCCCTTGTTCCACCATCA
CCAAATTACAAAGTAATCAGTTGTAAATGGGTTTATAAGCTTAAGCTTAAGCCAAATGGTGATATTGAAAGGTATAAAGCTCGTTTAGTTGCTCGTGGTTTTGATCAAGC
TTATGGGATAGACTATTGA
Protein sequenceShow/hide protein sequence
MGHSQIQRESAKPSLDWKAALKRYENVMASESEAVKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLNDNSSHYVKEAAAYCLRCISCQGDGTLATTVGHSGALQSL
LRLLPHSRGSFCKILVKCVWSIVTFDKTSHVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLPFLMQAARCGCMASRERACQAIGLLAISKR
GRHMLFELGVVSALIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFFLLVVAEANAVVISDHLVRILKEGDDGAKVATADVL
WDLSGYKYSYQLCVVRVPFQFLWIYYWTGMMKPRLLQDELEELKDNSVEALINFSEDPLYCDRVSEAVSTHAFQNMQERMNHIHATERKMSSEAFNSSSSSSSAPIMPPV
VTPSTPTTTAQTPPSQVPQPVNPNPTPFSPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLPGFLDGSIPAPPRFLDAQQSQPNPDFITWERYNRFIMCWMYSSFS
EKKMGEIVTLETASDIWNYLKHAYDSKTTARLMGLKTQLQRIKKDGLTVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPPLEDVHSLL
LACEARLEKQNSVDQLSLAQANLASLNIHNNSRRSISRSSPSHFSRTPFNPLPFSSSSTFSIVVSPSLLGKPQSQPLQKWPSSTRSNSNRPQCQICGKFGHTALICHHRT
NLAYQTPPPQALLTTTTAQLPLLDSVSTVSMDSYHPDENWFLDSGATHHMTPDASTLTHSMPYLGGEHVTVLLRGTLEDGLYRVSVAPKSIFIPSGANFQKPAAVFLSSV
VVPPWHLRLGHPSTAFLRQVLSNIDVSCSSVVSEFCCQLQFLNPYQVLFQKSPDYTLLRAFGCACFSFLRPYNSHKLNFRSKECIFLGYSSNHKGYLCLDPVSDRLYFSR
HVIFNETFFSCSKTDFYFGHVKSSAPDVFQFSASPFPIIPSVPVSSVAPLSLPPSLSHPPGFTPSPVNHVHHPHSPSAPPSSPVNLSFSPPVSSQPEATTSPTPVVPFFP
PLSSVFTSPNNPSATINPSILSTSPTSQQSPSVPPSTAPFSNIPATNIHHMVTRAKAGINKPKCWLSVASLPSKPSNFKAAIKCVEWHEAMQKEFDALVRNHTWSLVPPS
PNYKVISCKWVYKLKLKPNGDIERYKARLVARGFDQAYGIDY