; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022155 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022155
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionhAT transposon superfamily protein
Genome locationchr7:19823753..19829645
RNA-Seq ExpressionLag0022155
SyntenyLag0022155
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]2.6e-9465.41Show/hide
Query:  KPLLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKD
        +PL GAINYTSWSRAM M ISG+NK GFI GKI KP  +G LL+AW CNN+I+ASWILNSVSKEIAASI+Y GS+K +WDELR+RFKQ+NG SIYQLRK+
Subjt:  KPLLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKD

Query:  LVTLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELP
         VTLRQG++++ETYYTKLKT+WQ+L+++R T  CTCGGLK F+DHL+SEY+M FLMGLN+SY+A+RAQILLM+P+PSI   FSLLIQEE+QRS  IL  P
Subjt:  LVTLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELP

Query:  ADLVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADT
         D V L ++ S      +DR+R K+    RP C++C IKGH  D+CYK H YPPGYK  +SN+  T
Subjt:  ADLVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADT

KAA8523936.1 hypothetical protein F0562_010359 [Nyssa sinensis]6.7e-6647.99Show/hide
Query:  MRMVISGKNKLGFIIGKISKPQ-EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQGSMSVETY
        M + +  KNKLGF+ G I +PQ  +  L  +W  NNNIV SWILNSVSKEI+ASI++  S + +W +LR+RF+Q NG  I+QL+++L+ LRQ   SV  Y
Subjt:  MRMVISGKNKLGFIIGKISKPQ-EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQGSMSVETY

Query:  YTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPAD---------
        +TKLKT+W++LS+ RP  S   C+CGG+K+  DH   EY+M+FLMGL++S+S +R Q+LLM P+P I + FSL++QEE+QR T+     ++         
Subjt:  YTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPAD---------

Query:  LVDLVVSDSS--KKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS
         +D+  SD S  +    S+ S  K+ +  +P+CTHC I+GHTVDRCYK+H YPPGYKF S+NN +  A+ V++
Subjt:  LVDLVVSDSS--KKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS

KAA8536734.1 hypothetical protein F0562_029212 [Nyssa sinensis]1.1e-6848.04Show/hide
Query:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQ-EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ
        NYT+WSRAM + +S KNKLGF+ G I +PQ  +  LL++W  NNNIV SWILNS+SKEI+ASI++    + +W +LR+RF+Q NG  I+QL+++L+ LRQ
Subjt:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQ-EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ

Query:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL
           SV  Y+TK+KT+W++LS++RP  S   C CGG+K+  D+  +EY+M+FLMGL++S+S +  Q+LLM  +P I + FSL++QEE+QR    S+D    
Subjt:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL

Query:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS
           +  +V +D        S+    S+ S  K+ +  RP+CTHC I GHTVDRCYK+H YPPGYKF S+NN++  A  V++
Subjt:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS

KAA8543184.1 hypothetical protein F0562_021321 [Nyssa sinensis]7.2e-6848.75Show/hide
Query:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEEG-PLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ
        NYT+WSRAM + +S KNKLGF+ G I +PQ  G  LL +W  NNNIV SWILNSVSKEI+ASI++  S + +W +LR+RF+Q N   I+QL+++L+ L Q
Subjt:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEEG-PLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ

Query:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL
           SV  Y+TKLKT+W++LS++R   S   C+CGG+K+  DH   EY+M+FLMGL++S+S +R Q+LLM P+P I + FSL++QEE+QR    S+D    
Subjt:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL

Query:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS
           +   V +D        S+    S+ S  K+ +  R +C HC I GHTVDRCYK+H YPPGYKF S+NN +  A+ V++
Subjt:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]2.1e-8361.18Show/hide
Query:  LLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLV
        LLGA NY SW R+M + +SGKNK+GFI G I KP   G LL AW+CNN+I+ SWI+NSVSKEIAASI+YTGS K +WDEL+ERF+Q++   I+QLRK+LV
Subjt:  LLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLV

Query:  TLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPAD
        T  QG++S+E YYTKLKTVWQ+L+D+RPT  CTC GLKS  +   SEYVMTFLMGLNESY+ IRAQILLM PIP + K FSLLIQEERQR+   +  P  
Subjt:  TLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPAD

Query:  LVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYK
         + + V++ SK+ S + + R KD    R  CTHC ++GH +D+CYKLH YPPGY+
Subjt:  LVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYK

TrEMBL top hitse value%identityAlignment
A0A2N9HKE6 Uncharacterized protein1.3e-6750.98Show/hide
Query:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEE-GPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ
        NY +WSR+M M ++ KNK+GF+ G I +PQ+E  P   AW   N +V SW+LNS+SKEIA+S++Y  + K +W++LRERF Q NG  I++++K +  L Q
Subjt:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEE-GPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ

Query:  GSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILEL--PADLV
         + SV +YYT+LK++W +LS+ RP   C+CG +K  LD+   EYVM FLMGLN+S+S +RAQIL+  P+PSITK F+L+IQEERQR+ +I  L   AD V
Subjt:  GSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILEL--PADLV

Query:  DLVVSDSSKKYS-PSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKF
         L     + +++   ++S  KD    RP C+HC I GHTVD+CYKLH YPPGYKF
Subjt:  DLVVSDSSKKYS-PSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKF

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 81.3e-9465.41Show/hide
Query:  KPLLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKD
        +PL GAINYTSWSRAM M ISG+NK GFI GKI KP  +G LL+AW CNN+I+ASWILNSVSKEIAASI+Y GS+K +WDELR+RFKQ+NG SIYQLRK+
Subjt:  KPLLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKD

Query:  LVTLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELP
         VTLRQG++++ETYYTKLKT+WQ+L+++R T  CTCGGLK F+DHL+SEY+M FLMGLN+SY+A+RAQILLM+P+PSI   FSLLIQEE+QRS  IL  P
Subjt:  LVTLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELP

Query:  ADLVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADT
         D V L ++ S      +DR+R K+    RP C++C IKGH  D+CYK H YPPGYK  +SN+  T
Subjt:  ADLVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADT

A0A5J5B2C5 Uncharacterized protein5.4e-6948.04Show/hide
Query:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQ-EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ
        NYT+WSRAM + +S KNKLGF+ G I +PQ  +  LL++W  NNNIV SWILNS+SKEI+ASI++    + +W +LR+RF+Q NG  I+QL+++L+ LRQ
Subjt:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQ-EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ

Query:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL
           SV  Y+TK+KT+W++LS++RP  S   C CGG+K+  D+  +EY+M+FLMGL++S+S +  Q+LLM  +P I + FSL++QEE+QR    S+D    
Subjt:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL

Query:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS
           +  +V +D        S+    S+ S  K+ +  RP+CTHC I GHTVDRCYK+H YPPGYKF S+NN++  A  V++
Subjt:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS

A0A5J5BKC2 Uncharacterized protein3.5e-6848.75Show/hide
Query:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEEG-PLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ
        NYT+WSRAM + +S KNKLGF+ G I +PQ  G  LL +W  NNNIV SWILNSVSKEI+ASI++  S + +W +LR+RF+Q N   I+QL+++L+ L Q
Subjt:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEEG-PLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQ

Query:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL
           SV  Y+TKLKT+W++LS++R   S   C+CGG+K+  DH   EY+M+FLMGL++S+S +R Q+LLM P+P I + FSL++QEE+QR    S+D    
Subjt:  GSMSVETYYTKLKTVWQDLSDHRPTTS---CTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQR----STDILEL

Query:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS
           +   V +D        S+    S+ S  K+ +  R +C HC I GHTVDRCYK+H YPPGYKF S+NN +  A+ V++
Subjt:  PADLVDLVVSD-------SSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYKFPSSNNADTLANCVTS

A0A6J1CXR2 uncharacterized protein LOC1110152391.0e-8361.18Show/hide
Query:  LLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLV
        LLGA NY SW R+M + +SGKNK+GFI G I KP   G LL AW+CNN+I+ SWI+NSVSKEIAASI+YTGS K +WDEL+ERF+Q++   I+QLRK+LV
Subjt:  LLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLV

Query:  TLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPAD
        T  QG++S+E YYTKLKTVWQ+L+D+RPT  CTC GLKS  +   SEYVMTFLMGLNESY+ IRAQILLM PIP + K FSLLIQEERQR+   +  P  
Subjt:  TLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPAD

Query:  LVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYK
         + + V++ SK+ S + + R KD    R  CTHC ++GH +D+CYKLH YPPGY+
Subjt:  LVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.2e-3035.98Show/hide
Query:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQG
        NY +W    R  +    K GFI G + KP    PL + WE  N +V  W++NS++ ++  S++Y  +   +W++LR  F       IYQLR+ L TLRQG
Subjt:  NYTSWSRAMRMVISGKNKLGFIIGKISKPQEEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQG

Query:  SMSVETYYTKLKTVWQDLSDHRPTTSCTCGG-----LKSFLDHLDSEYVMTFLMG--LNESYSAIRAQILLMKPIPSITKTFSLLIQEE
          SVE Y+ KL  VW +LS++ P   C CGG      K   +  + E    FLMG  LN+ + A+  +I+  KP PS+ + F+++   E
Subjt:  SMSVETYYTKLKTVWQDLSDHRPTTSCTCGG-----LKSFLDHLDSEYVMTFLMG--LNESYSAIRAQILLMKPIPSITKTFSLLIQEE

AT3G13030.1 hAT transposon superfamily protein9.5e-1838.54Show/hide
Query:  MGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLF
        +GY+Y+ MD  KE I ++FN   + +KP+WD+ID  W   LH PLHAA Y+LNP   Y+  F+ D E+  G   ++  M     ++ K+  Q+D++
Subjt:  MGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLF

AT3G13030.1 hAT transposon superfamily protein1.4e-0533.78Show/hide
Query:  DGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYV-KAGEM
        D W D K + +  F+ + P G V+L S D S        L  L++ ++EE+G  NV Q++  S S +V + GE+
Subjt:  DGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYV-KAGEM

AT3G13030.2 hAT transposon superfamily protein9.5e-1838.54Show/hide
Query:  MGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLF
        +GY+Y+ MD  KE I ++FN   + +KP+WD+ID  W   LH PLHAA Y+LNP   Y+  F+ D E+  G   ++  M     ++ K+  Q+D++
Subjt:  MGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLF

AT3G13030.2 hAT transposon superfamily protein1.4e-0533.78Show/hide
Query:  DGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYV-KAGEM
        D W D K + +  F+ + P G V+L S D S        L  L++ ++EE+G  NV Q++  S S +V + GE+
Subjt:  DGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYV-KAGEM

AT3G13030.3 hAT transposon superfamily protein9.5e-1838.54Show/hide
Query:  MGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLF
        +GY+Y+ MD  KE I ++FN   + +KP+WD+ID  W   LH PLHAA Y+LNP   Y+  F+ D E+  G   ++  M     ++ K+  Q+D++
Subjt:  MGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLF

AT3G13030.3 hAT transposon superfamily protein1.4e-0533.78Show/hide
Query:  DGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYV-KAGEM
        D W D K + +  F+ + P G V+L S D S        L  L++ ++EE+G  NV Q++  S S +V + GE+
Subjt:  DGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYV-KAGEM

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related2.1e-2544.44Show/hide
Query:  LKVLRLVDGDARPAMGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADY-EIKMGFYQTIQRMCPSHVL
        ++VLR+VDG+ +P MGYIY AMD+AKE I K F   ++++K  ++IID+RW +QLHRPLHAA YYLNP FHY    +  Y E+  GF   + R+ P    
Subjt:  LKVLRLVDGDARPAMGYIYEAMDRAKEHIEKKFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADY-EIKMGFYQTIQRMCPSHVL

Query:  REKVDKQLDLFHNAEKKCVQREGIVWNNKESDGEW
        ++K+  +LD F  A         I    K S  EW
Subjt:  REKVDKQLDLFHNAEKKCVQREGIVWNNKESDGEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATGGTTGGACAGATGGGAAGAATCAATCCATTACAAACTTTTTGGTAAATAGTCCTAGAGGCACTGTTTTTCTAAAAAGTGTGGATACATCTAGAGTCTACAA
AAGCGCAGATAATTTGTTTGAACTACTAGATTTAGTGATAGAAGAAATTGGTGAAAATAATGTAGTTCAAGTGTTAACAGATAGTGCATCTGCTTATGTCAAAGCTGGAG
AAATGTTAATGGATAAGCTAAAAGTTTTAAGACTTGTTGATGGTGATGCAAGACCTGCAATGGGATACATATATGAAGCAATGGACAGGGCAAAGGAACATATTGAGAAA
AAATTCAACGGAATTCAGAAACATTTTAAGCCTATTTGGGATATTATTGATAAAAGATGGGCAATGCAACTTCACAGGCCTCTTCATGCAGCAACATATTATCTTAACCC
AAGGTTTCATTATGCACCTGAATTCAATGCTGACTATGAAATCAAGATGGGGTTTTATCAAACTATTCAAAGAATGTGTCCTTCACATGTGCTCAGAGAAAAAGTTGACA
AGCAACTAGATCTATTCCATAATGCAGAGAAAAAGTGCGTTCAAAGAGAAGGAATCGTTTGGAACAACAAAGAATCTGATGGTGAATGGATTACAGAGAAGGAAGATCCA
ACATTATCAAATCACACCTCGTGGATAAATGTAAATGAGTGTTTCGACGTTCAAGAAATAGAGAGTAGCAAAAAGAGGAAGAGAGAATACAACGAGGAGGATGAAGAATT
AGATGAATCGTACAAAGAATCTAAAGCAGATGAATTTGAGGAAGAAGATGATTATGATGATGGCGTTGCAGGAGATGGAGTTGAGGAAGATGGTGGGTTAGATGGACGCA
ATGAAATCGAGGAAGATGTTACTCCTCTTGCTTCCAGAATGGATGGTGCTTCTAGCCAACAATTAATCACATCAGGACATGCGAAAAGATTATGGGGCACTATTAATTAT
GAGAAAGCATGTCAGGCCAAAGAGGAGATACTAGCTTTTGTTAGAGGATCACTAGAAGAGTCATTTGGTCACTTACATGCATATGGTGAAGCACTAAAGATAATGAACCC
TGGGACCGTATATGATATAAAGCTCGAGGATGGAAAGTATTTTAAATACGTGTTCATGGCACTAGGACAATGGCAAGTTTTGTGGTTATTGTGGTTTCATTGTTGGATTA
AGCCGTTGCTTGGAGCAATCAACTATACTTCATGGAGTCGTGCAATGCGTATGGTGATTTCTGGTAAAAACAAACTTGGTTTCATTATTGGAAAAATCTCCAAACCACAG
GAGGAAGGGCCTCTGCTCGAAGCTTGGGAATGCAACAACAATATTGTTGCTTCTTGGATCCTCAACTCAGTATCGAAGGAAATCGCTGCAAGTATTGTCTACACAGGCTC
TGTCAAAGCTGTCTGGGATGAATTAAGAGAACGATTCAAACAAGCGAATGGTGCCAGTATCTATCAATTGAGGAAGGACTTGGTTACTTTACGCCAAGGATCGATGTCTG
TGGAAACATATTACACCAAATTGAAGACAGTCTGGCAAGATCTCAGCGATCATCGCCCTACCACCAGTTGTACCTGTGGAGGTTTGAAGTCGTTCCTCGATCATCTTGAT
TCTGAATATGTGATGACATTTCTCATGGGATTAAATGAGTCCTACTCTGCAATTAGGGCTCAAATCCTCCTTATGAAGCCAATTCCTTCGATTACTAAAACTTTCTCATT
GTTGATTCAAGAAGAAAGGCAAAGATCCACCGATATTCTTGAATTGCCTGCCGATCTGGTTGACCTTGTTGTCAGTGATTCTTCTAAGAAGTATTCTCCTTCGGATCGTT
CTCGAATGAAAGATACTCAACATAAACGACCTCATTGTACTCACTGTAACATCAAAGGGCACACTGTTGATCGTTGCTATAAGTTGCACGACTATCCTCCAGGTTATAAA
TTTCCTTCCTCTAATAATGCCGACACCTTGGCGAACTGTGTTACTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGATGGTTGGACAGATGGGAAGAATCAATCCATTACAAACTTTTTGGTAAATAGTCCTAGAGGCACTGTTTTTCTAAAAAGTGTGGATACATCTAGAGTCTACAA
AAGCGCAGATAATTTGTTTGAACTACTAGATTTAGTGATAGAAGAAATTGGTGAAAATAATGTAGTTCAAGTGTTAACAGATAGTGCATCTGCTTATGTCAAAGCTGGAG
AAATGTTAATGGATAAGCTAAAAGTTTTAAGACTTGTTGATGGTGATGCAAGACCTGCAATGGGATACATATATGAAGCAATGGACAGGGCAAAGGAACATATTGAGAAA
AAATTCAACGGAATTCAGAAACATTTTAAGCCTATTTGGGATATTATTGATAAAAGATGGGCAATGCAACTTCACAGGCCTCTTCATGCAGCAACATATTATCTTAACCC
AAGGTTTCATTATGCACCTGAATTCAATGCTGACTATGAAATCAAGATGGGGTTTTATCAAACTATTCAAAGAATGTGTCCTTCACATGTGCTCAGAGAAAAAGTTGACA
AGCAACTAGATCTATTCCATAATGCAGAGAAAAAGTGCGTTCAAAGAGAAGGAATCGTTTGGAACAACAAAGAATCTGATGGTGAATGGATTACAGAGAAGGAAGATCCA
ACATTATCAAATCACACCTCGTGGATAAATGTAAATGAGTGTTTCGACGTTCAAGAAATAGAGAGTAGCAAAAAGAGGAAGAGAGAATACAACGAGGAGGATGAAGAATT
AGATGAATCGTACAAAGAATCTAAAGCAGATGAATTTGAGGAAGAAGATGATTATGATGATGGCGTTGCAGGAGATGGAGTTGAGGAAGATGGTGGGTTAGATGGACGCA
ATGAAATCGAGGAAGATGTTACTCCTCTTGCTTCCAGAATGGATGGTGCTTCTAGCCAACAATTAATCACATCAGGACATGCGAAAAGATTATGGGGCACTATTAATTAT
GAGAAAGCATGTCAGGCCAAAGAGGAGATACTAGCTTTTGTTAGAGGATCACTAGAAGAGTCATTTGGTCACTTACATGCATATGGTGAAGCACTAAAGATAATGAACCC
TGGGACCGTATATGATATAAAGCTCGAGGATGGAAAGTATTTTAAATACGTGTTCATGGCACTAGGACAATGGCAAGTTTTGTGGTTATTGTGGTTTCATTGTTGGATTA
AGCCGTTGCTTGGAGCAATCAACTATACTTCATGGAGTCGTGCAATGCGTATGGTGATTTCTGGTAAAAACAAACTTGGTTTCATTATTGGAAAAATCTCCAAACCACAG
GAGGAAGGGCCTCTGCTCGAAGCTTGGGAATGCAACAACAATATTGTTGCTTCTTGGATCCTCAACTCAGTATCGAAGGAAATCGCTGCAAGTATTGTCTACACAGGCTC
TGTCAAAGCTGTCTGGGATGAATTAAGAGAACGATTCAAACAAGCGAATGGTGCCAGTATCTATCAATTGAGGAAGGACTTGGTTACTTTACGCCAAGGATCGATGTCTG
TGGAAACATATTACACCAAATTGAAGACAGTCTGGCAAGATCTCAGCGATCATCGCCCTACCACCAGTTGTACCTGTGGAGGTTTGAAGTCGTTCCTCGATCATCTTGAT
TCTGAATATGTGATGACATTTCTCATGGGATTAAATGAGTCCTACTCTGCAATTAGGGCTCAAATCCTCCTTATGAAGCCAATTCCTTCGATTACTAAAACTTTCTCATT
GTTGATTCAAGAAGAAAGGCAAAGATCCACCGATATTCTTGAATTGCCTGCCGATCTGGTTGACCTTGTTGTCAGTGATTCTTCTAAGAAGTATTCTCCTTCGGATCGTT
CTCGAATGAAAGATACTCAACATAAACGACCTCATTGTACTCACTGTAACATCAAAGGGCACACTGTTGATCGTTGCTATAAGTTGCACGACTATCCTCCAGGTTATAAA
TTTCCTTCCTCTAATAATGCCGACACCTTGGCGAACTGTGTTACTTCATAA
Protein sequenceShow/hide protein sequence
MSDGWTDGKNQSITNFLVNSPRGTVFLKSVDTSRVYKSADNLFELLDLVIEEIGENNVVQVLTDSASAYVKAGEMLMDKLKVLRLVDGDARPAMGYIYEAMDRAKEHIEK
KFNGIQKHFKPIWDIIDKRWAMQLHRPLHAATYYLNPRFHYAPEFNADYEIKMGFYQTIQRMCPSHVLREKVDKQLDLFHNAEKKCVQREGIVWNNKESDGEWITEKEDP
TLSNHTSWINVNECFDVQEIESSKKRKREYNEEDEELDESYKESKADEFEEEDDYDDGVAGDGVEEDGGLDGRNEIEEDVTPLASRMDGASSQQLITSGHAKRLWGTINY
EKACQAKEEILAFVRGSLEESFGHLHAYGEALKIMNPGTVYDIKLEDGKYFKYVFMALGQWQVLWLLWFHCWIKPLLGAINYTSWSRAMRMVISGKNKLGFIIGKISKPQ
EEGPLLEAWECNNNIVASWILNSVSKEIAASIVYTGSVKAVWDELRERFKQANGASIYQLRKDLVTLRQGSMSVETYYTKLKTVWQDLSDHRPTTSCTCGGLKSFLDHLD
SEYVMTFLMGLNESYSAIRAQILLMKPIPSITKTFSLLIQEERQRSTDILELPADLVDLVVSDSSKKYSPSDRSRMKDTQHKRPHCTHCNIKGHTVDRCYKLHDYPPGYK
FPSSNNADTLANCVTS