CuGenDBv2

Lag0021928 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021928
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr7:14076493..14086523
RNA-Seq Expression: Lag0021928
Synteny: Lag0021928
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
BBN67916.1 transposable element gene [Prunus dulcis] (e-value 1.0e-87; 69.17% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML S+FDMKDLG ADVIL  +I RN  GYIL+QSHY E  LRKF QF+ K AVTPF+ +C LKKN+G+ +S  E S+V+GSLMY+MN TRPD  YSV RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYTSNPG+DHW ALI+VLRYLKYT +YGLHY + P VLEG+SD NWISDS ETKSTSGY+FTLGGAA+SWKSSKQTCI RSTMESKF+ALDKA EEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLKHGSRYSKAFWAELRQPRAQGIERKRKS
        LRNFLEDIP+W KH +       +   Q RA+    KRKS
Subjt:  LRNFLEDIPLWLKHGSRYSKAFWAELRQPRAQGIERKRKS

KAB2605475.1 hypothetical protein D8674_005192 [Pyrus ussuriensis x Pyrus communis] (e-value 1.1e-86; 73.71% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        +LKS+FDMKDLGLADVIL ++I RN  GYIL+QSHYIE  LRK+   ESK A+TPFD + KLKKN G+ +S  E S+V+GSLMYIMNCTRPD  YSV RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYT NPGHDHW ALIRVLRYLK+T+NYGLHY R P VLEGF+D NWISDS +TKSTSGY+FTLGGAA+SWKSSKQTCI RSTME +FIALD A  EAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LR+FLED+P+W K
Subjt:  LRNFLEDIPLWLK

KAG7552374.1 Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa] (e-value 3.7e-85; 70.42% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML  NF+MKD+GLADVIL IRI R P+GY L+QSHY+EK+L+KF  ++ +  VTPFDP+CKL KN GE VS LE ++++GS+MYI NCTRPD  YS+ RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RY SNP  +HWNAL+RVLRYLK+T+NYGL+Y + P VLEG+SD NWIS S ++KSTSGY+FTLGG AVSWKSSKQTCI RSTMES+FIALD A EEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LRNFLEDIPLW K
Subjt:  LRNFLEDIPLWLK

KAG7583533.1 hypothetical protein ISN44_As08g030420 [Arabidopsis suecica] (e-value 1.7e-85; 70.89% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML  NF+MKD+GLADVIL IRI R P+GY L+QSHY+EK+L+KF  F+ +  VTPFDP+CKL KN GE VS LE ++++GS+MYI NCTRPD  YS+ RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RY SNP  +HWNAL+RVLRYLK+T+NYGL+Y + P VLEG+SD NWIS S ++KSTSGY+FTLGG AVSWKSSKQTCI RSTMES+FIALD A EEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LRNFLEDIPLW K
Subjt:  LRNFLEDIPLWLK

KAG7585377.1 Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa] (e-value 1.7e-85; 70.89% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML  NF+MKD+GLADVIL IRI R P+GY L+QSHY+EK+L+KF  F+ +  VTPFDP+CKL KN GE VS LE ++++GS+MYI NCTRPD  YS+ RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RY SNP  +HWNAL+RVLRYLK+T+NYGL+Y + P VLEG+SD NWIS S ++KSTSGY+FTLGG AVSWKSSKQTCI RSTMES+FIALD A EEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LRNFLEDIPLW K
Subjt:  LRNFLEDIPLWLK

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9F5X3 Integrase catalytic domain-containing protein (e-value 1.8e-85; 71.36% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML S FDMKDLG+ADVIL I+ITR   G +LSQSHYI+K+L KF +++     TP D +  L KN G G+S LE S+++GSLMYIMNCTRPD  YSV +L
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYTSNPG DHW A++RVLRYLKYTLNYG+HY R P VLEG+SD NWISD+ +TKSTSGY+FTLGGAAVSWKSSKQTCI RSTMES+FIALDKAGEEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LR+FLED+P+W K
Subjt:  LRNFLEDIPLWLK

A0A2N9FSE3 Reverse transcriptase Ty1/copia-type domain-containing protein (e-value 1.8e-85; 70.42% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML S FDMKDLG+ADVIL I+ITR   G++LSQ+HYI+K+L KF +++     TP D +  L KN G G+S LE S+++GSLMYIMNCTRPD  YS+ +L
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYTSNPG DHW A++RVLRYLKYTLNYG+HY R P VLEG+SD NWISD+ +TKSTSGY+FTLGGAAVSWKSSKQTCI RSTMES+FIALDKAGEEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LR+FLED+P+W K
Subjt:  LRNFLEDIPLWLK

A0A2N9GUH4 Integrase catalytic domain-containing protein (e-value 2.3e-85; 71.36% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML S FDMKDLG+ADVIL I+ITR   G +LSQSHYI+K+L KF +++     TP D +  L KN G G+S LE S+++GSLMYIMNCTRPD  YSV +L
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYTSNPG DHW A++RVLRYLKYTLNYG+HY R P VLEG+SD NWISD+ +TKSTSGY+FTLGGAAVSWKSSKQTCI RSTMES+FIALDKAGEEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LR+FLED+P+W K
Subjt:  LRNFLEDIPLWLK

A0A5H2XYS2 Transposable element protein (e-value 5.0e-88; 69.17% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        ML S+FDMKDLG ADVIL  +I RN  GYIL+QSHY E  LRKF QF+ K AVTPF+ +C LKKN+G+ +S  E S+V+GSLMY+MN TRPD  YSV RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYTSNPG+DHW ALI+VLRYLKYT +YGLHY + P VLEG+SD NWISDS ETKSTSGY+FTLGGAA+SWKSSKQTCI RSTMESKF+ALDKA EEAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLKHGSRYSKAFWAELRQPRAQGIERKRKS
        LRNFLEDIP+W KH +       +   Q RA+    KRKS
Subjt:  LRNFLEDIPLWLKHGSRYSKAFWAELRQPRAQGIERKRKS

A0A5N5FQV0 Reverse transcriptase Ty1/copia-type domain-containing protein (e-value 5.5e-87; 73.71% identity)
Query:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        +LKS+FDMKDLGLADVIL ++I RN  GYIL+QSHYIE  LRK+   ESK A+TPFD + KLKKN G+ +S  E S+V+GSLMYIMNCTRPD  YSV RL
Subjt:  MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +RYT NPGHDHW ALIRVLRYLK+T+NYGLHY R P VLEGF+D NWISDS +TKSTSGY+FTLGGAA+SWKSSKQTCI RSTME +FIALD A  EAEW
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW

Query:  LRNFLEDIPLWLK
        LR+FLED+P+W K
Subjt:  LRNFLEDIPLWLK

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value 3.4e-25; 32.26% identity)
Query:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRLA
        L   F M DL      + IRI        LSQS Y++K+L KF+        TP       +  + +   +     ++G LMYIM CTRPD T +V  L+
Subjt:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRLA

Query:  RYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRD---PHVLEGFSDPNWISDSMETKSTSGYIFTL-GGAAVSWKSSKQTCITRSTMESKFIALDKAGEE
        RY+S    + W  L RVLRYLK T++  L + ++    + + G+ D +W    ++ KST+GY+F +     + W + +Q  +  S+ E++++AL +A  E
Subjt:  RYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRD---PHVLEGFSDPNWISDSMETKSTSGYIFTL-GGAAVSWKSSKQTCITRSTMESKFIALDKAGEE

Query:  AEWLRNFLEDIPLWLKH
        A WL+  L  I + L++
Subjt:  AEWLRNFLEDIPLWLKH

P0CV72 Secreted RxLR effector protein 161 (e-value 7.6e-25; 46.77% identity)
Query:  VGSLMYIMNCTRPDTTYSVGRLARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMR-DPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQT
        VG++MY+M  TRPD   +VG L+++ S+P   HW AL RVLRYL+ T  YGL + R     L G+SD +W  D    +STSGY+F L G  VSW+S KQ 
Subjt:  VGSLMYIMNCTRPDTTYSVGRLARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMR-DPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQT

Query:  CITRSTMESKFIALDKAGEEAEWL
         +  S+ E +++AL +A +EA WL
Subjt:  CITRSTMESKFIALDKAGEEAEWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 5.1e-37; 38.46% identity)
Query:  LKSNFDMKDLGLADVILEIRITRNPSG--YILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKK-------NSGEGVSSLENSRVVGSLMYIMNCTRPD
        L  +FDMKDLG A  IL ++I R  +     LSQ  YIE++L +F+   +K   TP     KL K            ++ +  S  VGSLMY M CTRPD
Subjt:  LKSNFDMKDLGLADVILEIRITRNPSG--YILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKK-------NSGEGVSSLENSRVVGSLMYIMNCTRPD

Query:  TTYSVGRLARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALD
          ++VG ++R+  NPG +HW A+  +LRYL+ T    L +     +L+G++D +   D    KS++GY+FT  G A+SW+S  Q C+  ST E+++IA  
Subjt:  TTYSVGRLARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALD

Query:  KAGEEAEWLRNFLEDIPLWLK
        + G+E  WL+ FL+++ L  K
Subjt:  KAGEEAEWLRNFLEDIPLWLK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value 8.1e-27; 34.74% identity)
Query:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSS-LENSRVVGSLMYIMNCTRPDTTYSVGRL
        L   F +KD       L I   R P+G  LSQ  YI  +L + +   +K   TP  PS KL   SG  ++   E   +VGSL Y+   TRPD +Y+V RL
Subjt:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSS-LENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAE
        +++   P  +H  AL R+LRYL  T N+G+   +   + L  +SD +W  D  +  ST+GYI  LG   +SW S KQ  + RS+ E+++ ++     E +
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAE

Query:  WLRNFLEDIPLWL
        W+ + L ++ + L
Subjt:  WLRNFLEDIPLWL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value 2.5e-28; 35.35% identity)
Query:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGV-SSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        L   F +K+       L I   R P G  LSQ  Y   +L + +   +K   TP   S KL  +SG  +    E   +VGSL Y+   TRPD +Y+V RL
Subjt:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGV-SSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAE
        ++Y   P  DHWNAL RVLRYL  T ++G+   +   + L  +SD +W  D+ +  ST+GYI  LG   +SW S KQ  + RS+ E+++ ++     E +
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAE

Query:  WLRNFLEDIPLWLKH
        W+ + L ++ + L H
Subjt:  WLRNFLEDIPLWLKH

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value 4.1e-26; 32.7% identity)
Query:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNS-GEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL
        LKS F ++DLG     L + I R+ +G  + Q  Y   +L +      K +  P DPS     +S G+ V +    R++G LMY +  TR D +++V +L
Subjt:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNS-GEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRL

Query:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAE
        ++++  P   H  A++++L Y+K T+  GL Y     + L+ FSD ++ S     +ST+GY   LG + +SWKS KQ  +++S+ E+++ AL  A +E  
Subjt:  ARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAE

Query:  WLRNFLEDIPL
        WL  F  ++ L
Subjt:  WLRNFLEDIPL

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e-value 3.1e-05; 32.89% identity)
Query:  MNCTRPDTTYSVGRLARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGY
        +  TRPD T++V RL++++S        A+ +VL Y+K T+  GL Y     + L+ F+D +W S     +S +G+
Subjt:  MNCTRPDTTYSVGRLARYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value 1.2e-22; 31% identity)
Query:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRLA
        L S F MKDLG     L I+I  +PSG  LSQ+ Y E++L      + K   TP         ++ +     +   +VG+L Y +  TRPD +Y+V  + 
Subjt:  LKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRLA

Query:  RYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW
        +    P    ++ L RVLRY+K T+ +GL+  ++  + ++ F D +W   +   +ST+G+   LG   +SW + +Q  ++RS+ E+++ AL     E  W
Subjt:  RYTSNPGHDHWNALIRVLRYLKYTLNYGLHYMRDPHV-LEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEW


Sequences
CDS sequence
ATGTTGAAATCCAACTTTGATATGAAAGACCTTGGTCTTGCTGATGTTATACTTGAAATTCGTATAACTAGAAATCCAAGTGGATACATACTTTCTCAATCTCACTACAT
AGAGAAAATGTTGAGAAAATTTGATCAATTTGAGAGTAAACTTGCAGTTACTCCATTTGATCCAAGTTGTAAGTTAAAGAAGAATAGTGGTGAAGGTGTGTCTTCTCTAG
AAAACTCTAGAGTTGTTGGTAGCTTAATGTACATAATGAACTGTACAAGACCTGATACAACCTATTCAGTAGGAAGGTTAGCAAGATACACCAGCAATCCTGGGCATGAT
CATTGGAATGCTTTAATTAGAGTTCTAAGGTACTTGAAGTACACTCTGAACTATGGATTGCATTATATGAGGGATCCACATGTACTGGAGGGATTTAGTGATCCTAATTG
GATTTCTGATAGCATGGAGACTAAATCTACCAGTGGGTATATTTTTACCTTAGGTGGAGCAGCTGTATCATGGAAATCTTCAAAACAGACGTGTATAACGCGTTCAACTA
TGGAGTCAAAGTTTATAGCTCTTGACAAAGCTGGAGAAGAAGCTGAATGGCTTCGAAATTTTCTTGAAGATATTCCATTATGGTTGAAGCATGGTTCTAGGTATTCCAAG
GCGTTTTGGGCTGAACTGAGGCAGCCGAGGGCACAAGGGATTGAACGAAAGCGGAAGAGCTCGACCCGCACGAGCAGGCCGAGTGGTCGGTCGAGCAAGGGGTCGGGCCA
AAAGCCCAATCCCCTTAGCTTTGGCCCGTTCTTCCTCTGGATTTCGTTTCCCGACTATCTCCTCGGGTGCTTTTTCTCACTTAGCCATTTATGTGGCGCACTTGGTAAGA
CCATAGGGTCGGTCTCGGCCACACTCCATGTTATTGAAAATATTGAGTTCCTACATCAAAATATTAGTTTCTACAACAGAAAAGTGAGTGAAGAGGATAGGGTCGACCTC
GACCTTAATCCTCCATTAGCTCAAATTGTAGCTCCCTGCACTCACACATTAGTATTTGCAAAAGAAAAAACGAGGCCTGATCGACCTCGATCTTCTTCCTATTGGGAATA
TGGCAATGCCACTCCTCCAAGGTCATTCCTGCAAGTTAGTTACATCGGGGAGTATCGACGTTGCCACTCCACCGAGTACCTTGGGAGTATAGGCGTTGCCACTCCTACAA
GGACTTTTCCACAAATTGCTTACCTCGATGAGTATAGGCGTTGCCACTCCACCGAATACCCTATGAGTATAGGCGTTGTCACTCCTATAAGAACTTTTTCACAGATTGCT
TACCTTGGGGAGTATAGGCGTTGCCACTCCACCGAGTACCCTAGGAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACAGATTGCTTACCTTAGGGAGTATAGGCA
TTGCCACTCCACCGAATACCCTAGAAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACAGATTGTTTACCTCGGGGAGTATGGGCGTTGCCACTCCACCGAGTACC
CAAGGAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACAGATTGCTAACCTCGGGGAGTATAGGCGCTGCCACTCCACCGAGTACCCTAGGAGTATAGGCGTTGCC
ATTCCTACAAGGACTTTTTCACAGATTTCGTACCTCGGGGAGTATAGGCGTTGCCACTCCACCGAGTACCTTAGGAGTATAGGCGTTGCCACTCCTAGAAGGACTTTTTC
ACAGATTGCTTACCTCGGGGAGTATAGGCGTTGCCACTCCACCGAGTACCCTAGGAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACATATTGTTTACCTCGGGG
AGTATAGGCGTTGCCACTCCACCGAGCACCTTGGGAGTATAGGCGTTGCCACTCCTACAAGAAATTTTTTACAAGTCGATACCTCGGGGAGTATAGGCATTGCCACTCCA
CCGACTCTTCAACGAGACTTTTACGCTGATTCGTCCTCCAAAGATGATCCTGATCCCTCGTGGGACTTCGTCCTGACCTCGGCCTTGGCCTCGGGTCGGCACTGTTCGCC
TTGGTATTCCGAGGCATTTTGGGTTGAACCGAGGCAGCCGAGGGCACCAGGGACCGAACGAAAGCAGAAGAGCGCGCGGGCCGACCATGGGCCTCGGCTCGGCCCAAGGC
CGAGGCCGAGCAGGGGTCGGGCCAAAAGCCCGATCCCCTCAGCTTTGGCCCGACCCTTTGGCCTGTTCTTCCTCCAGATTTCGTTTCCTGACTGTCTCCTTGGGCTGGAT
CCCTCCCATTCAGATGTGACCTCGGTTCATTCATGTACCCCTCCTAACTCGGATGCTGATTACACATTTGATCCGGATGACCCCAAGTCAACAACTGGTTTCTGTGTTTT
CTTTGGTAGAAATTTGAGAACATGGGGATCCAAAAAGCAAAATATCATATCCCGATCCAACACTGAAGTGGAATATCGCAGCTTGACCACTATTGCTACTGAACTAGTTT
GGCTTAAATCCTTATTTTTTTTATTTACAGATCTATTTAACTAA
mRNA sequence
ATGTTGAAATCCAACTTTGATATGAAAGACCTTGGTCTTGCTGATGTTATACTTGAAATTCGTATAACTAGAAATCCAAGTGGATACATACTTTCTCAATCTCACTACAT
AGAGAAAATGTTGAGAAAATTTGATCAATTTGAGAGTAAACTTGCAGTTACTCCATTTGATCCAAGTTGTAAGTTAAAGAAGAATAGTGGTGAAGGTGTGTCTTCTCTAG
AAAACTCTAGAGTTGTTGGTAGCTTAATGTACATAATGAACTGTACAAGACCTGATACAACCTATTCAGTAGGAAGGTTAGCAAGATACACCAGCAATCCTGGGCATGAT
CATTGGAATGCTTTAATTAGAGTTCTAAGGTACTTGAAGTACACTCTGAACTATGGATTGCATTATATGAGGGATCCACATGTACTGGAGGGATTTAGTGATCCTAATTG
GATTTCTGATAGCATGGAGACTAAATCTACCAGTGGGTATATTTTTACCTTAGGTGGAGCAGCTGTATCATGGAAATCTTCAAAACAGACGTGTATAACGCGTTCAACTA
TGGAGTCAAAGTTTATAGCTCTTGACAAAGCTGGAGAAGAAGCTGAATGGCTTCGAAATTTTCTTGAAGATATTCCATTATGGTTGAAGCATGGTTCTAGGTATTCCAAG
GCGTTTTGGGCTGAACTGAGGCAGCCGAGGGCACAAGGGATTGAACGAAAGCGGAAGAGCTCGACCCGCACGAGCAGGCCGAGTGGTCGGTCGAGCAAGGGGTCGGGCCA
AAAGCCCAATCCCCTTAGCTTTGGCCCGTTCTTCCTCTGGATTTCGTTTCCCGACTATCTCCTCGGGTGCTTTTTCTCACTTAGCCATTTATGTGGCGCACTTGGTAAGA
CCATAGGGTCGGTCTCGGCCACACTCCATGTTATTGAAAATATTGAGTTCCTACATCAAAATATTAGTTTCTACAACAGAAAAGTGAGTGAAGAGGATAGGGTCGACCTC
GACCTTAATCCTCCATTAGCTCAAATTGTAGCTCCCTGCACTCACACATTAGTATTTGCAAAAGAAAAAACGAGGCCTGATCGACCTCGATCTTCTTCCTATTGGGAATA
TGGCAATGCCACTCCTCCAAGGTCATTCCTGCAAGTTAGTTACATCGGGGAGTATCGACGTTGCCACTCCACCGAGTACCTTGGGAGTATAGGCGTTGCCACTCCTACAA
GGACTTTTCCACAAATTGCTTACCTCGATGAGTATAGGCGTTGCCACTCCACCGAATACCCTATGAGTATAGGCGTTGTCACTCCTATAAGAACTTTTTCACAGATTGCT
TACCTTGGGGAGTATAGGCGTTGCCACTCCACCGAGTACCCTAGGAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACAGATTGCTTACCTTAGGGAGTATAGGCA
TTGCCACTCCACCGAATACCCTAGAAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACAGATTGTTTACCTCGGGGAGTATGGGCGTTGCCACTCCACCGAGTACC
CAAGGAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACAGATTGCTAACCTCGGGGAGTATAGGCGCTGCCACTCCACCGAGTACCCTAGGAGTATAGGCGTTGCC
ATTCCTACAAGGACTTTTTCACAGATTTCGTACCTCGGGGAGTATAGGCGTTGCCACTCCACCGAGTACCTTAGGAGTATAGGCGTTGCCACTCCTAGAAGGACTTTTTC
ACAGATTGCTTACCTCGGGGAGTATAGGCGTTGCCACTCCACCGAGTACCCTAGGAGTATAGGCGTTGCCACTCCTACAAGGACTTTTTCACATATTGTTTACCTCGGGG
AGTATAGGCGTTGCCACTCCACCGAGCACCTTGGGAGTATAGGCGTTGCCACTCCTACAAGAAATTTTTTACAAGTCGATACCTCGGGGAGTATAGGCATTGCCACTCCA
CCGACTCTTCAACGAGACTTTTACGCTGATTCGTCCTCCAAAGATGATCCTGATCCCTCGTGGGACTTCGTCCTGACCTCGGCCTTGGCCTCGGGTCGGCACTGTTCGCC
TTGGTATTCCGAGGCATTTTGGGTTGAACCGAGGCAGCCGAGGGCACCAGGGACCGAACGAAAGCAGAAGAGCGCGCGGGCCGACCATGGGCCTCGGCTCGGCCCAAGGC
CGAGGCCGAGCAGGGGTCGGGCCAAAAGCCCGATCCCCTCAGCTTTGGCCCGACCCTTTGGCCTGTTCTTCCTCCAGATTTCGTTTCCTGACTGTCTCCTTGGGCTGGAT
CCCTCCCATTCAGATGTGACCTCGGTTCATTCATGTACCCCTCCTAACTCGGATGCTGATTACACATTTGATCCGGATGACCCCAAGTCAACAACTGGTTTCTGTGTTTT
CTTTGGTAGAAATTTGAGAACATGGGGATCCAAAAAGCAAAATATCATATCCCGATCCAACACTGAAGTGGAATATCGCAGCTTGACCACTATTGCTACTGAACTAGTTT
GGCTTAAATCCTTATTTTTTTTATTTACAGATCTATTTAACTAA
Protein sequence
MLKSNFDMKDLGLADVILEIRITRNPSGYILSQSHYIEKMLRKFDQFESKLAVTPFDPSCKLKKNSGEGVSSLENSRVVGSLMYIMNCTRPDTTYSVGRLARYTSNPGHD
HWNALIRVLRYLKYTLNYGLHYMRDPHVLEGFSDPNWISDSMETKSTSGYIFTLGGAAVSWKSSKQTCITRSTMESKFIALDKAGEEAEWLRNFLEDIPLWLKHGSRYSK
AFWAELRQPRAQGIERKRKSSTRTSRPSGRSSKGSGQKPNPLSFGPFFLWISFPDYLLGCFFSLSHLCGALGKTIGSVSATLHVIENIEFLHQNISFYNRKVSEEDRVDL
DLNPPLAQIVAPCTHTLVFAKEKTRPDRPRSSSYWEYGNATPPRSFLQVSYIGEYRRCHSTEYLGSIGVATPTRTFPQIAYLDEYRRCHSTEYPMSIGVVTPIRTFSQIA
YLGEYRRCHSTEYPRSIGVATPTRTFSQIAYLREYRHCHSTEYPRSIGVATPTRTFSQIVYLGEYGRCHSTEYPRSIGVATPTRTFSQIANLGEYRRCHSTEYPRSIGVA
IPTRTFSQISYLGEYRRCHSTEYLRSIGVATPRRTFSQIAYLGEYRRCHSTEYPRSIGVATPTRTFSHIVYLGEYRRCHSTEHLGSIGVATPTRNFLQVDTSGSIGIATP
PTLQRDFYADSSSKDDPDPSWDFVLTSALASGRHCSPWYSEAFWVEPRQPRAPGTERKQKSARADHGPRLGPRPRPSRGRAKSPIPSALARPFGLFFLQISFPDCLLGLD
PSHSDVTSVHSCTPPNSDADYTFDPDDPKSTTGFCVFFGRNLRTWGSKKQNIISRSNTEVEYRSLTTIATELVWLKSLFFLFTDLFN
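
As a consistency check, the start of the CDS sequence can be translated and compared with the start of the protein sequence. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` and the variable `cds_prefix` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed for Lag0021928.
cds_prefix = "ATGTTGAAATCCAACTTTGATATGAAAGAC"
print(translate(cds_prefix))  # MLKSNFDMKD
```

The output matches the first ten residues of the protein sequence (MLKSNFDMKD). Passing the full CDS string to the same function would allow the whole record to be checked in the same way.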