CuGenDBv2

Lag0000600 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000600
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr4:11097563..11098936
RNA-Seq Expression: Lag0000600
Synteny: Lag0000600
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
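
The genome location field can be split into a chromosome name and a coordinate pair. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:11097563..11098936")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1374 bp on chr4.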


Homology
GenBank top hits | e-value | % identity | Alignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa] | 9.5e-56 | 48.05
Query:  PPQQFSQNQIQNPIPYPNP--FTPNPYPTLP---QPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC
        PP   S     + IP PNP     +P P +P   QPL+VKL+D N+++WK QLLN VIANGL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M 
Subjt:  PPQQFSQNQIQNPIPYPNP--FTPNPYPTLP---QPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC

Query:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT
        WIY+S++E  +G+IV + SA+ IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVT+
Subjt:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT

Query:  IQNRSDNPSLEDVRSLLLAYEARLEKQNAVD
        IQ+++  PS+E+V SLLL+Y+ARLE+Q+A D
Subjt:  IQNRSDNPSLEDVRSLLLAYEARLEKQNAVD

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] | 4.0e-54 | 39.4
Query:  PPQQFSQNQIQNPIPYPNPFTPNP-----YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC
        PP   S     + IP PNP   N       P++ QPL+VKL+D N+++WK QLLN VIANGL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M 
Subjt:  PPQQFSQNQIQNPIPYPNPFTPNP-----YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC

Query:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT
        WIY+S++E  +G+IV + SA+ IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVT+
Subjt:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT

Query:  IQNRSDNPSLEDVRS-LLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRR-SNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPS
        IQ+++  PS+E+  S   L  + + +             N S+ S  NS   S+P+  +     P ++P   SP              KP          
Subjt:  IQNRSDNPSLEDVRS-LLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRR-SNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPS

Query:  RLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPP
             +P+CQIC K GHTA  C+H TNL YQ PPP
Subjt:  RLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPP

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale] | 5.8e-53 | 41.42
Query:  SPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNP-YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHW
        +P+  NP  Q  P      Q+QI    P   P  P P  P++ QP ++KL+  N+L+WKNQLLN +IANGL+ ++ GS   PPR+ D  +   N +++ W
Subjt:  SPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNP-YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLG
        +R+NR IM WIY+SL++  MG+IV + SA  IW +L + Y S + A+I  L+ +LQ ++KD L+  +Y+ + K + +  +A+GEP+S +DHL ++  GL 
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLG

Query:  SEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIP---FRPPF--NPQAFSPFSSQQHSASPSLLG
         EYNAFVT+I  R DN  LE++ SLLL+YE RLE QNA  QL+  QANL+ L   N  +   +PN S P   F   F    Q F       +   PS+LG
Subjt:  SEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIP---FRPPF--NPQAFSPFSSQQHSASPSLLG

Query:  KPQTQQLQK
        KPQ + + +
Subjt:  KPQTQQLQK

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 3.0e-57 | 40.73
Query:  FPPP----QQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIM
        FPP        + N  QNP P     T  P P+L Q LS+KL+++N LL K+QLLN +IANGL+ ++    ++PP+YLD    Q NP+F+ W+R N+ +M
Subjt:  FPPP----QQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIM

Query:  CWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVT
         WIYSSL+   +G+IV + +A  IW SL   Y+S + A +M L +QLQ+IKK ++ +S+YLS++K V D+F+ IGEP+SYRD L  IL+GL  EY+ FVT
Subjt:  CWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVT

Query:  TIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR
        +I NRSD PSL++V SLL  YE RL +++    LN  QAN                    P +P +N                                 
Subjt:  TIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR

Query:  LSSNKPQCQICGKFGHTALICHHRTNLAYQT---PPPQAYCPRFP-QPLPPLSLML
          ++ PQCQICGK GH AL  +HRTNL Y     P   A+ P  P Q   P+S ML
Subjt:  LSSNKPQCQICGKFGHTALICHHRTNLAYQT---PPPQAYCPRFP-QPLPPLSLML

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | 1.4e-115 | 65.43
Query:  FPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIY
        FPPP   + N +  P   PNPF+ NP+PTLPQPL+VKLND+NFLLWKNQLLNAVIANGL+GYL G+I  PP++LD  Q QPNP +  WERYNR +MCWIY
Subjt:  FPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIY

Query:  SSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQN
        SSLSEEKMGE+VS ++   IW+SL R YDSKTTARIMGLKT+LQ ++KD  SVSQYL++IKE+ADKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVT+I N
Subjt:  SSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQN

Query:  RSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQ-NSRRSNPK---PNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR
        R+D+PSLEDVRSLLLAYEARL+KQN VDQLN+AQANL +LSLQ NS+R  PK   PNH   ++  F     SP S+ Q   S S+LGKPQ+  + KWP +
Subjt:  RSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQ-NSRRSNPK---PNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR

Query:  LSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQA-YCPRFPQPLPPLS
         SS+K QCQICGK GH+A +C+HRTN+AY    PQA Y    P P  P S
Subjt:  LSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQA-YCPRFPQPLPPLS

TrEMBL top hits | e-value | % identity | Alignment
A0A2P5BGF8 Uncharacterized protein | 2.8e-53 | 41.42
Query:  SPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNP-YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHW
        +P+  NP  Q  P      Q+QI    P   P  P P  P++ QP ++KL+  N+L+WKNQLLN +IANGL+ ++ GS   PPR+ D  +   N +++ W
Subjt:  SPNFYNPRPQFFPPPQQFSQNQIQNPIPYPNPFTPNP-YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLG
        +R+NR IM WIY+SL++  MG+IV + SA  IW +L + Y S + A+I  L+ +LQ ++KD L+  +Y+ + K + +  +A+GEP+S +DHL ++  GL 
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLG

Query:  SEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIP---FRPPF--NPQAFSPFSSQQHSASPSLLG
         EYNAFVT+I  R DN  LE++ SLLL+YE RLE QNA  QL+  QANL+ L   N  +   +PN S P   F   F    Q F       +   PS+LG
Subjt:  SEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIP---FRPPF--NPQAFSPFSSQQHSASPSLLG

Query:  KPQTQQLQK
        KPQ + + +
Subjt:  KPQTQQLQK

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 | 1.4e-57 | 40.73
Query:  FPPP----QQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIM
        FPP        + N  QNP P     T  P P+L Q LS+KL+++N LL K+QLLN +IANGL+ ++    ++PP+YLD    Q NP+F+ W+R N+ +M
Subjt:  FPPP----QQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIM

Query:  CWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVT
         WIYSSL+   +G+IV + +A  IW SL   Y+S + A +M L +QLQ+IKK ++ +S+YLS++K V D+F+ IGEP+SYRD L  IL+GL  EY+ FVT
Subjt:  CWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVT

Query:  TIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR
        +I NRSD PSL++V SLL  YE RL +++    LN  QAN                    P +P +N                                 
Subjt:  TIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR

Query:  LSSNKPQCQICGKFGHTALICHHRTNLAYQT---PPPQAYCPRFP-QPLPPLSLML
          ++ PQCQICGK GH AL  +HRTNL Y     P   A+ P  P Q   P+S ML
Subjt:  LSSNKPQCQICGKFGHTALICHHRTNLAYQT---PPPQAYCPRFP-QPLPPLSLML

A0A6J1DQX7 uncharacterized protein LOC111022315 | 6.7e-116 | 65.43
Query:  FPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIY
        FPPP   + N +  P   PNPF+ NP+PTLPQPL+VKLND+NFLLWKNQLLNAVIANGL+GYL G+I  PP++LD  Q QPNP +  WERYNR +MCWIY
Subjt:  FPPPQQFSQNQIQNPIPYPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIY

Query:  SSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQN
        SSLSEEKMGE+VS ++   IW+SL R YDSKTTARIMGLKT+LQ ++KD  SVSQYL++IKE+ADKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVT+I N
Subjt:  SSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQN

Query:  RSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQ-NSRRSNPK---PNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR
        R+D+PSLEDVRSLLLAYEARL+KQN VDQLN+AQANL +LSLQ NS+R  PK   PNH   ++  F     SP S+ Q   S S+LGKPQ+  + KWP +
Subjt:  RSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANLSSLSLQ-NSRRSNPK---PNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSR

Query:  LSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQA-YCPRFPQPLPPLS
         SS+K QCQICGK GH+A +C+HRTN+AY    PQA Y    P P  P S
Subjt:  LSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQA-YCPRFPQPLPPLS

A0A7J0EGI5 Uncharacterized protein | 4.6e-56 | 48.05
Query:  PPQQFSQNQIQNPIPYPNP--FTPNPYPTLP---QPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC
        PP   S     + IP PNP     +P P +P   QPL+VKL+D N+++WK QLLN VIANGL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M 
Subjt:  PPQQFSQNQIQNPIPYPNP--FTPNPYPTLP---QPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC

Query:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT
        WIY+S++E  +G+IV + SA+ IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVT+
Subjt:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT

Query:  IQNRSDNPSLEDVRSLLLAYEARLEKQNAVD
        IQ+++  PS+E+V SLLL+Y+ARLE+Q+A D
Subjt:  IQNRSDNPSLEDVRSLLLAYEARLEKQNAVD

A0A7J0GPN0 UBX domain-containing protein | 1.9e-54 | 39.4
Query:  PPQQFSQNQIQNPIPYPNPFTPNP-----YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC
        PP   S     + IP PNP   N       P++ QPL+VKL+D N+++WK QLLN VIANGL+ +L GS   PPR+LD QQ Q NP+F  W+RYNR +M 
Subjt:  PPQQFSQNQIQNPIPYPNPFTPNP-----YPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMC

Query:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT
        WIY+S++E  +G+IV + SA+ IW +L+R Y + + A +  L+T LQ IKK+ L+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVT+
Subjt:  WIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTT

Query:  IQNRSDNPSLEDVRS-LLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRR-SNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPS
        IQ+++  PS+E+  S   L  + + +             N S+ S  NS   S+P+  +     P ++P   SP              KP          
Subjt:  IQNRSDNPSLEDVRS-LLLAYEARLEKQNAVDQLNLAQANLSSLSLQNSRR-SNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPS

Query:  RLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPP
             +P+CQIC K GHTA  C+H TNL YQ PPP
Subjt:  RLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPP

SwissProt top hits | e-value | % identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.8e-20 | 22.9
Query:  KLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYL-DDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTAR
        KL  +N+L+W  Q+        L G+L GS   PP  +  D   + NPD+  W+R ++ I   +  ++S      +    +AA IW +L++ Y + +   
Subjt:  KLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYL-DDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRSYDSKTTAR

Query:  IMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQA
        +  L+TQL++  K   ++  Y+  +    D+ + +G+P+ + + +  +L+ L  EY   +  I  +   P+L ++   LL +E+++    AV    +   
Subjt:  IMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQA

Query:  NLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSRLSSNKP---QCQICGKFGHTALIC---HHRTNLAYQTPP
          +++S +N+  +N   N +               +++  + + +   KP  Q    +    + +KP   +CQICG  GH+A  C    H  +      P
Subjt:  NLSSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSRLSSNKP---QCQICGKFGHTALIC---HHRTNLAYQTPP

Query:  PQAYCPRFPQ
        P  + P  P+
Subjt:  PQAYCPRFPQ

Arabidopsis top hits | e-value | % identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.3e-13 | 20.31
Query:  PLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEK-MGEIVSFDSAAAIWNSLKRSYDSK
        P+ + + +SN+  W+   L   ++  + G++ G++              N + ++W++ +  +   +Y +L+ ++  G  V+  ++  IW  +K  + + 
Subjt:  PLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEK-MGEIVSFDSAAAIWNSLKRSYDSK

Query:  TTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEK
          AR + L ++L+     ++ V+ Y  ++K++AD    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  ++L   E RL++
Subjt:  TTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 4.2e-17 | 25.84
Query:  LSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFD-SAAAIWNSLKRSYDSKT
        +++ LN  N+ +W+       ++ G+ G++ GS    P       T+       W+  +  +  WIY ++++  +  I+    +A  +W SL+  +    
Subjt:  LSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFD-SAAAIWNSLKRSYDSKT

Query:  TARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNL
         AR +  + +L+    D+LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  PS  + RS+LL  E+RL  ++   + +L
Subjt:  TARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNL

Query:  AQANLSSLS
        +  N  SLS
Subjt:  AQANLSSLS


Sequences
CDS sequence
ATGGCCTCTGATGCTTCTTCTTCTTCTTCTTCCAGTGGTCCGATAATCACACCCGTAGCTTCACCATCTACCCCAAATACTACACCGATTGTCACACCGATTACAAACAC
CCAGAATGTTCAGCCTCAAATCCCTCGACCAAATCCTCAAATTACTCGACCAAATCCCCAAAATCCTCAACAACCATTTCAACCTCAACCTTCGATTTCTACTCATCAAC
ATTATCAAGCCTATGCCCAACCATTTTCCCCAAATTTCTATAATCCTCGTCCTCAGTTTTTCCCACCTCCACAACAGTTTTCACAAAATCAAATCCAAAATCCTATTCCA
TACCCAAACCCTTTTACCCCTAACCCTTACCCGACCTTACCCCAACCCTTATCGGTGAAGCTGAATGACTCGAACTTTCTCCTCTGGAAAAATCAGTTGCTGAATGCGGT
GATTGCAAATGGGCTTCAAGGGTACCTCCATGGCTCTATTGCGGCTCCTCCCAGGTATCTCGATGATCAACAAACTCAACCGAATCCAGATTTTCTCCATTGGGAAAGGT
ACAATCGGTTTATCATGTGTTGGATATACTCTTCTCTGTCTGAGGAAAAAATGGGTGAGATAGTAAGTTTTGACTCTGCTGCTGCTATTTGGAACTCTTTGAAACGATCC
TATGATTCTAAAACTACGGCTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATAAAGAAGGATAACCTCTCTGTTAGTCAATATCTGTCTCAAATAAAGGAAGTAGC
TGATAAATTTTCTGCAATAGGTGAGCCCATCTCTTATAGGGACCATTTAGCTCATATCTTAGATGGTCTTGGTAGTGAATACAATGCCTTTGTTACTACCATACAAAATC
GGTCTGATAATCCGTCTTTAGAAGATGTTAGAAGTTTATTATTGGCATATGAGGCCCGGTTGGAAAAACAGAATGCTGTGGATCAATTGAATCTTGCCCAGGCAAATTTA
AGTTCTCTTAGCCTCCAAAACAGCCGTCGGTCCAACCCCAAACCAAATCACTCCATCCCCTTTAGACCTCCCTTCAATCCACAAGCCTTTTCTCCTTTTTCCTCTCAACA
ACACTCTGCCTCTCCAAGCCTCTTAGGCAAACCACAAACTCAACAACTTCAAAAATGGCCTTCTCGTTTATCTTCTAACAAACCTCAATGCCAAATATGTGGCAAATTTG
GGCACACTGCTTTAATTTGTCACCATAGAACTAATTTGGCCTACCAAACCCCACCTCCTCAAGCCTATTGTCCACGGTTTCCGCAACCACTCCCTCCTCTGTCCCTGATG
CTTTATCCACTATGTCCACTGATTCCTATCATCCTGACGAGAATTGGTTTTTAG
mRNA sequence
ATGGCCTCTGATGCTTCTTCTTCTTCTTCTTCCAGTGGTCCGATAATCACACCCGTAGCTTCACCATCTACCCCAAATACTACACCGATTGTCACACCGATTACAAACAC
CCAGAATGTTCAGCCTCAAATCCCTCGACCAAATCCTCAAATTACTCGACCAAATCCCCAAAATCCTCAACAACCATTTCAACCTCAACCTTCGATTTCTACTCATCAAC
ATTATCAAGCCTATGCCCAACCATTTTCCCCAAATTTCTATAATCCTCGTCCTCAGTTTTTCCCACCTCCACAACAGTTTTCACAAAATCAAATCCAAAATCCTATTCCA
TACCCAAACCCTTTTACCCCTAACCCTTACCCGACCTTACCCCAACCCTTATCGGTGAAGCTGAATGACTCGAACTTTCTCCTCTGGAAAAATCAGTTGCTGAATGCGGT
GATTGCAAATGGGCTTCAAGGGTACCTCCATGGCTCTATTGCGGCTCCTCCCAGGTATCTCGATGATCAACAAACTCAACCGAATCCAGATTTTCTCCATTGGGAAAGGT
ACAATCGGTTTATCATGTGTTGGATATACTCTTCTCTGTCTGAGGAAAAAATGGGTGAGATAGTAAGTTTTGACTCTGCTGCTGCTATTTGGAACTCTTTGAAACGATCC
TATGATTCTAAAACTACGGCTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATAAAGAAGGATAACCTCTCTGTTAGTCAATATCTGTCTCAAATAAAGGAAGTAGC
TGATAAATTTTCTGCAATAGGTGAGCCCATCTCTTATAGGGACCATTTAGCTCATATCTTAGATGGTCTTGGTAGTGAATACAATGCCTTTGTTACTACCATACAAAATC
GGTCTGATAATCCGTCTTTAGAAGATGTTAGAAGTTTATTATTGGCATATGAGGCCCGGTTGGAAAAACAGAATGCTGTGGATCAATTGAATCTTGCCCAGGCAAATTTA
AGTTCTCTTAGCCTCCAAAACAGCCGTCGGTCCAACCCCAAACCAAATCACTCCATCCCCTTTAGACCTCCCTTCAATCCACAAGCCTTTTCTCCTTTTTCCTCTCAACA
ACACTCTGCCTCTCCAAGCCTCTTAGGCAAACCACAAACTCAACAACTTCAAAAATGGCCTTCTCGTTTATCTTCTAACAAACCTCAATGCCAAATATGTGGCAAATTTG
GGCACACTGCTTTAATTTGTCACCATAGAACTAATTTGGCCTACCAAACCCCACCTCCTCAAGCCTATTGTCCACGGTTTCCGCAACCACTCCCTCCTCTGTCCCTGATG
CTTTATCCACTATGTCCACTGATTCCTATCATCCTGACGAGAATTGGTTTTTAG
Protein sequence
MASDASSSSSSSGPIITPVASPSTPNTTPIVTPITNTQNVQPQIPRPNPQITRPNPQNPQQPFQPQPSISTHQHYQAYAQPFSPNFYNPRPQFFPPPQQFSQNQIQNPIP
YPNPFTPNPYPTLPQPLSVKLNDSNFLLWKNQLLNAVIANGLQGYLHGSIAAPPRYLDDQQTQPNPDFLHWERYNRFIMCWIYSSLSEEKMGEIVSFDSAAAIWNSLKRS
YDSKTTARIMGLKTQLQKIKKDNLSVSQYLSQIKEVADKFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVRSLLLAYEARLEKQNAVDQLNLAQANL
SSLSLQNSRRSNPKPNHSIPFRPPFNPQAFSPFSSQQHSASPSLLGKPQTQQLQKWPSRLSSNKPQCQICGKFGHTALICHHRTNLAYQTPPPQAYCPRFPQPLPPLSLM
LYPLCPLIPIILTRIGF
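
The protein sequence can be cross-checked against the CDS by translating it. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` helper is hypothetical and written with the standard library only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is. Codons containing
    ambiguous bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six and last four codons of the CDS listed in this record.
print(translate("ATGGCCTCTGATGCTTCT"))  # MASDAS
print(translate("ATTGGTTTTTAG"))        # IGF
```

Both fragments agree with the start (`MASDAS`) and end (`IGF`) of the protein sequence shown above; passing the full CDS through the same function allows a complete comparison.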