; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027536 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027536
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:1827956..1830226
RNA-Seq ExpressionLag0027536
SyntenyLag0027536
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.7e-17145.65Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT EIATQ++  + +K LW   QSL G  +R++  +L+  F   RKG  KM DYL  MK+  D L  AG+PVST +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV
         +SW ++QA+LL FE R+E  N    + +     T NVAN  ++  +++N  W  S+S G                     RGGRGRG  G      +  
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV

Query:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG
        CQVCG   H A+ C+HRFDK +S           N+S         +AF+A+ NS       V D +WY DSGASNHVT       +  E+ G   +V+G
Subjt:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG

Query:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK
        NGE L I  TG + L +    L+L+++L VP ITKNL+SVSKLA DN++ +EF  +CC +KDK +G+ +LKG+L+DGLYQL+   R P            
Subjt:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK

Query:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS
                S FV      SV     K  WH+RLGHP ++VLD V+  CK++V  ++  SFCE+CQ+GK H LPF  SSS A+ P EL+H+D+WGPAP+++
Subjt:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS

Query:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT
        + GF+YYV F+DD+SR+ W+YPLK+K++ + AF  F ++  NQF  +IK++Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Subjt:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT

Query:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS
        LLAQA MPL YWW+AF  AV LIN LP+ V + +SP  ++  K+ ++  L++FGCACYPCL+PY+ HK Q+H+ RCV+LG+S SHKG+KCL++ GR+FIS
Subjt:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS

Query:  RHVQFNELMFPF
        RHV FNE  FPF
Subjt:  RHVQFNELMFPF

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]4.0e-16044.32Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        M  +IATQ++  + +K LW   QSL G  +++   +L+  F  +RKG  KM +YL  MK+ +D L  AGSP+S  +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ
        ++SW ++QA+LL FE RL+          FN+   + +  + N  N+   RG  +             N RG    +N RG RGGRG+G      SN + 
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ

Query:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI
         CQVC   GH A+ C +RFD+   P   R  +   +  G+       SAF+A+       P    D  WY DSGA+NHVT   +      E+ G   +++
Subjt:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI

Query:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS
        GNGE L I  +G T L+N    L+L++VL VP+ITKNL+SVSKL  DN++ +EF  +CC +KDK +GQ +LKG L+DGLYQL+N        E C     
Subjt:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS

Query:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL
        K S                          WH++LGHP ++VLD V+ DC +++  ++  SFCE+CQFGK H LPF  SSS  + P  LIHSD+WGPAP+L
Subjt:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL

Query:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL
        S  GF+YYV F+DD+SR+ W++PLK+K+D + AF  F ++  NQF  +IKI+Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRHV E GL
Subjt:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL

Query:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI
        TLLAQA MPL+YWW+AF  AV LIN LP+SV   +SP  ++  ++ ++  L+ FGCACYPCL+PY+ HK QFH+ RCV++G+S SHKG+KC+++ GR+F+
Subjt:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI

Query:  SRHVQFNELMFPF
        SRHV FNE  FPF
Subjt:  SRHVQFNELMFPF

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]7.0e-17345.2Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT E+ATQ++  + ++ +W   QSL G  +R+   FL+  F ++RKG  KM +YL  MK  AD+L  AGS VST +LV+Q L GLD EYNPIV  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV
         ++W EMQA+LL +E RLE  N Q  +       T+N ++N + +   N RG + +   G+ GQ             N   RGGRGRG      + +R V
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV

Query:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG
        CQVC +PGH+A  CYHRF+K +   + +  ++  +      N         N N+  A P TV D +WY DSGASNHVT D N +    E  G   + +G
Subjt:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG

Query:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK
        NG +L I   GD+ L      L+L ++L VP+ITKNL+S+SKL  DND+++EFH   C +KDK +G+ +L+G ++DGLYQL      PG           
Subjt:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK

Query:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS
        +++ N    VF              K  WH++LGHP S+VL+ V+  C ++    E   FCE+CQFGK+HNLPF  S S AK P +L+HSD+WGPAP+ S
Subjt:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS

Query:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT
          GF+YYVLFLDD+SR+ W+YPLK+K+D   AF  F ++V NQF  +IK LQ D GGE+  + +   + GI  R SCPYT AQNGRAERKHRHVVE+GLT
Subjt:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT

Query:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS
        LLAQA MPL YWW+AF  AV LIN LPT V++ KSP + L  K  ++  +++FGCACYPCL+PY+ HK QFH+ +CV+LG+S SHKG+KCL+++GR+FIS
Subjt:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS

Query:  RHVQFNELMFPF---ALDFGKPS---SSPT-----FSPSHGPSILTWFQSLEHSHLSQENT
        RHV FNE  FPF    L+  KP+   + PT      SP+ G ++    Q L  ++ S  NT
Subjt:  RHVQFNELMFPF---ALDFGKPS---SSPT-----FSPSHGPSILTWFQSLEHSHLSQENT

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]5.1e-17145.01Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT  IATQ++  + +  LW   QSL G  +R++  +L+  F  +RKG  KM DYL  MK+ AD L  AG+P+ST +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV
         +SW ++QA+LL FE R+E  N   ++ +     T NVA       ++++RG  ++ +N  RG   NNN R GSNF   RG  GRGR +        +  
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV

Query:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG
        CQVCG   H A+ C++RFDK +S           N+S N+      +AF+A+ NS       + D +WY DSGASNHVT   +   N  E+ G   +++G
Subjt:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG

Query:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK
        NGE L I  TG + L +    L+L+++L VP+ITKNL+SVSKLA DN++ +EF  +CC +KDK +G+ +L+G+L+DGLYQL+                 K
Subjt:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK

Query:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS
        +S+A                  +  K  WH++LGHP ++VLD V+  C +++  ++  SFCE+CQ+GK H LPF  S S AK   EL+H+D+WGPAP++S
Subjt:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS

Query:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT
        + GF+YYV F+DD++R+ W+YPLK+K+D   AF  F +MV NQF  +IK +Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Subjt:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT

Query:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS
        LLAQA MPL YWW+AF  AV LIN LP+SV   KSP  +LH ++ ++  L+ FGCACYP L+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GR+FIS
Subjt:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS

Query:  RHVQFNELMFPF---ALDFGKPSSSPTFSPS
        RHV FNE  FPF    L+   P  + T SPS
Subjt:  RHVQFNELMFPF---ALDFGKPSSSPTFSPS

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]9.5e-16244.77Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT +IATQV+  + +K LW   QSL G  +R+   +L+  F  + K   KM  YL  MK+ AD L  AGSP+S+ +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ
        +ISW + QA+LL FE RL+                    NN NN+N N +   N++  N   G  + +  RGG   +N RG RGGRGR      +   R 
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ

Query:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI
        +CQ+CG+ GH+A  CY+RFDK ++        +N    G   +    SAFVA+       P    D  WY DSGASNHVT     L +  E  G   +++
Subjt:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI

Query:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS
        GNGE L I  +G T L++    ++L NVL VPEITKNL+SVSKL  DN+  +EF  + C +KDK +G+ +LKG L+DGLYQL+     P   + C+    
Subjt:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS

Query:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL
                               I  K +WH++LGHP ++VL+ V+ D  +++  ++  +FCE+CQFGK H LPF  SSS AK P +LIH+D+WGPAP+L
Subjt:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL

Query:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL
        S   F+YYV FLDD+SR+ W++PLK+K++ + AF+ F ++V NQF  +IK+++ D GGEY  + +     GI  + SCPYT  QNGRAERKHRHV E GL
Subjt:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL

Query:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI
        TLLAQA MPL YWW+AF  AV LIN LP+SV   +SP  ++  K+ ++  L+ FGCACYPCL+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GRVF+
Subjt:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI

Query:  SRHVQFNELMFPFALDF
        SRHV FNE  FPF   F
Subjt:  SRHVQFNELMFPFALDF

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-17345.2Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT E+ATQ++  + ++ +W   QSL G  +R+   FL+  F ++RKG  KM +YL  MK  AD+L  AGS VST +LV+Q L GLD EYNPIV  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV
         ++W EMQA+LL +E RLE  N Q  +       T+N ++N + +   N RG + +   G+ GQ             N   RGGRGRG      + +R V
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV

Query:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG
        CQVC +PGH+A  CYHRF+K +   + +  ++  +      N         N N+  A P TV D +WY DSGASNHVT D N +    E  G   + +G
Subjt:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG

Query:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK
        NG +L I   GD+ L      L+L ++L VP+ITKNL+S+SKL  DND+++EFH   C +KDK +G+ +L+G ++DGLYQL      PG           
Subjt:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK

Query:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS
        +++ N    VF              K  WH++LGHP S+VL+ V+  C ++    E   FCE+CQFGK+HNLPF  S S AK P +L+HSD+WGPAP+ S
Subjt:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS

Query:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT
          GF+YYVLFLDD+SR+ W+YPLK+K+D   AF  F ++V NQF  +IK LQ D GGE+  + +   + GI  R SCPYT AQNGRAERKHRHVVE+GLT
Subjt:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT

Query:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS
        LLAQA MPL YWW+AF  AV LIN LPT V++ KSP + L  K  ++  +++FGCACYPCL+PY+ HK QFH+ +CV+LG+S SHKG+KCL+++GR+FIS
Subjt:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS

Query:  RHVQFNELMFPF---ALDFGKPS---SSPT-----FSPSHGPSILTWFQSLEHSHLSQENT
        RHV FNE  FPF    L+  KP+   + PT      SP+ G ++    Q L  ++ S  NT
Subjt:  RHVQFNELMFPF---ALDFGKPS---SSPT-----FSPSHGPSILTWFQSLEHSHLSQENT

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)2.4e-17145.01Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT  IATQ++  + +  LW   QSL G  +R++  +L+  F  +RKG  KM DYL  MK+ AD L  AG+P+ST +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV
         +SW ++QA+LL FE R+E  N   ++ +     T NVA       ++++RG  ++ +N  RG   NNN R GSNF   RG  GRGR +        +  
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV

Query:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG
        CQVCG   H A+ C++RFDK +S           N+S N+      +AF+A+ NS       + D +WY DSGASNHVT   +   N  E+ G   +++G
Subjt:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG

Query:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK
        NGE L I  TG + L +    L+L+++L VP+ITKNL+SVSKLA DN++ +EF  +CC +KDK +G+ +L+G+L+DGLYQL+                 K
Subjt:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK

Query:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS
        +S+A                  +  K  WH++LGHP ++VLD V+  C +++  ++  SFCE+CQ+GK H LPF  S S AK   EL+H+D+WGPAP++S
Subjt:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS

Query:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT
        + GF+YYV F+DD++R+ W+YPLK+K+D   AF  F +MV NQF  +IK +Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Subjt:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT

Query:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS
        LLAQA MPL YWW+AF  AV LIN LP+SV   KSP  +LH ++ ++  L+ FGCACYP L+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GR+FIS
Subjt:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS

Query:  RHVQFNELMFPF---ALDFGKPSSSPTFSPS
        RHV FNE  FPF    L+   P  + T SPS
Subjt:  RHVQFNELMFPF---ALDFGKPSSSPTFSPS

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)4.6e-16244.77Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT +IATQV+  + +K LW   QSL G  +R+   +L+  F  + K   KM  YL  MK+ AD L  AGSP+S+ +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ
        +ISW + QA+LL FE RL+                    NN NN+N N +   N++  N   G  + +  RGG   +N RG RGGRGR      +   R 
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ

Query:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI
        +CQ+CG+ GH+A  CY+RFDK ++        +N    G   +    SAFVA+       P    D  WY DSGASNHVT     L +  E  G   +++
Subjt:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI

Query:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS
        GNGE L I  +G T L++    ++L NVL VPEITKNL+SVSKL  DN+  +EF  + C +KDK +G+ +LKG L+DGLYQL+     P   + C+    
Subjt:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS

Query:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL
                               I  K +WH++LGHP ++VL+ V+ D  +++  ++  +FCE+CQFGK H LPF  SSS AK P +LIH+D+WGPAP+L
Subjt:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL

Query:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL
        S   F+YYV FLDD+SR+ W++PLK+K++ + AF+ F ++V NQF  +IK+++ D GGEY  + +     GI  + SCPYT  QNGRAERKHRHV E GL
Subjt:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL

Query:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI
        TLLAQA MPL YWW+AF  AV LIN LP+SV   +SP  ++  K+ ++  L+ FGCACYPCL+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GRVF+
Subjt:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI

Query:  SRHVQFNELMFPFALDF
        SRHV FNE  FPF   F
Subjt:  SRHVQFNELMFPFALDF

A0A2Z6MBG6 Integrase catalytic domain-containing protein8.4e-17245.65Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        MT EIATQ++  + +K LW   QSL G  +R++  +L+  F   RKG  KM DYL  MK+  D L  AG+PVST +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV
         +SW ++QA+LL FE R+E  N    + +     T NVAN  ++  +++N  W  S+S G                     RGGRGRG  G      +  
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQV

Query:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG
        CQVCG   H A+ C+HRFDK +S           N+S         +AF+A+ NS       V D +WY DSGASNHVT       +  E+ G   +V+G
Subjt:  CQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG

Query:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK
        NGE L I  TG + L +    L+L+++L VP ITKNL+SVSKLA DN++ +EF  +CC +KDK +G+ +LKG+L+DGLYQL+   R P            
Subjt:  NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISK

Query:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS
                S FV      SV     K  WH+RLGHP ++VLD V+  CK++V  ++  SFCE+CQ+GK H LPF  SSS A+ P EL+H+D+WGPAP+++
Subjt:  NSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLS

Query:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT
        + GF+YYV F+DD+SR+ W+YPLK+K++ + AF  F ++  NQF  +IK++Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Subjt:  TDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT

Query:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS
        LLAQA MPL YWW+AF  AV LIN LP+ V + +SP  ++  K+ ++  L++FGCACYPCL+PY+ HK Q+H+ RCV+LG+S SHKG+KCL++ GR+FIS
Subjt:  LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFIS

Query:  RHVQFNELMFPF
        RHV FNE  FPF
Subjt:  RHVQFNELMFPF

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.9e-16044.32Show/hide
Query:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG
        M  +IATQ++  + +K LW   QSL G  +++   +L+  F  +RKG  KM +YL  MK+ +D L  AGSP+S  +L+ Q L GLD EYNP+V  +  + 
Subjt:  MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG

Query:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ
        ++SW ++QA+LL FE RL+          FN+   + +  + N  N+   RG  +             N RG    +N RG RGGRG+G      SN + 
Subjt:  DISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRG-RGGRGRGYGGYGNSNNRQ

Query:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI
         CQVC   GH A+ C +RFD+   P   R  +   +  G+       SAF+A+       P    D  WY DSGA+NHVT   +      E+ G   +++
Subjt:  VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI

Query:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS
        GNGE L I  +G T L+N    L+L++VL VP+ITKNL+SVSKL  DN++ +EF  +CC +KDK +GQ +LKG L+DGLYQL+N        E C     
Subjt:  GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESIS

Query:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL
        K S                          WH++LGHP ++VLD V+ DC +++  ++  SFCE+CQFGK H LPF  SSS  + P  LIHSD+WGPAP+L
Subjt:  KNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVL

Query:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL
        S  GF+YYV F+DD+SR+ W++PLK+K+D + AF  F ++  NQF  +IKI+Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRHV E GL
Subjt:  STDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL

Query:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI
        TLLAQA MPL+YWW+AF  AV LIN LP+SV   +SP  ++  ++ ++  L+ FGCACYPCL+PY+ HK QFH+ RCV++G+S SHKG+KC+++ GR+F+
Subjt:  TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI

Query:  SRHVQFNELMFPF
        SRHV FNE  FPF
Subjt:  SRHVQFNELMFPF

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.8e-3927.79Show/hide
Query:  GNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDS-NWYADSGASNHVTGDFNNLANTKEY
        GNS  +  C  CGR GH    C+H   K    N N+   +    + + G      AF+    +      +V+D+  +  DSGAS+H+  D +   ++ E 
Subjt:  GNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDS-NWYADSGASNHVTGDFNNLANTKEY

Query:  GGNEQVVIG-NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGV
            ++ +   GE +  T  G   L N   I +L +VL   E   NL+SV +L Q+  + IEF          +SG  + K  L                
Subjt:  GGNEQVVIG-NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGV

Query:  NEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCK-LQVKENEMLS-------------FCESCQFGKSHNLPFP-
               + KNS   N+  V     Y ++     +  +WH+R GH         I+D K L++K   M S              CE C  GK   LPF  
Subjt:  NEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCK-LQVKENEMLS-------------FCESCQFGKSHNLPFP-

Query:  -LSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYA--RIHQECYRLGII
            +  K P  ++HSD+ GP   ++ D   Y+V+F+D ++ Y   Y +K K+D  + F  F++     F  ++  L  DNG EY    + Q C + GI 
Subjt:  -LSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYA--RIHQECYRLGII

Query:  SRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVL--EGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQ
           + P+T   NG +ER  R + E   T+++ A +   +W +A L A  LIN +P+  L    K+P E+ H+KK     LR FG   Y  ++     KF 
Subjt:  SRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVL--EGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQ

Query:  FHSERCVYLGFSPSHKGHKCLSASGRVFI
          S + +++G+ P+  G K   A    FI
Subjt:  FHSERCVYLGFSPSHKGHKCLSASGRVFI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-4628.14Show/hide
Query:  GRGRGY----------GGYGNSNNR-----QVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQ---NPNNSGNSGNTQPPSAFVANSNSQYACPE-TVI
        GRGR Y          G  G S NR     + C  C +PGH    C         PN  +G  +     N+   +   Q     V   N +  C   +  
Subjt:  GRGRGY----------GGYGNSNNR-----QVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQ---NPNNSGNSGNTQPPSAFVANSNSQYACPE-TVI

Query:  DSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYL-SNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDK
        +S W  D+ AS+H T    +L      G    V +GN     I   GD  + +N    L L +V  VP++  NL  +S +A D D +  +  +      K
Subjt:  DSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYL-SNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDK

Query:  RSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCES
         S   + KGV R  LY+ N       +N    E                           +S ++WHKR+GH + + L  +     +   +   +  C+ 
Subjt:  RSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCES

Query:  CQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYA--R
        C FGK H + F  SS R     +L++SD+ GP  + S  G +Y+V F+DD SR LW+Y LK K+     F  F ++V  + G ++K L+SDNGGEY    
Subjt:  CQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYA--R

Query:  IHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCL
          + C   GI    + P T   NG AER +R +VE   ++L  A +P  +W +A   A  LIN  P+  L  + P  V  +K+++++ L+ FGC  +  +
Subjt:  IHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCL

Query:  RPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQFNELMFPFALDFGK
              K    S  C+++G+     G++       +V  SR V F E     A D  +
Subjt:  RPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQFNELMFPFALDFGK

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein2.4e-2224.89Show/hide
Query:  PETVIDSN------WYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEF
        P   IDSN         DSGAS  +    + L +         +V    + +PI   G+ + +      +    L  P I  +L+S+S+LA  N      
Subjt:  PETVIDSN------WYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEF

Query:  HGDCCIIKD--KRSGQEVLKGVLRDG-LYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVI-NDCK
            C  ++  +RS   VL  +++ G  Y L+    +P         ISK  T NN +    V++YP          + H+ LGH   R +   +  +  
Subjt:  HGDCCIIKD--KRSGQEVLKGVLRDG-LYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVI-NDCK

Query:  LQVKE------NEMLSFCESCQFGKSHNLPFPLSSSRAKY-----PFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPL--KKKNDALAAFHHF
          +KE      N     C  C  GKS      +  SR KY     PF+ +H+D++GP   L      Y++ F D+ +R+ W+YPL  +++   L  F   
Subjt:  LQVKE------NEMLSFCESCQFGKSHNLPFPLSSSRAKY-----PFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPL--KKKNDALAAFHHF

Query:  ISMVRNQFGCQIKILQSDNGGEYAR--IHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGK
        ++ ++NQF  ++ ++Q D G EY    +H+     GI + Y+       +G AER +R ++    TLL  + +P   W+ A   +  + N L  S    K
Subjt:  ISMVRNQFGCQIKILQSDNGGEYAR--IHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGK

Query:  SPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGH
        S  +      L+   +  FG    P +   HN   + H          PS   +
Subjt:  SPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-10535.4Show/hide
Query:  AKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG-DISWSEMQAELLV
        A  +W  ++ ++   S      LR   +Q  KGT  + DY++ + +  D L   G P+     V +VL  L EEY P++  I  +    + +E+   LL 
Subjt:  AKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG-DISWSEMQAELLV

Query:  FEKRLELQNTQKAVVSFNHTP-TVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQ----VCQVCGRPG
         E ++       AV S    P T N  +++N    NNN       +NG R   Y+N        NN   +  +      + N+N  +     CQ+CG  G
Subjt:  FEKRLELQNTQKAVVSFNHTP-TVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQ----VCQVCGRPG

Query:  HSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVA-NSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPI
        HSA  C     + F  +VN              + QPPS F      +  A       +NW  DSGA++H+T DFNNL+  + Y G + V++ +G ++PI
Subjt:  HSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVA-NSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPI

Query:  TFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNH
        + TG T LS  +  L+L+N+L VP I KNL+SV +L   N V +EF      +KD  +G  +L+G  +D LY+    +  P          +  S+   H
Subjt:  TFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNH

Query:  SSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQV--KENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQ
        SS                   WH RLGHPA  +L+ VI++  L V    ++ LS C  C   KS+ +PF  S+  +  P E I+SD+W  +P+LS D ++
Subjt:  SSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQV--KENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQ

Query:  YYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQA
        YYV+F+D ++RY WLYPLK+K+     F  F +++ N+F  +I    SDNGGE+  + +   + GI    S P+T   NG +ERKHRH+VETGLTLL+ A
Subjt:  YYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQA

Query:  SMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQ
        S+P  YW  AF  AV LIN LPT +L+ +SP + L     N+  LR FGCACYP LRPY+ HK    S +CV+LG+S +   + CL   + R++ISRHV+
Subjt:  SMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQ

Query:  FNELMFPFA
        F+E  FPF+
Subjt:  FNELMFPFA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-9735.66Show/hide
Query:  DNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG-DISWSEMQAELLVFEKRLELQNTQKAV-VSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNG
        D L   G P+     V +VL  L ++Y P++  I  +    S +E+   L+  E +L   N+ + V ++ N     N   N+N  N+ +NR  NY+++N 
Subjt:  DNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG-DISWSEMQAELLVFEKRLELQNTQKAV-VSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNG

Query:  QRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMC--YHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYA
        +   +  ++   GS  +N + +   GR             CQ+C   GHSA  C   H+F             Q+  N   S  T P + +   +N    
Subjt:  QRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMC--YHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYA

Query:  CPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCC
         P     +NW  DSGA++H+T DFNNL+  + Y G + V+I +G ++PIT TG   L   +  L LN VL VP I KNL+SV +L   N V +EF     
Subjt:  CPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCC

Query:  IIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQV-KENEM
         +KD  +G  +L+G  +D LY+       P  +       +   +   HSS                   WH RLGHP+  +L+ VI++  L V   +  
Subjt:  IIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQV-KENEM

Query:  LSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGG
        L  C  C   KSH +PF  S+  +  P E I+SD+W  +P+LS D ++YYV+F+D ++RY WLYPLK+K+     F  F S+V N+F  +I  L SDNGG
Subjt:  LSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGG

Query:  EYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCAC
        E+  +     + GI    S P+T   NG +ERKHRH+VE GLTLL+ AS+P  YW  AF  AV LIN LPT +L+ +SP + L  +  N+  L+ FGCAC
Subjt:  EYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCAC

Query:  YPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQFNELMFPFA-LDFGKPSSSPTFSPS
        YP LRPY+ HK +  S++C ++G+S +   + CL   +GR++ SRHVQF+E  FPF+  +FG  +S    S S
Subjt:  YPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQFNELMFPFA-LDFGKPSSSPTFSPS

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.4e-0927.87Show/hide
Query:  VNEGCSESISK-------NSTANNHSSVFVV------SRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPL
        V   CSE + K           N H S++++          L+        +WH RL H + R ++ ++    L   +   L FCE C +GK+H + F  
Subjt:  VNEGCSESISK-------NSTANNHSSVFVV------SRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPL

Query:  SSSRAKYPFELIHSDLWGPAPV
             K P + +HSDLWG   V
Subjt:  SSSRAKYPFELIHSDLWGPAPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACCTGAAATTGCCACACAAGTCATGGGATTCGACAATGCGAAAGATCTCTGGAGTGCTATTCAGAGTTTATTTGGTATTCAATCAAGAGCAGAGGAAGATTTCCT
CAGACAAACCTTTCAACAATCTAGAAAAGGTACGTCCAAAATGACTGATTATTTACGATTAATGAAGTCTCATGCCGATAATCTAGGGCAAGCAGGAAGTCCAGTTTCGA
CAAGGAACTTAGTATCTCAAGTATTGCTCGGACTCGATGAGGAGTATAATCCGATTGTAGCCATGATCCATGGAAGGGGAGACATCTCGTGGTCTGAAATGCAGGCCGAA
CTCCTTGTGTTTGAGAAGAGATTGGAACTACAGAATACTCAAAAAGCCGTTGTCTCCTTTAATCACACTCCCACCGTCAATGTGGCAAATAACAAGAACAACATGAATCA
AAACAACAATAGAGGCTGGAATTACAGTCACAGTAATGGCCAGAGAGGACAGTTCTATAACAACAATCAACGTGGGGGTTCAAATTTTAACAATGGCAGGGGACGAGGAG
GCCGTGGCAGAGGATATGGAGGATATGGCAACTCAAATAATCGCCAAGTTTGCCAAGTATGTGGAAGACCTGGTCATTCGGCACTTATGTGTTATCACAGATTTGATAAA
GAGTTCAGCCCAAATGTGAACAGAGGTGGCAATCAGAATCCAAATAACTCAGGTAACTCAGGGAACACTCAGCCACCGTCTGCTTTTGTGGCCAACTCAAACAGTCAATA
TGCTTGTCCTGAGACAGTAATAGACTCCAACTGGTACGCTGACAGTGGAGCTTCGAATCATGTCACCGGAGACTTCAACAATCTTGCTAATACCAAGGAATATGGAGGTA
ATGAACAAGTGGTCATAGGTAATGGAGAATCTCTCCCTATTACTTTCACTGGAGATACTTATTTATCTAATGGTGCTGCTATTCTTAGTCTCAATAACGTTTTGTGTGTT
CCTGAAATAACTAAAAACCTAGTTAGTGTATCAAAACTAGCTCAGGACAATGACGTTTTCATTGAATTTCATGGTGATTGTTGTATTATTAAGGACAAGCGTTCGGGTCA
GGAGGTGCTGAAAGGAGTACTTAGGGACGGTCTCTACCAGCTTAACAATGTCACGAGGGTACCAGGAGTGAATGAAGGATGTTCTGAGTCAATTTCCAAGAATTCTACGG
CCAATAATCATTCCTCAGTTTTTGTTGTTTCTCGTTATCCACTGAGTGTTAATATTATTGTGTCTAAGAATGTATGGCACAAACGTCTGGGTCATCCAGCATCTCGGGTT
TTAGATTTTGTTATCAATGATTGTAAGCTTCAAGTTAAAGAGAATGAGATGCTCAGTTTTTGCGAGTCATGTCAATTTGGCAAGTCACACAATTTACCTTTCCCTCTATC
TTCAAGTCGGGCAAAGTATCCATTTGAATTGATTCACTCGGACCTTTGGGGTCCTGCTCCGGTCTTGTCTACTGATGGCTTTCAATATTATGTTTTATTTCTAGATGATT
ACAGCAGATATCTATGGCTTTATCCATTGAAAAAGAAAAATGATGCGCTTGCTGCCTTTCACCACTTTATCTCTATGGTCAGGAATCAGTTTGGTTGTCAAATAAAGATT
CTTCAGTCTGACAATGGTGGCGAATACGCTAGGATTCATCAGGAATGTTATCGACTTGGTATTATCTCTCGATATTCTTGTCCCTACACGTTTGCACAAAATGGAAGAGC
AGAACGGAAGCATAGACACGTTGTTGAAACTGGTCTGACATTGCTTGCTCAGGCGTCAATGCCTCTTCAGTATTGGTGGGATGCGTTTTTAGCGGCTGTCCAGCTGATAA
ATGGTTTACCAACCTCAGTTCTTGAAGGTAAGTCACCATTGGAAGTGTTACATCACAAGAAACTTAATTTTGCAGGTCTACGCTCATTTGGATGTGCCTGTTATCCATGC
CTGAGGCCTTACCATAACCACAAATTTCAGTTTCACTCCGAGAGGTGCGTTTATCTTGGCTTCAGCCCCTCTCATAAAGGACATAAATGCCTTAGTGCTTCTGGTCGTGT
ATTTATTTCTCGACATGTGCAGTTTAATGAACTCATGTTTCCATTTGCACTTGATTTTGGAAAACCCTCAAGCTCCCCAACATTTTCACCCTCTCATGGTCCATCTATCT
TAACCTGGTTTCAATCTCTAGAACACAGTCACTTATCCCAAGAAAATACCAACAGGCAATTGAAATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACCTGAAATTGCCACACAAGTCATGGGATTCGACAATGCGAAAGATCTCTGGAGTGCTATTCAGAGTTTATTTGGTATTCAATCAAGAGCAGAGGAAGATTTCCT
CAGACAAACCTTTCAACAATCTAGAAAAGGTACGTCCAAAATGACTGATTATTTACGATTAATGAAGTCTCATGCCGATAATCTAGGGCAAGCAGGAAGTCCAGTTTCGA
CAAGGAACTTAGTATCTCAAGTATTGCTCGGACTCGATGAGGAGTATAATCCGATTGTAGCCATGATCCATGGAAGGGGAGACATCTCGTGGTCTGAAATGCAGGCCGAA
CTCCTTGTGTTTGAGAAGAGATTGGAACTACAGAATACTCAAAAAGCCGTTGTCTCCTTTAATCACACTCCCACCGTCAATGTGGCAAATAACAAGAACAACATGAATCA
AAACAACAATAGAGGCTGGAATTACAGTCACAGTAATGGCCAGAGAGGACAGTTCTATAACAACAATCAACGTGGGGGTTCAAATTTTAACAATGGCAGGGGACGAGGAG
GCCGTGGCAGAGGATATGGAGGATATGGCAACTCAAATAATCGCCAAGTTTGCCAAGTATGTGGAAGACCTGGTCATTCGGCACTTATGTGTTATCACAGATTTGATAAA
GAGTTCAGCCCAAATGTGAACAGAGGTGGCAATCAGAATCCAAATAACTCAGGTAACTCAGGGAACACTCAGCCACCGTCTGCTTTTGTGGCCAACTCAAACAGTCAATA
TGCTTGTCCTGAGACAGTAATAGACTCCAACTGGTACGCTGACAGTGGAGCTTCGAATCATGTCACCGGAGACTTCAACAATCTTGCTAATACCAAGGAATATGGAGGTA
ATGAACAAGTGGTCATAGGTAATGGAGAATCTCTCCCTATTACTTTCACTGGAGATACTTATTTATCTAATGGTGCTGCTATTCTTAGTCTCAATAACGTTTTGTGTGTT
CCTGAAATAACTAAAAACCTAGTTAGTGTATCAAAACTAGCTCAGGACAATGACGTTTTCATTGAATTTCATGGTGATTGTTGTATTATTAAGGACAAGCGTTCGGGTCA
GGAGGTGCTGAAAGGAGTACTTAGGGACGGTCTCTACCAGCTTAACAATGTCACGAGGGTACCAGGAGTGAATGAAGGATGTTCTGAGTCAATTTCCAAGAATTCTACGG
CCAATAATCATTCCTCAGTTTTTGTTGTTTCTCGTTATCCACTGAGTGTTAATATTATTGTGTCTAAGAATGTATGGCACAAACGTCTGGGTCATCCAGCATCTCGGGTT
TTAGATTTTGTTATCAATGATTGTAAGCTTCAAGTTAAAGAGAATGAGATGCTCAGTTTTTGCGAGTCATGTCAATTTGGCAAGTCACACAATTTACCTTTCCCTCTATC
TTCAAGTCGGGCAAAGTATCCATTTGAATTGATTCACTCGGACCTTTGGGGTCCTGCTCCGGTCTTGTCTACTGATGGCTTTCAATATTATGTTTTATTTCTAGATGATT
ACAGCAGATATCTATGGCTTTATCCATTGAAAAAGAAAAATGATGCGCTTGCTGCCTTTCACCACTTTATCTCTATGGTCAGGAATCAGTTTGGTTGTCAAATAAAGATT
CTTCAGTCTGACAATGGTGGCGAATACGCTAGGATTCATCAGGAATGTTATCGACTTGGTATTATCTCTCGATATTCTTGTCCCTACACGTTTGCACAAAATGGAAGAGC
AGAACGGAAGCATAGACACGTTGTTGAAACTGGTCTGACATTGCTTGCTCAGGCGTCAATGCCTCTTCAGTATTGGTGGGATGCGTTTTTAGCGGCTGTCCAGCTGATAA
ATGGTTTACCAACCTCAGTTCTTGAAGGTAAGTCACCATTGGAAGTGTTACATCACAAGAAACTTAATTTTGCAGGTCTACGCTCATTTGGATGTGCCTGTTATCCATGC
CTGAGGCCTTACCATAACCACAAATTTCAGTTTCACTCCGAGAGGTGCGTTTATCTTGGCTTCAGCCCCTCTCATAAAGGACATAAATGCCTTAGTGCTTCTGGTCGTGT
ATTTATTTCTCGACATGTGCAGTTTAATGAACTCATGTTTCCATTTGCACTTGATTTTGGAAAACCCTCAAGCTCCCCAACATTTTCACCCTCTCATGGTCCATCTATCT
TAACCTGGTTTCAATCTCTAGAACACAGTCACTTATCCCAAGAAAATACCAACAGGCAATTGAAATATTGA
Protein sequenceShow/hide protein sequence
MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAE
LLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDK
EFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYLSNGAAILSLNNVLCV
PEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRV
LDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKI
LQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC
LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPFALDFGKPSSSPTFSPSHGPSILTWFQSLEHSHLSQENTNRQLKY