; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000912 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000912
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:19479051..19480772
RNA-Seq ExpressionLag0000912
SyntenyLag0000912
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN71553.1 hypothetical protein VITISV_034738 [Vitis vinifera]8.3e-11245.52Show/hide
Query:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------
        LL +K  V S F++FK ++E QF+T I+ LQT+ G EF  L   L+ +GILHR S PY P+QNG VERK RHVVE                         
Subjt:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------

Query:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL
                          LY    DY   K+FGC C+P L+  N+NK Q RS  C+FLGY+ S KGYLCL+  + R+Y+SRHVVF E +F FQ+L     
Subjt:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL

Query:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV
                  S P     H                P  PLPP+    P  +++   ++T  + +P S  +       SS+S PP      + E+    P 
Subjt:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV

Query:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRY
         +  +  + I  HPM+TRSK+ + K K +L +   +T EP+  K+ALQD NWK AM+ EY+AL++N+TWSLVP P++ K++GCKWVF++K KP+GSIDRY
Subjt:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRY

Query:  KARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKL
        KARLVAQG+ Q  GID+F+TFSPVVKP TIR+VLS+AVS  W I+QLDVHNAFL+G+L EQV+M QP GF   S P+HV +L KALYG+KQ+PRAWF KL
Subjt:  KARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKL

Query:  ISCLLEWGFLASKADASMFIFKSGS
         S LL+ GF  S+ADAS+F F S S
Subjt:  ISCLLEWGFLASKADASMFIFKSGS

CAN73924.1 hypothetical protein VITISV_041509 [Vitis vinifera]9.5e-10842.97Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L +K   +  F+ F+ M+E Q +T I+ +Q+D+G EF      L ++GILH+ S P+ PQQNG  ERK RH+VE                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP
                         L+ +  +Y  L++FGC CFP LR Y  +KL +RS  CVFLGY+ + KGYLCLD+ + R+Y+SR+V+F+E SF FQS       
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP

Query:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPP--PTPALVLSESLPNQP
           SS   PS P            SP + PS TP  +  P  +AP   A ++PI                      +S S PP  P P    S + P+ P
Subjt:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPP--PTPALVLSESLPNQP

Query:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR
               P  +N HPMVTR+K+ + K ++++V     T EP+ Y +A ++ +W  AM++EY+AL+RN TWSLVP P+   +VGC+W++++K +PDGSIDR
Subjt:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR

Query:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK
        +KARLVAQG+TQ  GIDYF TFSPVVKP TIR++L+LAVSF W +RQLDV NAFL+G+L E+V+M QP GFV+ ++P++V KLHKALYG+KQ+PRAWF K
Subjt:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK

Query:  LISCLLEWGFLASKADASMFIFKSGS
        L   LL++GF +S+AD S+FIF + +
Subjt:  LISCLLEWGFLASKADASMFIFKSGS

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.2e-11243.51Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L SK   +S+F++FK ++E QF ++I+ L++D+G EFK  SS LA++GI  + S PY P+QNG  ERK RH++E                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP
                         L+G++ +Y + K+FGC C+P++R YN NKL +RS +CVFLGYSS+ KGY+CL+  +GRLYV+RHVVF+E  F FQS P     
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP

Query:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP
         V       + P  + L C    SSP +S   S T P+   PP                      P S+ S                P L+    +P   
Subjt:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP

Query:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR
        +     HP+  N HPMVTR+K  + K K +  +      EP  + +A++DSNW  AM+ E+ AL RN TW LVP P++  ++GCKWV+++K KPDG++DR
Subjt:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR

Query:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK
        YKARLVAQG+TQ LG+DYF+TFSPVVK +TIRI+L++A+SF W + QLDV NAFLHG+L E V+M+QP GF+++ +PSHV KL+KALYG+KQ+PRAW++K
Subjt:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK

Query:  LISCLLEWGFLASKADASMFIFKS
        L + LL WGF AS+AD+SMFI  S
Subjt:  LISCLLEWGFLASKADASMFIFKS

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.2e-11243.51Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L SK   +S+F++FK ++E QF ++I+ L++D+G EFK  SS LA++GI  + S PY P+QNG  ERK RH++E                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP
                         L+G++ +Y + K+FGC C+P++R YN NKL +RS +CVFLGYSS+ KGY+CL+  +GRLYV+RHVVF+E  F FQS P     
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP

Query:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP
         V       + P  + L C    SSP +S   S T P+   PP                      P S+ S                P L+    +P   
Subjt:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP

Query:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR
        +     HP+  N HPMVTR+K  + K K +  +      EP  + +A++DSNW  AM+ E+ AL RN TW LVP P++  ++GCKWV+++K KPDG++DR
Subjt:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR

Query:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK
        YKARLVAQG+TQ LG+DYF+TFSPVVK +TIRI+L++A+SF W + QLDV NAFLHG+L E V+M+QP GF+++ +PSHV KL+KALYG+KQ+PRAW++K
Subjt:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK

Query:  LISCLLEWGFLASKADASMFIFKS
        L + LL WGF AS+AD+SMFI  S
Subjt:  LISCLLEWGFLASKADASMFIFKS

RVX14515.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.3e-11245.64Show/hide
Query:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------
        LL SK   +S FL+FKVMIETQF TK+R LQTD G EF+  ++ L   GILHR+SYP   QQNG VERK+RHVVE                         
Subjt:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------

Query:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL
                          LY    +YS L+++GC+C+PFLR +N +K  +RS +C F+GY+S  KGYLCL++ +G++++SRHV+F EL          D 
Subjt:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL

Query:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV
        PF K S +     + S  HC    + PL SP ITP +  LP +++P    S+  +PIS T            S S  SS   P PT              
Subjt:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV

Query:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPV---TCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSI
          PS+   S   HPM TR+K  +FKPK +L     +    CEP ++KEA++   W+ AM  E+EAL+ NKTW LVP P ++ ++GC+WV+++K KPDG++
Subjt:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPV---TCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSI

Query:  DRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWF
        +RYKARLVA+G+ Q  G DYF+TFSPVVKP TIR+VLSLA+S  W IRQLDVHNAFL+G+L EQV+M QP GFV  S P+ V KL KALYG+KQ+P AWF
Subjt:  DRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWF

Query:  SKLISCLLEWGFLASKADASMFIFKSGS
        +KL S L++WGF  S+AD SMF++ + S
Subjt:  SKLISCLLEWGFLASKADASMFIFKSGS

TrEMBL top hitse value%identityAlignment
A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-11243.51Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L SK   +S+F++FK ++E QF ++I+ L++D+G EFK  SS LA++GI  + S PY P+QNG  ERK RH++E                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP
                         L+G++ +Y + K+FGC C+P++R YN NKL +RS +CVFLGYSS+ KGY+CL+  +GRLYV+RHVVF+E  F FQS P     
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP

Query:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP
         V       + P  + L C    SSP +S   S T P+   PP                      P S+ S                P L+    +P   
Subjt:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP

Query:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR
        +     HP+  N HPMVTR+K  + K K +  +      EP  + +A++DSNW  AM+ E+ AL RN TW LVP P++  ++GCKWV+++K KPDG++DR
Subjt:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR

Query:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK
        YKARLVAQG+TQ LG+DYF+TFSPVVK +TIRI+L++A+SF W + QLDV NAFLHG+L E V+M+QP GF+++ +PSHV KL+KALYG+KQ+PRAW++K
Subjt:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK

Query:  LISCLLEWGFLASKADASMFIFKS
        L + LL WGF AS+AD+SMFI  S
Subjt:  LISCLLEWGFLASKADASMFIFKS

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-11243.51Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L SK   +S+F++FK ++E QF ++I+ L++D+G EFK  SS LA++GI  + S PY P+QNG  ERK RH++E                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP
                         L+G++ +Y + K+FGC C+P++R YN NKL +RS +CVFLGYSS+ KGY+CL+  +GRLYV+RHVVF+E  F FQS P     
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP

Query:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP
         V       + P  + L C    SSP +S   S T P+   PP                      P S+ S                P L+    +P   
Subjt:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISP--SITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQP

Query:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR
        +     HP+  N HPMVTR+K  + K K +  +      EP  + +A++DSNW  AM+ E+ AL RN TW LVP P++  ++GCKWV+++K KPDG++DR
Subjt:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR

Query:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK
        YKARLVAQG+TQ LG+DYF+TFSPVVK +TIRI+L++A+SF W + QLDV NAFLHG+L E V+M+QP GF+++ +PSHV KL+KALYG+KQ+PRAW++K
Subjt:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK

Query:  LISCLLEWGFLASKADASMFIFKS
        L + LL WGF AS+AD+SMFI  S
Subjt:  LISCLLEWGFLASKADASMFIFKS

A0A438JZY3 Retrovirus-related Pol polyprotein from transposon RE16.2e-11345.64Show/hide
Query:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------
        LL SK   +S FL+FKVMIETQF TK+R LQTD G EF+  ++ L   GILHR+SYP   QQNG VERK+RHVVE                         
Subjt:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------

Query:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL
                          LY    +YS L+++GC+C+PFLR +N +K  +RS +C F+GY+S  KGYLCL++ +G++++SRHV+F EL          D 
Subjt:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL

Query:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV
        PF K S +     + S  HC    + PL SP ITP +  LP +++P    S+  +PIS T            S S  SS   P PT              
Subjt:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV

Query:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPV---TCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSI
          PS+   S   HPM TR+K  +FKPK +L     +    CEP ++KEA++   W+ AM  E+EAL+ NKTW LVP P ++ ++GC+WV+++K KPDG++
Subjt:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPV---TCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSI

Query:  DRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWF
        +RYKARLVA+G+ Q  G DYF+TFSPVVKP TIR+VLSLA+S  W IRQLDVHNAFL+G+L EQV+M QP GFV  S P+ V KL KALYG+KQ+P AWF
Subjt:  DRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWF

Query:  SKLISCLLEWGFLASKADASMFIFKSGS
        +KL S L++WGF  S+AD SMF++ + S
Subjt:  SKLISCLLEWGFLASKADASMFIFKSGS

A5AQ04 Integrase catalytic domain-containing protein4.0e-11245.52Show/hide
Query:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------
        LL +K  V S F++FK ++E QF+T I+ LQT+ G EF  L   L+ +GILHR S PY P+QNG VERK RHVVE                         
Subjt:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE-------------------------

Query:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL
                          LY    DY   K+FGC C+P L+  N+NK Q RS  C+FLGY+ S KGYLCL+  + R+Y+SRHVVF E +F FQ+L     
Subjt:  ------------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDL

Query:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV
                  S P     H                P  PLPP+    P  +++   ++T  + +P S  +       SS+S PP      + E+    P 
Subjt:  PFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV

Query:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRY
         +  +  + I  HPM+TRSK+ + K K +L +   +T EP+  K+ALQD NWK AM+ EY+AL++N+TWSLVP P++ K++GCKWVF++K KP+GSIDRY
Subjt:  DNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRY

Query:  KARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKL
        KARLVAQG+ Q  GID+F+TFSPVVKP TIR+VLS+AVS  W I+QLDVHNAFL+G+L EQV+M QP GF   S P+HV +L KALYG+KQ+PRAWF KL
Subjt:  KARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKL

Query:  ISCLLEWGFLASKADASMFIFKSGS
         S LL+ GF  S+ADAS+F F S S
Subjt:  ISCLLEWGFLASKADASMFIFKSGS

A5AYB0 Integrase catalytic domain-containing protein4.6e-10842.97Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L +K   +  F+ F+ M+E Q +T I+ +Q+D+G EF      L ++GILH+ S P+ PQQNG  ERK RH+VE                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP
                         L+ +  +Y  L++FGC CFP LR Y  +KL +RS  CVFLGY+ + KGYLCLD+ + R+Y+SR+V+F+E SF FQS       
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLP

Query:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPP--PTPALVLSESLPNQP
           SS   PS P            SP + PS TP  +  P  +AP   A ++PI                      +S S PP  P P    S + P+ P
Subjt:  FVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPP--PTPALVLSESLPNQP

Query:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR
               P  +N HPMVTR+K+ + K ++++V     T EP+ Y +A ++ +W  AM++EY+AL+RN TWSLVP P+   +VGC+W++++K +PDGSIDR
Subjt:  VDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDR

Query:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK
        +KARLVAQG+TQ  GIDYF TFSPVVKP TIR++L+LAVSF W +RQLDV NAFL+G+L E+V+M QP GFV+ ++P++V KLHKALYG+KQ+PRAWF K
Subjt:  YKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSK

Query:  LISCLLEWGFLASKADASMFIFKSGS
        L   LL++GF +S+AD S+FIF + +
Subjt:  LISCLLEWGFLASKADASMFIFKSGS

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.2e-3440.34Show/hide
Query:  SNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDV
        S+W+ A++ E  A   N TW++   P ++ +V  +WVF VK    G+  RYKARLVA+G+TQ   IDY +TF+PV + ++ R +LSL + +  ++ Q+DV
Subjt:  SNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDV

Query:  HNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSGS
          AFL+G L E++YM+ P G    S   +V KL+KA+YG+KQ+ R WF      L E  F+ S  D  ++I   G+
Subjt:  HNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSGS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-4828.41Show/hide
Query:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEF--KPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------LYGRA----
        +L++K  V  +F +F  ++E +   K++ L++D+G E+  +      +S+GI H  + P  PQ NG  ER NR +VE               +G A    
Subjt:  LLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEF--KPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------LYGRA----

Query:  -------------------------LDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPST
                                 + YS LK+FGC  F  +      KL  +S  C+F+GY   + GY   D    ++  SR VVF E     +   + 
Subjt:  -------------------------LDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPST

Query:  DLPFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLP-N
        D+                         S  +   I P  V +P T                       S++ + + S+   +S     P  V+ +    +
Subjt:  DLPFVKSSFLDPSYPLGSLLHCVNKFSSPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLP-N

Query:  QPVDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTC---EPKNYKEAL---QDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKL
        + V+   +       H  + RS+    + + +  T+  +     EP++ KE L   + +    AM  E E+L +N T+ LV LP  ++ + CKWVF++K 
Subjt:  QPVDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAPVTC---EPKNYKEAL---QDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKL

Query:  KPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQ
          D  + RYKARLV +G+ Q  GID+ + FSPVVK  +IR +LSLA S   ++ QLDV  AFLHG+L E++YM+QP GF        V KL+K+LYG+KQ
Subjt:  KPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQ

Query:  SPRAWFSKLISCLLEWGFLASKADASMF
        +PR W+ K  S +    +L + +D  ++
Subjt:  SPRAWFSKLISCLLEWGFLASKADASMF

P92520 Uncharacterized mitochondrial protein AtMg008206.5e-2750.4Show/hide
Query:  MVTRSKTSVFK--PKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQA
        M+TRSK  + K  PK  L     +  EPK+   AL+D  W  AM  E +AL RNKTW LVP P ++ ++GCKWVF+ KL  DG++DR KARLVA+G+ Q 
Subjt:  MVTRSKTSVFK--PKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQA

Query:  LGIDYFKTFSPVVKPATIRIVLSLA
         GI + +T+SPVV+ ATIR +L++A
Subjt:  LGIDYFKTFSPVVKPATIRIVLSLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.3e-8736.32Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L+ K  V   F+ FK ++E +F+T+I    +D+G EF  L    + +GI H  S P+ P+ NG  ERK+RH+VE                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSL------
                         L+G + +Y  L++FGC+C+P+LR YN +KL  +S++CVFLGYS +Q  YLCL + + RLY+SRHV F+E  F F +       
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSL------

Query:  -------------PSTDLP---------------FVKSSFLDPSYPLGSLLHCVNKFSSPLIS--PSITPPAVPL----PPTAAPI-----PHAS-----
                     P T LP                  +    PS P  +     +   S   S  PS   P  P      PT  P       H+S     
Subjt:  -------------PSTDLP---------------FVKSSFLDPSYPLGSLLHCVNKFSSPLIS--PSITPPAVPL----PPTAAPI-----PHAS-----

Query:  -----AAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPVDNPSNHPSSINVHPMVTRSKTSVFK--PKAWLVTKAPVTCEPKNYK
              +P  ++ +      SSSSS S ++ +S S   PTP  +L    P       +N+ + +N H M TR+K  + K  PK  L        EP+   
Subjt:  -----AAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPVDNPSNHPSSINVHPMVTRSKTSVFK--PKAWLVTKAPVTCEPKNYK

Query:  EALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDR-KVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQ
        +AL+D  W+ AM +E  A I N TW LVP P     +VGC+W+F  K   DGS++RYKARLVA+GY Q  G+DY +TFSPV+K  +IRIVL +AV   W 
Subjt:  EALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDR-KVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQ

Query:  IRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSG
        IRQLDV+NAFL G L + VYM QP GF+    P++V KL KALYG+KQ+PRAW+ +L + LL  GF+ S +D S+F+ + G
Subjt:  IRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-8937.33Show/hide
Query:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------
        L+ K  V   F+ FK ++E +F+T+I  L +D+G EF  L   L+ +GI H  S P+ P+ NG  ERK+RH+VE                          
Subjt:  LQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRHVVE--------------------------

Query:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQ--------
                         L+G+  +Y  LK+FGC+C+P+LR YN +KL+ +SK+C F+GYS +Q  YLCL I +GRLY SRHV F+E  F F         
Subjt:  -----------------LYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQ--------

Query:  ----------------SLPSTDL---------PFVKSSFLDPSYP---LGSLLHCVNKFSSPLISPSITPPAVPL----PPTAAPIP---HASAAPI---
                        +LP+T L         P + +S   PS P     + +   N  SS + SPS + P  P      PTA P       S +PI   
Subjt:  ----------------SLPSTDL---------PFVKSSFLDPSYP---LGSLLHCVNKFSSPLISPSITPPAVPL----PPTAAPIP---HASAAPI---

Query:  --------------------PISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPVDNPSNHPSSINVHPMVTRSKTSVFKP--KAWLVTK
                            PIS+ +  +P +S S  +  S SS S PP  P       LP  P+    N  + +N H M TR+K  + KP  K    T 
Subjt:  --------------------PISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPVDNPSNHPSSINVHPMVTRSKTSVFKP--KAWLVTK

Query:  APVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDR-KVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRI
             EP+   +A++D  W+ AM +E  A I N TW LVP P     +VGC+W+F  K   DGS++RYKARLVA+GY Q  G+DY +TFSPV+K  +IRI
Subjt:  APVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDR-KVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRI

Query:  VLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSG
        VL +AV   W IRQLDV+NAFL G L ++VYM QP GFV    P +V +L KA+YG+KQ+PRAW+ +L + LL  GF+ S +D S+F+ + G
Subjt:  VLSLAVSFGWQIRQLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-4235.74Show/hide
Query:  DGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV---DNPSNHPSSINVHPMVTRSKTSVFKP--KAWLVTKAPVTCEPKNYKEALQDSNWKAAM
        D    +SSSS      ++I +  P P++  S     +P    D   +  +S+ +H +          P   ++LV  A    EP  Y EA +   W  AM
Subjt:  DGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPV---DNPSNHPSSINVHPMVTRSKTSVFKP--KAWLVTKAPVTCEPKNYKEALQDSNWKAAM

Query:  DNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHG
        D+E  A+    TW +  LP ++K +GCKWV+++K   DG+I+RYKARLVA+GYTQ  GID+ +TFSPV K  +++++L+++  + + + QLD+ NAFL+G
Subjt:  DNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIRQLDVHNAFLHG

Query:  ELLEQVYMKQPTGFVS----TSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSGSKFGC
        +L E++YMK P G+ +    +  P+ V  L K++YG+KQ+ R WF K    L+ +GF+ S +D + F+  + + F C
Subjt:  ELLEQVYMKQPTGFVS----TSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSGSKFGC

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.6e-2850.4Show/hide
Query:  MVTRSKTSVFK--PKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQA
        M+TRSK  + K  PK  L     +  EPK+   AL+D  W  AM  E +AL RNKTW LVP P ++ ++GCKWVF+ KL  DG++DR KARLVA+G+ Q 
Subjt:  MVTRSKTSVFK--PKAWLVTKAPVTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQA

Query:  LGIDYFKTFSPVVKPATIRIVLSLA
         GI + +T+SPVV+ ATIR +L++A
Subjt:  LGIDYFKTFSPVVKPATIRIVLSLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGACCAATCACACCCTAAGATGATTGGTGGCGACTCCAAACCTCAAACCTCGAAAGAGACCCTTAGGAAAGGACATCGGCCGCTCCGCGTCGCGATGACGTTGAG
AAATTTATTACAATCTAAGCAAGATGTTGTCTCCATTTTCCTTCGATTCAAAGTCATGATAGAAACACAATTTAAAACCAAAATTAGGGCCCTTCAAACTGACAGTGGTA
CTGAATTCAAACCTTTGTCTTCTATTTTAGCCTCTAATGGTATCTTACATAGAATATCCTATCCTTACGCGCCACAACAAAATGGAAGCGTCGAAAGGAAAAACAGGCAT
GTAGTCGAGTTATATGGTCGTGCCCTTGATTACAGTCTCTTAAAGATGTTCGGATGCTCTTGTTTTCCTTTCTTAAGAACTTATAATGCCAACAAGCTCCAATTTAGGTC
CAAGGAATGTGTCTTTCTTGGTTATAGTTCATCTCAGAAAGGGTATCTTTGTCTTGATATTCATTCTGGTCGCCTGTATGTCTCAAGGCATGTAGTTTTCAACGAGTTGT
CTTTCCTTTTTCAATCTTTGCCCTCTACTGATCTCCCCTTTGTCAAATCTTCTTTTCTTGATCCCTCTTATCCTCTTGGGTCATTACTTCATTGTGTAAATAAATTTTCG
TCTCCTTTGATATCTCCCTCTATCACTCCTCCTGCTGTTCCATTGCCTCCCACGGCTGCTCCCATCCCACATGCTTCAGCTGCTCCTATTCCCATTTCCACAACTTATGA
TGGTTCTCCTTGTTCTTCGTCCTCTTCTGGATCCTTTTCTTCCTTTTCTTCCATCTCCCACCCCCCTCCAACCCCTGCCCTTGTTCTGTCCGAATCCTTACCAAATCAGC
CAGTTGATAACCCTTCTAACCATCCTTCATCCATTAATGTTCATCCAATGGTCACAAGGTCTAAAACTAGTGTGTTTAAACCAAAGGCATGGTTAGTGACAAAAGCCCCT
GTTACATGTGAACCAAAGAACTACAAGGAGGCGTTACAAGACTCCAATTGGAAAGCTGCCATGGATAATGAATATGAGGCCCTGATTCGAAATAAGACATGGTCCCTGGT
TCCTTTACCTAATGATCGTAAGGTAGTCGGTTGTAAGTGGGTTTTTCGTGTTAAACTTAAGCCCGATGGATCCATTGATAGGTATAAGGCCCGATTGGTTGCTCAAGGTT
ACACTCAGGCCCTTGGTATCGATTACTTCAAAACTTTCAGCCCTGTTGTAAAGCCAGCTACAATACGTATTGTTTTATCTTTAGCTGTTTCCTTTGGTTGGCAGATTCGG
CAGCTTGATGTACACAATGCATTCCTTCATGGTGAACTTTTGGAGCAAGTTTACATGAAGCAGCCAACTGGTTTTGTGAGCACCTCTCATCCTTCTCATGTCTTAAAGTT
GCACAAGGCTTTATATGGTATCAAACAGAGTCCTCGCGCTTGGTTCTCTAAGTTGATTTCTTGTCTTTTAGAGTGGGGGTTTCTTGCTTCTAAGGCTGATGCGTCTATGT
TTATTTTCAAATCTGGTTCTAAGTTTGGTTGTTTTTCTTATTTATGTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGACCAATCACACCCTAAGATGATTGGTGGCGACTCCAAACCTCAAACCTCGAAAGAGACCCTTAGGAAAGGACATCGGCCGCTCCGCGTCGCGATGACGTTGAG
AAATTTATTACAATCTAAGCAAGATGTTGTCTCCATTTTCCTTCGATTCAAAGTCATGATAGAAACACAATTTAAAACCAAAATTAGGGCCCTTCAAACTGACAGTGGTA
CTGAATTCAAACCTTTGTCTTCTATTTTAGCCTCTAATGGTATCTTACATAGAATATCCTATCCTTACGCGCCACAACAAAATGGAAGCGTCGAAAGGAAAAACAGGCAT
GTAGTCGAGTTATATGGTCGTGCCCTTGATTACAGTCTCTTAAAGATGTTCGGATGCTCTTGTTTTCCTTTCTTAAGAACTTATAATGCCAACAAGCTCCAATTTAGGTC
CAAGGAATGTGTCTTTCTTGGTTATAGTTCATCTCAGAAAGGGTATCTTTGTCTTGATATTCATTCTGGTCGCCTGTATGTCTCAAGGCATGTAGTTTTCAACGAGTTGT
CTTTCCTTTTTCAATCTTTGCCCTCTACTGATCTCCCCTTTGTCAAATCTTCTTTTCTTGATCCCTCTTATCCTCTTGGGTCATTACTTCATTGTGTAAATAAATTTTCG
TCTCCTTTGATATCTCCCTCTATCACTCCTCCTGCTGTTCCATTGCCTCCCACGGCTGCTCCCATCCCACATGCTTCAGCTGCTCCTATTCCCATTTCCACAACTTATGA
TGGTTCTCCTTGTTCTTCGTCCTCTTCTGGATCCTTTTCTTCCTTTTCTTCCATCTCCCACCCCCCTCCAACCCCTGCCCTTGTTCTGTCCGAATCCTTACCAAATCAGC
CAGTTGATAACCCTTCTAACCATCCTTCATCCATTAATGTTCATCCAATGGTCACAAGGTCTAAAACTAGTGTGTTTAAACCAAAGGCATGGTTAGTGACAAAAGCCCCT
GTTACATGTGAACCAAAGAACTACAAGGAGGCGTTACAAGACTCCAATTGGAAAGCTGCCATGGATAATGAATATGAGGCCCTGATTCGAAATAAGACATGGTCCCTGGT
TCCTTTACCTAATGATCGTAAGGTAGTCGGTTGTAAGTGGGTTTTTCGTGTTAAACTTAAGCCCGATGGATCCATTGATAGGTATAAGGCCCGATTGGTTGCTCAAGGTT
ACACTCAGGCCCTTGGTATCGATTACTTCAAAACTTTCAGCCCTGTTGTAAAGCCAGCTACAATACGTATTGTTTTATCTTTAGCTGTTTCCTTTGGTTGGCAGATTCGG
CAGCTTGATGTACACAATGCATTCCTTCATGGTGAACTTTTGGAGCAAGTTTACATGAAGCAGCCAACTGGTTTTGTGAGCACCTCTCATCCTTCTCATGTCTTAAAGTT
GCACAAGGCTTTATATGGTATCAAACAGAGTCCTCGCGCTTGGTTCTCTAAGTTGATTTCTTGTCTTTTAGAGTGGGGGTTTCTTGCTTCTAAGGCTGATGCGTCTATGT
TTATTTTCAAATCTGGTTCTAAGTTTGGTTGTTTTTCTTATTTATGTGGATGA
Protein sequenceShow/hide protein sequence
MADQSHPKMIGGDSKPQTSKETLRKGHRPLRVAMTLRNLLQSKQDVVSIFLRFKVMIETQFKTKIRALQTDSGTEFKPLSSILASNGILHRISYPYAPQQNGSVERKNRH
VVELYGRALDYSLLKMFGCSCFPFLRTYNANKLQFRSKECVFLGYSSSQKGYLCLDIHSGRLYVSRHVVFNELSFLFQSLPSTDLPFVKSSFLDPSYPLGSLLHCVNKFS
SPLISPSITPPAVPLPPTAAPIPHASAAPIPISTTYDGSPCSSSSSGSFSSFSSISHPPPTPALVLSESLPNQPVDNPSNHPSSINVHPMVTRSKTSVFKPKAWLVTKAP
VTCEPKNYKEALQDSNWKAAMDNEYEALIRNKTWSLVPLPNDRKVVGCKWVFRVKLKPDGSIDRYKARLVAQGYTQALGIDYFKTFSPVVKPATIRIVLSLAVSFGWQIR
QLDVHNAFLHGELLEQVYMKQPTGFVSTSHPSHVLKLHKALYGIKQSPRAWFSKLISCLLEWGFLASKADASMFIFKSGSKFGCFSYLCG