CuGenDBv2

CmoCh16G006510 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh16G006510
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cmo_Chr16:3213877..3216429
RNA-Seq Expression: CmoCh16G006510
Synteny: CmoCh16G006510
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
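
The genome location field above can be split into its parts with a few lines of Python. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span length
    is end - start + 1. The record does not confirm this convention.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("Cmo_Chr16:3213877..3216429"))
```

Under the inclusive-coordinate assumption, the span for this gene is 2,553 bp on Cmo_Chr16.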


Homology
GenBank top hits (e value, % identity, alignment)
BBG99082.1 ADP glucose pyrophosphorylase large subunit 1 [Prunus dulcis] (e value: 1.6e-199; % identity: 46.67)
Query:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------
        M+FHA+   DQ  W VD+GCSNHM+GCK  F  LDE+FHT VS G+ ST++VM K  + IKT+N F+E+ISNVFYIPDLK NLLS               
Subjt:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTY
                 D+CGPI+P SNG+KKYF++  DDFSRK W YFL  KSEAF  FK   A ++ E+ ++++ LRTDRGGE+CS EF  F +EKGI+RQLTT Y
Subjt:  ---------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTY

Query:  TPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFL
        TPQQNGV++RKNR ILNMVRSLL KG + K+FWPEAV+W+VHILNRSPTFSV++MTPQE WSG KP +DHFR+FGCIAYAHIPDEKRKKL+DKS KCVFL
Subjt:  TPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFL

Query:  GVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMV
        GVSE SKAYKLY+P+TKK+VVSRDVIFDE  +W W E  +V QQIPV  DL+ E+  AP     +  +   Q+ Q+    E+  +N R+  +RKR  WM+
Subjt:  GVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMV

Query:  DYEMDYESSD--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVD
        DY++ Y SSD  + +FA  +DSDPI Y EAVKE+KW+EAMD+EIKSIEKN+TWELTDLP+G++TIGVKWVF+TKLNE GEVDK+KARLVAKGYKQK+G+D
Subjt:  DYEMDYESSD--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVD

Query:  YKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA----------------------------------------------------------------
        YKEVFAP+   DTIRL+IS+A Q SW I+QLDVKSA                                                                
Subjt:  YKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA----------------------------------------------------------------

Query:  --------------------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLH
                                  SD    + FKK  M   +                         +KKYAQ++L  F++D+C  FGTP+E G KL 
Subjt:  --------------------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLH

Query:  KDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
        KD   +EVD+ ++KQI+GSLMYLT  RPDIMYAVS++SRY+E P E HLNAA+RI R
Subjt:  KDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

BBH01550.1 ADP glucose pyrophosphorylase large subunit 1 [Prunus dulcis] (e value: 9.8e-181; % identity: 53.4)
Query:  DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAK
        D+CGPI+P SNG+KKYF++  DDFSRK W YFL  KSEAF  FK   A ++ E+ ++++ LRTDRGGE+CS EF  F +EKGI+RQLTT YTPQQNGV++
Subjt:  DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAK

Query:  RKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAY
        RKNR ILNMVRSLL KG + K+FWPEAV+W+VHILNRSPTFSV++MTPQE WSG KP +DHFR+FGCIAYAHIPDEKRKKL+DKS KCVFLGVSE SKAY
Subjt:  RKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAY

Query:  KLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESS
        KLY+P+TKK+VVSRDVIFDE  +W W E  +V QQIPV  DL+ E+  AP     +  +   Q+ Q+    E+  +N R+  +RKR  WM+DY++ Y SS
Subjt:  KLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESS

Query:  D--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMT
        D  + +FA  +DSDPI Y EAVKE+KW+EAMD+EIKSIEKN+TWELTDLP+G++TIGVKWVF+TKLNE GEVDK+KARLVAKGYKQK+G+DYKEVFAP+ 
Subjt:  D--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMT

Query:  CQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------------
          DTIRL+IS+A Q SW I+QLDVKSA                                                                         
Subjt:  CQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------------

Query:  -----------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREVD
                         SD    + FKK  M   +                         +KKYAQ++L  F++D+C  FGTP+E G KL KD   +EVD
Subjt:  -----------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREVD

Query:  NRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
        + ++KQI+GSLMYLT  RPDIMYAVS++SRY+E P E HLNAA+RI R
Subjt:  NRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

RVW39897.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 1.3e-161; % identity: 40.73)
Query:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------
        VW VD+G +NHM G K  F  L+E F +TVS G+ ST+ VM K  ++I+T+NGFVE+ISNVFY+PDLK+NLLS                           
Subjt:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------

Query:  --------------------------------------------------------------------------------------------------DI
                                                                                                          +I
Subjt:  --------------------------------------------------------------------------------------------------DI

Query:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK
        CGPI+P SNG KKY +  IDD+SRK W  FL  KSEAF+ FK   A ++ ET R ++ LRTDRGGE+CSNEF  F +++GIRR+LT  YTPQQNGV++RK
Subjt:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK

Query:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL
        N+ ILNMVRSLL +G++ K FW +A+ W++H+LNRSPTF V++MTP+E WSGRKPT+DHF+IFGCIAYAH+PDEKRKKL+DK  KCVFLGVSE SKAYKL
Subjt:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL

Query:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD
        ++PLTKK+V+SRDVIFDE+  W W  +         D EEE+ +   Q       P     ++ + A+    A E +     R    RKR  WM D+E+ 
Subjt:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD

Query:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE
           SD+     ++A   D DPI ++EA+K+ KW +AM+ EI SIEKN +WEL +LP+ QK+IGVKWV+KTKLN++G VDKYKARLVAKGYK+++GVDYKE
Subjt:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE

Query:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------
        +FAP+   DTI L++S+A Q SW I+QLDVKSA                                                                   
Subjt:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------

Query:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI
                                  + L  FKK  M                     S+  V   +KKYA E+L+ F L +C +  TPSE+G KL K  
Subjt:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI

Query:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI
          + VD+  +KQIVGSLMYLTS RPDIM+ V++I+RY+E+P E H+ AA+RI
Subjt:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI

RVW92024.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.8e-167; % identity: 40.07)
Query:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------
        M+ HA +     +W +D+GCSNHM G K  FS LDE F  +V+ G+NS + VM K +V I ++    + ISNVF++PDLK NLLS+              
Subjt:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTT
                   DICGPI+P SNG K YF+T IDD+SRK W YFL  KSEAF+ FK     ++ E G+ ++   +DRGGE+ S EF+ F E  GI++QLT 
Subjt:  -----------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTT

Query:  TYTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCV
         Y+PQQNGV++RKNR ILNMVR++L+KG + + FWPEAV+W++HILNRSPT  V+++TP+E W+GRKP+++HFRIFGCIAYAHIPD+KRKKL+DK  KC+
Subjt:  TYTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCV

Query:  FLGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLE---EEQTEAPIQPHYEHGE----SSSQAQQEEPAEEDHDH------NQ
        FLGVSE SKAYKLY+P+TKK+V+SRD+IFDE   W W++  T+ QQI  D +   EE+ + P+Q      E     +    +  P   + D         
Subjt:  FLGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLE---EEQTEAPIQPHYEHGE----SSSQAQQEEPAEEDHDH------NQ

Query:  RNSRTRKRLNWMVDYE---MDYESSDSTYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKA
         + R RKR  WM DYE   +D      T+FA F D DP  ++ AVKE KW++AMD+EI +IE+N TWEL++LP+G KTIGVKWV+KTKL ENGEVDKYKA
Subjt:  RNSRTRKRLNWMVDYE---MDYESSDSTYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKA

Query:  RLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA--------------------------------------------------
        RLVAKGYKQ++GVDYKEVFAP+   DTIRL+I++A Q SWPI+QLDVKSA                                                  
Subjt:  RLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA--------------------------------------------------

Query:  ---------------------------------SDL----------ESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC
                                          DL          E FKK  M                     S+  +   +KKY +E+L  F++ +C
Subjt:  ---------------------------------SDL----------ESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC

Query:  T-FGTPSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
            TP++ G KL+KD   ++VDN  +KQIVGSLMYLT+ RPDIM++VS+ISRY+E+P E H  AA++I R
Subjt:  T-FGTPSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

RVW98955.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 1.6e-162; % identity: 40.96)
Query:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------
        VW VD+G +NHM G K  F  L+E F +TVS G+ ST+ VM K  ++I+T+NGFVE+ISNVFY+PDLK+NLLS                           
Subjt:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------

Query:  --------------------------------------------------------------------------------------------------DI
                                                                                                          +I
Subjt:  --------------------------------------------------------------------------------------------------DI

Query:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK
        CGPI+P SNG KKY +  IDD+SRK W  FL  KSEAF+ FK   A ++ ET R ++ LRTDRGGE+CSNEF  F +++GIRR+LT  YTPQQNGV++RK
Subjt:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK

Query:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL
        N+ ILNMVRSLL +G++ K FW +AV W++H+LNRSPTF V++MTP+E WSGRKPT+DHF+IFGCIAYAH+PDEKRKKL+DK  KCVFLGVSE SKAYKL
Subjt:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL

Query:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD
        ++PLTKK+V+SRDVIFDE+  W W  +         D EEE+ +   Q       P     ++ + A+    A E +     R    RKR  WM D+E+ 
Subjt:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD

Query:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE
           SD+     ++A   D DPI ++EA+K+ KW +AM+ EI SIEKN +WEL +LP+ QK+IGVKWV+KTKLN++G VDKYKARLVAKGYK+++GVDYKE
Subjt:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE

Query:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------
        +FAP+   DTIRL++S+A Q SW I+QLDVKSA                                                                   
Subjt:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------

Query:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI
                                  + L  FKK  M                     S+  V   +KKYA E+L+ F L +C +  TPSE+G KL K  
Subjt:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI

Query:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI
          + VD+  +KQIVGSLMYLTS RPDIM+ V++I+RY+E+P E H+ AA+RI
Subjt:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI

TrEMBL top hits (e value, % identity, alignment)
A0A438DWY0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.4e-162; % identity: 40.73)
Query:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------
        VW VD+G +NHM G K  F  L+E F +TVS G+ ST+ VM K  ++I+T+NGFVE+ISNVFY+PDLK+NLLS                           
Subjt:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------

Query:  --------------------------------------------------------------------------------------------------DI
                                                                                                          +I
Subjt:  --------------------------------------------------------------------------------------------------DI

Query:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK
        CGPI+P SNG KKY +  IDD+SRK W  FL  KSEAF+ FK   A ++ ET R ++ LRTDRGGE+CSNEF  F +++GIRR+LT  YTPQQNGV++RK
Subjt:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK

Query:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL
        N+ ILNMVRSLL +G++ K FW +A+ W++H+LNRSPTF V++MTP+E WSGRKPT+DHF+IFGCIAYAH+PDEKRKKL+DK  KCVFLGVSE SKAYKL
Subjt:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL

Query:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD
        ++PLTKK+V+SRDVIFDE+  W W  +         D EEE+ +   Q       P     ++ + A+    A E +     R    RKR  WM D+E+ 
Subjt:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD

Query:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE
           SD+     ++A   D DPI ++EA+K+ KW +AM+ EI SIEKN +WEL +LP+ QK+IGVKWV+KTKLN++G VDKYKARLVAKGYK+++GVDYKE
Subjt:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE

Query:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------
        +FAP+   DTI L++S+A Q SW I+QLDVKSA                                                                   
Subjt:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------

Query:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI
                                  + L  FKK  M                     S+  V   +KKYA E+L+ F L +C +  TPSE+G KL K  
Subjt:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI

Query:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI
          + VD+  +KQIVGSLMYLTS RPDIM+ V++I+RY+E+P E H+ AA+RI
Subjt:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI

A0A438I5N6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-167; % identity: 40.07)
Query:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------
        M+ HA +     +W +D+GCSNHM G K  FS LDE F  +V+ G+NS + VM K +V I ++    + ISNVF++PDLK NLLS+              
Subjt:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTT
                   DICGPI+P SNG K YF+T IDD+SRK W YFL  KSEAF+ FK     ++ E G+ ++   +DRGGE+ S EF+ F E  GI++QLT 
Subjt:  -----------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTT

Query:  TYTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCV
         Y+PQQNGV++RKNR ILNMVR++L+KG + + FWPEAV+W++HILNRSPT  V+++TP+E W+GRKP+++HFRIFGCIAYAHIPD+KRKKL+DK  KC+
Subjt:  TYTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCV

Query:  FLGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLE---EEQTEAPIQPHYEHGE----SSSQAQQEEPAEEDHDH------NQ
        FLGVSE SKAYKLY+P+TKK+V+SRD+IFDE   W W++  T+ QQI  D +   EE+ + P+Q      E     +    +  P   + D         
Subjt:  FLGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLE---EEQTEAPIQPHYEHGE----SSSQAQQEEPAEEDHDH------NQ

Query:  RNSRTRKRLNWMVDYE---MDYESSDSTYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKA
         + R RKR  WM DYE   +D      T+FA F D DP  ++ AVKE KW++AMD+EI +IE+N TWEL++LP+G KTIGVKWV+KTKL ENGEVDKYKA
Subjt:  RNSRTRKRLNWMVDYE---MDYESSDSTYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKA

Query:  RLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA--------------------------------------------------
        RLVAKGYKQ++GVDYKEVFAP+   DTIRL+I++A Q SWPI+QLDVKSA                                                  
Subjt:  RLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA--------------------------------------------------

Query:  ---------------------------------SDL----------ESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC
                                          DL          E FKK  M                     S+  +   +KKY +E+L  F++ +C
Subjt:  ---------------------------------SDL----------ESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC

Query:  T-FGTPSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
            TP++ G KL+KD   ++VDN  +KQIVGSLMYLT+ RPDIM++VS+ISRY+E+P E H  AA++I R
Subjt:  T-FGTPSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

A0A438IQQ3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.6e-163; % identity: 40.96)
Query:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------
        VW VD+G +NHM G K  F  L+E F +TVS G+ ST+ VM K  ++I+T+NGFVE+ISNVFY+PDLK+NLLS                           
Subjt:  VWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------------------

Query:  --------------------------------------------------------------------------------------------------DI
                                                                                                          +I
Subjt:  --------------------------------------------------------------------------------------------------DI

Query:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK
        CGPI+P SNG KKY +  IDD+SRK W  FL  KSEAF+ FK   A ++ ET R ++ LRTDRGGE+CSNEF  F +++GIRR+LT  YTPQQNGV++RK
Subjt:  CGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRK

Query:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL
        N+ ILNMVRSLL +G++ K FW +AV W++H+LNRSPTF V++MTP+E WSGRKPT+DHF+IFGCIAYAH+PDEKRKKL+DK  KCVFLGVSE SKAYKL
Subjt:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKL

Query:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD
        ++PLTKK+V+SRDVIFDE+  W W  +         D EEE+ +   Q       P     ++ + A+    A E +     R    RKR  WM D+E+ 
Subjt:  YDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAPIQ-------PHYEHGESSSQAQQEEPAEEDH-DHNQRNSRTRKRLNWMVDYEMD

Query:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE
           SD+     ++A   D DPI ++EA+K+ KW +AM+ EI SIEKN +WEL +LP+ QK+IGVKWV+KTKLN++G VDKYKARLVAKGYK+++GVDYKE
Subjt:  YESSDS----TYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKE

Query:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------
        +FAP+   DTIRL++S+A Q SW I+QLDVKSA                                                                   
Subjt:  VFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------

Query:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI
                                  + L  FKK  M                     S+  V   +KKYA E+L+ F L +C +  TPSE+G KL K  
Subjt:  --------------------------SDLESFKKRYM--------------------WSNQKVLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDI

Query:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI
          + VD+  +KQIVGSLMYLTS RPDIM+ V++I+RY+E+P E H+ AA+RI
Subjt:  ERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI

A0A4Y1R4M3 ADP glucose pyrophosphorylase large subunit 1 (e value: 7.8e-200; % identity: 46.67)
Query:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------
        M+FHA+   DQ  W VD+GCSNHM+GCK  F  LDE+FHT VS G+ ST++VM K  + IKT+N F+E+ISNVFYIPDLK NLLS               
Subjt:  MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSI--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTY
                 D+CGPI+P SNG+KKYF++  DDFSRK W YFL  KSEAF  FK   A ++ E+ ++++ LRTDRGGE+CS EF  F +EKGI+RQLTT Y
Subjt:  ---------DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTY

Query:  TPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFL
        TPQQNGV++RKNR ILNMVRSLL KG + K+FWPEAV+W+VHILNRSPTFSV++MTPQE WSG KP +DHFR+FGCIAYAHIPDEKRKKL+DKS KCVFL
Subjt:  TPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFL

Query:  GVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMV
        GVSE SKAYKLY+P+TKK+VVSRDVIFDE  +W W E  +V QQIPV  DL+ E+  AP     +  +   Q+ Q+    E+  +N R+  +RKR  WM+
Subjt:  GVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMV

Query:  DYEMDYESSD--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVD
        DY++ Y SSD  + +FA  +DSDPI Y EAVKE+KW+EAMD+EIKSIEKN+TWELTDLP+G++TIGVKWVF+TKLNE GEVDK+KARLVAKGYKQK+G+D
Subjt:  DYEMDYESSD--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVD

Query:  YKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA----------------------------------------------------------------
        YKEVFAP+   DTIRL+IS+A Q SW I+QLDVKSA                                                                
Subjt:  YKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA----------------------------------------------------------------

Query:  --------------------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLH
                                  SD    + FKK  M   +                         +KKYAQ++L  F++D+C  FGTP+E G KL 
Subjt:  --------------------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLH

Query:  KDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
        KD   +EVD+ ++KQI+GSLMYLT  RPDIMYAVS++SRY+E P E HLNAA+RI R
Subjt:  KDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

A0A4Y1RBF6 ADP glucose pyrophosphorylase large subunit 1 (e value: 4.7e-181; % identity: 53.4)
Query:  DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAK
        D+CGPI+P SNG+KKYF++  DDFSRK W YFL  KSEAF  FK   A ++ E+ ++++ LRTDRGGE+CS EF  F +EKGI+RQLTT YTPQQNGV++
Subjt:  DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAK

Query:  RKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAY
        RKNR ILNMVRSLL KG + K+FWPEAV+W+VHILNRSPTFSV++MTPQE WSG KP +DHFR+FGCIAYAHIPDEKRKKL+DKS KCVFLGVSE SKAY
Subjt:  RKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAY

Query:  KLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESS
        KLY+P+TKK+VVSRDVIFDE  +W W E  +V QQIPV  DL+ E+  AP     +  +   Q+ Q+    E+  +N R+  +RKR  WM+DY++ Y SS
Subjt:  KLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPV--DLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESS

Query:  D--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMT
        D  + +FA  +DSDPI Y EAVKE+KW+EAMD+EIKSIEKN+TWELTDLP+G++TIGVKWVF+TKLNE GEVDK+KARLVAKGYKQK+G+DYKEVFAP+ 
Subjt:  D--STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMT

Query:  CQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------------
          DTIRL+IS+A Q SW I+QLDVKSA                                                                         
Subjt:  CQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------------------

Query:  -----------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREVD
                         SD    + FKK  M   +                         +KKYAQ++L  F++D+C  FGTP+E G KL KD   +EVD
Subjt:  -----------------SD---LESFKKRYMWSNQKV--------------------LFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREVD

Query:  NRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
        + ++KQI+GSLMYLT  RPDIMYAVS++SRY+E P E HLNAA+RI R
Subjt:  NRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 4.3e-54; % identity: 28.45)
Query:  DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAK
        D+CGPI+P +  DK YF+  +D F+    TY +  KS+ F+ F+   A  +     +V  L  D G E+ SNE  +F  +KGI   LT  +TPQ NGV++
Subjt:  DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAK

Query:  RKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRD--MTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSK
        R  R I    R++++  ++ K FW EAV+   +++NR P+ ++ D   TP E W  +KP + H R+FG   Y HI + K+ K +DKS K +F+G      
Subjt:  RKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRD--MTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSK

Query:  AYKLYDPLTKKVVVSRDVIFDE-----------KKIWTWEEKITVNQQIPVD-------------------------LEEE------------QTEAP--
         +KL+D + +K +V+RDV+ DE           + ++  + K + N+  P D                          E E            QTE P  
Subjt:  AYKLYDPLTKKVVVSRDVIFDE-----------KKIWTWEEKITVNQQIPVD-------------------------LEEE------------QTEAP--

Query:  ------IQPHYEHGESSSQAQQEEPAEEDHDH-----------NQRNSRTRKRLNWM----------------------VDYEMDYESSDSTYFAFFMDS
              IQ   +  ES+     E    +  DH             R S T + L  +                         ++ Y   D++     +++
Subjt:  ------IQPHYEHGESSSQAQQEEPAEEDHDH-----------NQRNSRTRKRLNWM----------------------VDYEMDYESSDSTYFAFFMDS

Query:  -----------DPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQ
                   D I Y++   +  W+EA+++E+ + + N TW +T  P  +  +  +WVF  K NE G   +YKARLVA+G+ QKY +DY+E FAP+   
Subjt:  -----------DPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQ

Query:  DTIRLIISIATQKSWPIYQLDVKSASDLESFKKR-YMWSNQ-------------KVLFKKKKYAQELLEIFK--LDECTF
         + R I+S+  Q +  ++Q+DVK+A    + K+  YM   Q             K ++  K+ A+   E+F+  L EC F
Subjt:  DTIRLIISIATQKSWPIYQLDVKSASDLESFKKR-YMWSNQ-------------KVLFKKKKYAQELLEIFK--LDECTF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.9e-80; % identity: 36.9)
Query:  KANLLSI---DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTT
        K N+L +   D+CGP+   S G  KYF+T IDD SRK+W Y L  K + F  F+K  A ++ ETGR+++ LR+D GGE+ S EF ++    GIR + T  
Subjt:  KANLLSI---DICGPISPASNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTT

Query:  YTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVF
         TPQ NGVA+R NR I+  VRS+L   ++ K FW EAV    +++NRSP+  +    P+  W+ ++ +  H ++FGC A+AH+P E+R KL+DKS+ C+F
Subjt:  YTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVF

Query:  LGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWT---WEEKITVNQQIP-----------VDLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQ
        +G  +    Y+L+DP+ KKV+ SRDV+F E ++ T     EK+  N  IP               E  T+   +   + GE   Q +Q +   E+ +H  
Subjt:  LGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWT---WEEKITVNQQIP-----------VDLEEEQTEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQ

Query:  RNSRTRKRLNWMVDYEMDYESSDSTYFAFFMDS-DPIVYKEAV---KEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYK
        +     + L       ++     ST +    D  +P   KE +   ++ +  +AM  E++S++KN T++L +LP+G++ +  KWVFK K + + ++ +YK
Subjt:  RNSRTRKRLNWMVDYEMDYESSDSTYFAFFMDS-DPIVYKEAV---KEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYK

Query:  ARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA---SDLE
        ARLV KG++QK G+D+ E+F+P+    +IR I+S+A      + QLDVK+A    DLE
Subjt:  ARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA---SDLE

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 1.2e-16; %identity: 38.89)
Query:  AVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQ-KSWPI
        A+K+  W +AM  E+ ++ +NKTW L   P  Q  +G KWVFKTKL+ +G +D+ KARLVAKG+ Q+ G+ + E ++P+    TIR I+++A Q +    
Subjt:  AVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQ-KSWPI

Query:  YQLDVKSASDLESFKKRYMWSNQKVL
             K    +  FKK+++  N  VL
Subjt:  YQLDVKSASDLESFKKRYMWSNQKVL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 9.9e-43; %identity: 22.94)
Query:  SNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRKNRIILNM
        S+ + +Y++  +D F+R  W Y L  KS+    F      ++     R+    +D GGEF +    +++ + GI    +  +TP+ NG+++RK+R I+  
Subjt:  SNGDKKYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRKNRIILNM

Query:  VRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKLYDPLTKK
          +LL+   + K +WP A    V+++NR PT  ++  +P +   G  P  D  R+FGC  Y  +    + KL+DKS +CVFLG S T  AY      T +
Subjt:  VRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKLYDPLTKK

Query:  VVVSRDVIFDE--------------------KKIWTWEEKITVNQQIPV-------DLEEEQT--EAPIQPHYEHGESSS--------------------
        + +SR V FDE                    +    W    T+  + PV       D     T   +P  P      SSS                    
Subjt:  VVVSRDVIFDE--------------------KKIWTWEEKITVNQQIPV-------DLEEEQT--EAPIQPHYEHGESSS--------------------

Query:  -----------QAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESSD--------------------------------------------------
                   Q Q +  + ++   N   + +  +L   +       SS                                                   
Subjt:  -----------QAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESSD--------------------------------------------------

Query:  ---------STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTI-GVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYK
                 S   +   +S+P    +A+K+++W+ AM SEI +   N TW+L   P    TI G +W+F  K N +G +++YKARLVAKGY Q+ G+DY 
Subjt:  ---------STYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTI-GVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYK

Query:  EVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA----------------SDLESFKKRYMWSNQKVLFKKKK---------------------------
        E F+P+    +IR+++ +A  +SWPI QLDV +A                  ++  +  Y+   +K L+  K+                           
Subjt:  EVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA----------------SDLESFKKRYMWSNQKVLFKKKK---------------------------

Query:  --------------YAQELLEIFKLDECTFGTPSELGSKL----HKDI------------------ERREV-----------------------------
                      Y  ++L           T   L  +     H+++                  +RR +                             
Subjt:  --------------YAQELLEIFKLDECTFGTPSELGSKL----HKDI------------------ERREV-----------------------------

Query:  -----DNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
             D   ++ IVGSL YL   RPDI YAV+ +S+++  P E+HL A +RI R
Subjt:  -----DNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.1e-40; %identity: 23.51)
Query:  KYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRKNRIILNMVRSLL
        +Y++  +D F+R  W Y L  KS+  + F    + ++     R+  L +D GGEF       +  + GI    +  +TP+ NG+++RK+R I+ M  +LL
Subjt:  KYFMTLIDDFSRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRKNRIILNMVRSLL

Query:  NKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKLYDPLTKKVVVSR
        +   V K +WP A    V+++NR PT  ++  +P +   G+ P  +  ++FGC  Y  +    R KLEDKS +C F+G S T  AY      T ++  SR
Subjt:  NKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKLYDPLTKKVVVSR

Query:  DVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAP--------------------IQPHYE----------------------HGESSSQAQQEEPAEEDH
         V FDE+         T N  +    E+    AP                    + PH +                         S S     EP    H
Subjt:  DVIFDEKKIWTWEEKITVNQQIPVDLEEEQTEAP--------------------IQPHYE----------------------HGESSSQAQQEEPAEEDH

Query:  D--------HNQRNSRTRKRL----------------------------------NWMVDYEMDYESSDST-----------------------------
        +        H  +NS +   +                                    + +      SS ST                             
Subjt:  D--------HNQRNSRTRKRL----------------------------------NWMVDYEMDYESSDST-----------------------------

Query:  --------------YFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTI-GVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGV
                        +   +S+P    +A+K+ +W++AM SEI +   N TW+L   P    TI G +W+F  K N +G +++YKARLVAKGY Q+ G+
Subjt:  --------------YFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTI-GVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGV

Query:  DYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------SD
        DY E F+P+    +IR+++ +A  +SWPI QLDV +A                                                             SD
Subjt:  DYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA-------------------------------------------------------------SD

Query:  LESFKKR------YM---------WSNQKVLFK------------------------------------KKKYAQELL-EIFKLDECTFGTPSELGSKLH
           F  +      YM           N  VL K                                    +++Y  +LL     L      TP     KL 
Subjt:  LESFKKR------YM---------WSNQKVLFK------------------------------------KKKYAQELL-EIFKLDECTFGTPSELGSKLH

Query:  KDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
             +  D   ++ IVGSL YL   RPD+ YAV+ +S+Y+  P + H NA +R+ R
Subjt:  KDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

Arabidopsis top hits (e value, %identity, alignment)
AT3G21000.1 Gag-Pol-related retrotransposon family protein (e value: 3.4e-06; %identity: 31.65)
Query:  KVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLS
        K  D  +W++      +MT   ++F+TLD  F  TV   + + + V  K  V I+ + G  ++I NV ++P L  N+LS
Subjt:  KVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLS
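
The %identity value listed for each hit can be related to the middle row of its alignment. The sketch below assumes the usual BLAST-style convention (a letter in the middle row marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap) and assumes the percentage is taken over all alignment columns. The function name `percent_identity` and the toy rows are hypothetical and not part of CuGenDB. Under that reading, the AT3G21000.1 alignment above has 25 identical residues over 79 columns, which is consistent with the listed 31.65.

```python
def percent_identity(query_row: str, middle_row: str) -> float:
    """Return identities / alignment columns as a percentage, rounded to 2 places."""
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    # Letters in the middle row mark identical residues; '+' and blanks do not count.
    identities = sum(1 for ch in middle_row[:columns] if ch.isalpha())
    return round(100.0 * identities / columns, 2)


# Toy rows (not taken from the page): 3 identities over 5 columns.
toy_query = "MKVLA"
toy_middle = "MK+ A"
print(percent_identity(toy_query, toy_middle))  # 60.0
```

The middle row is used rather than a direct Query/Subjt comparison because the Subjt rows in this record repeat the query text.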

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 6.8e-23; %identity: 39.26)
Query:  MDYESSDSTYFAFFM----DSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDY
        + YE     Y +F +      +P  Y EA +   W  AMD EI ++E   TWE+  LP  +K IG KWV+K K N +G +++YKARLVAKGY Q+ G+D+
Subjt:  MDYESSDSTYFAFFM----DSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDY

Query:  KEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA
         E F+P+    +++LI++I+   ++ ++QLD+ +A
Subjt:  KEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSA

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 6.0e-03; %identity: 30.23)
Query:  KKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI
        ++KYA +LL+   L  C     P +             VD + +++++G LMYL   R DI +AV+ +S++ E P   H  A  +I
Subjt:  KKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRI

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.9e-09; %identity: 37.66)
Query:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRK
        NR I+  VRS+L +  + K F  +A    VHI+N+ P+ ++    P E W    PT  + R FGC+AY H  + K K
Subjt:  NRIILNMVRSLLNKGEVLKEFWPEAVVWNVHILNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRK

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 2.4e-04; %identity: 31.18)
Query:  VLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREV-DNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
        +   + KYA+++L    + +C    TP  L  KL+  +   +  D   F+ IVG+L YLT  RPDI YAV+++ + +  P     +  +R+ R
Subjt:  VLFKKKKYAQELLEIFKLDEC-TFGTPSELGSKLHKDIERREV-DNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 8.6e-18; %identity: 38.89)
Query:  AVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQ-KSWPI
        A+K+  W +AM  E+ ++ +NKTW L   P  Q  +G KWVFKTKL+ +G +D+ KARLVAKG+ Q+ G+ + E ++P+    TIR I+++A Q +    
Subjt:  AVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGVKWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQ-KSWPI

Query:  YQLDVKSASDLESFKKRYMWSNQKVL
             K    +  FKK+++  N  VL
Subjt:  YQLDVKSASDLESFKKRYMWSNQKVL


Sequences
CDS sequence
ATGTCATTCCATGCTATGAAGGTTGCAGATCAAGGTGTATGGGTCGTAGACTCTGGATGCAGCAACCATATGACAGGTTGTAAGGAATTCTTCTCAACTCTGGATGAAAA
TTTTCATACTACGGTTTCTGTTGGCAACAATTCAACTATTCAAGTGATGGAGAAAGAAACCGTAGATATCAAGACTAGAAATGGCTTTGTAGAGTCTATTTCTAATGTAT
TCTACATTCCTGATCTAAAGGCCAACTTACTTAGTATCGACATATGTGGTCCCATCTCTCCAGCATCAAATGGTGACAAGAAATATTTCATGACCCTCATCGATGATTTT
AGTCGAAAAATGTGGACTTATTTCTTGCATGCCAAATCAGAAGCCTTTAATTGCTTCAAAAAATTGTGTGCTACTATGAAGACAGAAACTGGAAGAAGAGTCGAGGCTTT
AAGAACTGACAGAGGTGGAGAGTTCTGCTCGAACGAGTTTATTAAATTTTGGGAAGAAAAGGGCATAAGGAGGCAGTTAACAACAACATATACACCACAACAAAATGGCG
TGGCCAAAAGGAAAAATAGGATCATTCTCAACATGGTTCGAAGTTTACTAAATAAAGGAGAAGTCCTGAAGGAATTTTGGCCAGAAGCTGTCGTGTGGAACGTTCACATT
CTCAACCGGAGCCCTACTTTTTCTGTTAGAGATATGACTCCACAAGAAACATGGAGCGGAAGAAAACCTACAATAGACCACTTTAGAATCTTTGGGTGCATAGCATATGC
ACACATTCCTGATGAGAAAAGAAAGAAACTTGAGGACAAGAGTTTGAAGTGTGTGTTCCTTGGAGTAAGTGAGACTTCTAAGGCGTACAAGCTCTATGATCCTTTGACAA
AGAAGGTGGTGGTAAGTCGTGATGTAATCTTTGACGAAAAGAAGATTTGGACATGGGAAGAAAAGATCACTGTGAATCAGCAGATTCCTGTAGATCTTGAAGAAGAACAA
ACTGAAGCCCCTATTCAGCCTCACTATGAGCATGGAGAGAGCTCGTCACAAGCTCAACAAGAAGAACCAGCAGAAGAAGATCATGATCATAATCAAAGAAATTCAAGAAC
AAGAAAAAGACTAAATTGGATGGTTGATTATGAGATGGACTATGAATCAAGTGATAGTACTTATTTTGCATTCTTTATGGATAGTGATCCAATTGTCTATAAAGAAGCAG
TAAAGGAGAAGAAATGGAAGGAAGCTATGGACAGTGAAATCAAATCCATAGAGAAGAACAAGACTTGGGAGCTCACTGATCTTCCTAGAGGGCAGAAAACAATCGGAGTA
AAATGGGTTTTCAAAACCAAGTTGAATGAAAACGGTGAGGTGGATAAGTACAAGGCGCGACTCGTTGCAAAAGGATACAAGCAGAAGTATGGAGTTGATTATAAGGAGGT
ATTTGCACCAATGACATGTCAAGATACTATTAGGCTTATAATTTCTATTGCAACACAAAAATCTTGGCCTATTTATCAACTTGATGTGAAATCGGCTTCCGACCTGGAGA
GCTTCAAGAAGAGGTATATGTGGAGCAACCAGAAGGTTTTGTTCAAAAAGAAGAAATATGCTCAAGAATTACTGGAAATATTTAAATTGGATGAGTGCACATTTGGAACT
CCATCAGAATTGGGTTCGAAGTTGCACAAGGATATTGAAAGGCGAGAGGTTGATAATAGGTACTTCAAACAAATTGTGGGAAGTTTGATGTACTTAACTTCAGCAAGACC
AGACATCATGTATGCGGTTAGTATGATAAGTCGTTACCTGGAGCATCCAATGGAAAAACATCTCAATGCAGCAAGAAGGATACATCGTTAG
mRNA sequence
ATGTCATTCCATGCTATGAAGGTTGCAGATCAAGGTGTATGGGTCGTAGACTCTGGATGCAGCAACCATATGACAGGTTGTAAGGAATTCTTCTCAACTCTGGATGAAAA
TTTTCATACTACGGTTTCTGTTGGCAACAATTCAACTATTCAAGTGATGGAGAAAGAAACCGTAGATATCAAGACTAGAAATGGCTTTGTAGAGTCTATTTCTAATGTAT
TCTACATTCCTGATCTAAAGGCCAACTTACTTAGTATCGACATATGTGGTCCCATCTCTCCAGCATCAAATGGTGACAAGAAATATTTCATGACCCTCATCGATGATTTT
AGTCGAAAAATGTGGACTTATTTCTTGCATGCCAAATCAGAAGCCTTTAATTGCTTCAAAAAATTGTGTGCTACTATGAAGACAGAAACTGGAAGAAGAGTCGAGGCTTT
AAGAACTGACAGAGGTGGAGAGTTCTGCTCGAACGAGTTTATTAAATTTTGGGAAGAAAAGGGCATAAGGAGGCAGTTAACAACAACATATACACCACAACAAAATGGCG
TGGCCAAAAGGAAAAATAGGATCATTCTCAACATGGTTCGAAGTTTACTAAATAAAGGAGAAGTCCTGAAGGAATTTTGGCCAGAAGCTGTCGTGTGGAACGTTCACATT
CTCAACCGGAGCCCTACTTTTTCTGTTAGAGATATGACTCCACAAGAAACATGGAGCGGAAGAAAACCTACAATAGACCACTTTAGAATCTTTGGGTGCATAGCATATGC
ACACATTCCTGATGAGAAAAGAAAGAAACTTGAGGACAAGAGTTTGAAGTGTGTGTTCCTTGGAGTAAGTGAGACTTCTAAGGCGTACAAGCTCTATGATCCTTTGACAA
AGAAGGTGGTGGTAAGTCGTGATGTAATCTTTGACGAAAAGAAGATTTGGACATGGGAAGAAAAGATCACTGTGAATCAGCAGATTCCTGTAGATCTTGAAGAAGAACAA
ACTGAAGCCCCTATTCAGCCTCACTATGAGCATGGAGAGAGCTCGTCACAAGCTCAACAAGAAGAACCAGCAGAAGAAGATCATGATCATAATCAAAGAAATTCAAGAAC
AAGAAAAAGACTAAATTGGATGGTTGATTATGAGATGGACTATGAATCAAGTGATAGTACTTATTTTGCATTCTTTATGGATAGTGATCCAATTGTCTATAAAGAAGCAG
TAAAGGAGAAGAAATGGAAGGAAGCTATGGACAGTGAAATCAAATCCATAGAGAAGAACAAGACTTGGGAGCTCACTGATCTTCCTAGAGGGCAGAAAACAATCGGAGTA
AAATGGGTTTTCAAAACCAAGTTGAATGAAAACGGTGAGGTGGATAAGTACAAGGCGCGACTCGTTGCAAAAGGATACAAGCAGAAGTATGGAGTTGATTATAAGGAGGT
ATTTGCACCAATGACATGTCAAGATACTATTAGGCTTATAATTTCTATTGCAACACAAAAATCTTGGCCTATTTATCAACTTGATGTGAAATCGGCTTCCGACCTGGAGA
GCTTCAAGAAGAGGTATATGTGGAGCAACCAGAAGGTTTTGTTCAAAAAGAAGAAATATGCTCAAGAATTACTGGAAATATTTAAATTGGATGAGTGCACATTTGGAACT
CCATCAGAATTGGGTTCGAAGTTGCACAAGGATATTGAAAGGCGAGAGGTTGATAATAGGTACTTCAAACAAATTGTGGGAAGTTTGATGTACTTAACTTCAGCAAGACC
AGACATCATGTATGCGGTTAGTATGATAAGTCGTTACCTGGAGCATCCAATGGAAAAACATCTCAATGCAGCAAGAAGGATACATCGTTAG
Protein sequence
MSFHAMKVADQGVWVVDSGCSNHMTGCKEFFSTLDENFHTTVSVGNNSTIQVMEKETVDIKTRNGFVESISNVFYIPDLKANLLSIDICGPISPASNGDKKYFMTLIDDF
SRKMWTYFLHAKSEAFNCFKKLCATMKTETGRRVEALRTDRGGEFCSNEFIKFWEEKGIRRQLTTTYTPQQNGVAKRKNRIILNMVRSLLNKGEVLKEFWPEAVVWNVHI
LNRSPTFSVRDMTPQETWSGRKPTIDHFRIFGCIAYAHIPDEKRKKLEDKSLKCVFLGVSETSKAYKLYDPLTKKVVVSRDVIFDEKKIWTWEEKITVNQQIPVDLEEEQ
TEAPIQPHYEHGESSSQAQQEEPAEEDHDHNQRNSRTRKRLNWMVDYEMDYESSDSTYFAFFMDSDPIVYKEAVKEKKWKEAMDSEIKSIEKNKTWELTDLPRGQKTIGV
KWVFKTKLNENGEVDKYKARLVAKGYKQKYGVDYKEVFAPMTCQDTIRLIISIATQKSWPIYQLDVKSASDLESFKKRYMWSNQKVLFKKKKYAQELLEIFKLDECTFGT
PSELGSKLHKDIERREVDNRYFKQIVGSLMYLTSARPDIMYAVSMISRYLEHPMEKHLNAARRIHR
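
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped listings
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Raises KeyError on codons with ambiguous bases such as N.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTCATTCCATGCTATGAAG"))  # MSFHAMK
```

Passing the full CDS, with the line breaks left in, should reproduce the protein sequence listed here, which begins with MSFHAMK and ends with RIHR before the terminal TAG stop codon.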