CuGenDBv2

Lag0007908 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007908
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:7692967..7698744
RNA-Seq Expression: Lag0007908
Synteny: Lag0007908
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-119; % identity: 37.35)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N  C VKG GSV I   DG ++IL  VR+VP+LKRNLISLG LD++G   KSENG++ VTK S+VKL GT  +        L +LE T+ +     A
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
                     G++ ++ +        ++   L     H+ E G L  +     L  V  V++                    P C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                          + KS+R  F    + TK  LDY+ SDLWGP +  + GG+RYF++ +DDFS K+W+Y LK  DE F 
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
        KF+EWK  VENQT RK+K LRTDNGLE+VN++F   CK+ GI RH T   TP QNGLAERFNRTI+ER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  WTG+ P+L HLRVFGC  YA  +  KL  R  +C+F+GYP G+K YKLW ++ G  +CIIS+DV+F+E  MP+    Q + +  D    E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP
        +  + S   P I   +  P++      Q SE +  ++Q +  ++D     EE                               L+  +L         EP
Subjt:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP

Query:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL
         T+ +A+ S +   W  A+ +EL SL +N TWSLV +P  QK +  KW+YK+K     ++ PR+KARLVAKG+TQ+E VD+ E+FS VVRH+SIR++LS+
Subjt:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL

Query:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK
         V F++ ++QMDVTTAFL+G LEE IYM   PK  EV G ED+VC LHKS+YGLK
Subjt:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK

PKU72844.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] (e value: 3.2e-116; % identity: 36.78)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +GNN+ C V GIGS+ +++ DG ++ILK+VR VP+LKRNLISLGTLD +G+ ++SE GLL ++K ++V + G   N  + +     + E   +A   L  
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVL----MGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKD
                  Q++G                          HL + G++     G+    +++ +D                                   
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVL----MGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKD

Query:  FMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTID
               FC                          S  + KS R SF+ ST+  +  LDYI SDLWGPARV+THGG RYFL+F+DD+S K+W+++LK+ D
Subjt:  FMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTID

Query:  EVFEKFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER----------------------------
        E F KF+EWK  VENQ +RK+K+LRTDNGLE+ N+ F   C  SGI RH T + TP QNGLAER NRT+L+R                            
Subjt:  EVFEKFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER----------------------------

Query:  -------------KWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRD---
                      W G+PP+L HLRVFGC  Y    + KL+PR  +C+FLGYP G+K Y+LW L     + IIS+DV F+E+ +   + S++EN+D   
Subjt:  -------------KWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRD---

Query:  ---------KDLEGFEIDLLPSSSSPQIANSSSQPVLHQPSEEEVTETQTDIVDTSQQAEEEGI----------LLIISLREIDKGGRTEPNTYNQAMKS
                 KD   FE++   ++ S +    S++P   +   E V+E   D +  S+  E   I          L+  +L    +   +EP +Y +A+  
Subjt:  ---------KDLEGFEIDLLPSSSSPQIANSSSQPVLHQPSEEEVTETQTDIVDTSQQAEEEGI----------LLIISLREIDKGGRTEPNTYNQAMKS

Query:  QNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQ
        ++S+ W+ A+  E DSL +NNTW LV +   QK V CKW+YK+K     +   R+KARLVA+G+TQ+E +DY+E+FS VV+HTSIRIL+ LV  F+ EL+
Subjt:  QNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQ

Query:  QMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYGLK
        Q+DV TAFL+G L+E+I M     FV  G E+LVC L KSIYGLK
Subjt:  QMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYGLK

PNX96445.1 copia LTR rider [Trifolium pratense] (e value: 1.1e-121; % identity: 38.17)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +GNN+ C + G+GSV  +L D S+++L EVR+VP+LKRNL+SLG  DK G+ ++ E  +L V K S   L G         GL+    E  S +      
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
           +  P S+ ++                           H+R     +G +S   L ++    +  G ++++      K C                 E
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
          +F                                KS R  F      T   LDYI +DLWGPAR  +H GARYFL+ VDD+S KLWV++ KT DE FE
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER--------------------------------
         F  WK  VENQT RK+K LRTDNGLE+ N+ F   C  SGI RH TTAGTP QNGLAERFNRTILER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER--------------------------------

Query:  ---------KWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK----FSSQTENRDKDL
                  W+G PP+L  LRVFGC  YA  RQDK++PR  +C+F+GYP G+KAY+LW L+PG +RCI S+DV F+E  M +K        TE  D++L
Subjt:  ---------KWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK----FSSQTENRDKDL

Query:  EGFEIDLLPSSSSPQIANSSSQPVLHQPSE--------EEVTETQTDIV---DTSQQ----------AEEEGILLIISLREIDKGGRTEPNTYNQAMKSQ
        E  EI +       ++ +  ++  LH P E        EEV ET  D +   D S++          A+     LI +   +D+    EP  Y + M+S+
Subjt:  EGFEIDLLPSSSSPQIANSSSQPVLHQPSE--------EEVTETQTDIV---DTSQQ----------AEEEGILLIISLREIDKGGRTEPNTYNQAMKSQ

Query:  NSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQ
        N   W+ A+  E+ SL +N+TW L+K+P G + V CKW++K+K  ++     R+KARLVA+GFTQ+E VD+ +VFS VV+H SIR+LL++V QF+LEL+Q
Subjt:  NSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQ

Query:  MDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYGLK
        MDV TAFLYG L+E I M     +VE G ED VC L +S+YGLK
Subjt:  MDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYGLK

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-119; % identity: 37.35)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N  C VKG GSV I   DG ++IL  VR+VP+LKRNLISLG LD++G   KSENG++ VTK S+VKL GT  +        L +LE T+ +     A
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
                     G++ ++ +        ++   L     H+ E G L  +     L  V  V++                    P C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                          + KS+R  F    + TK  LDY+ SDLWGP +  + GG+RYF++ +DDFS K+W+Y LK  DE F 
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
        KF+EWK  VENQT RK+K LRTDNGLE+VN++F   CK+ GI RH T   TP QNGLAERFNRTI+ER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  WTG+ P+L HLRVFGC  YA  +  KL  R  +C+F+GYP G+K YKLW ++ G  +CIIS+DV+F+E  MP+    Q + +  D    E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP
        +  + S   P I   +  P++      Q SE +  ++Q +  ++D     EE                               L+  +L         EP
Subjt:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP

Query:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL
         T+ +A+ S +   W  A+ +EL SL +N TWSLV +P  QK +  KW+YK+K     ++ PR+KARLVAKG+TQ+E VD+ E+FS VVRH+SIR++LS+
Subjt:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL

Query:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK
         V F++ ++QMDVTTAFL+G LEE IYM   PK  EV G ED+VC LHKS+YGLK
Subjt:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-119; % identity: 37.35)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N  C VKG GSV I   DG ++IL  VR+VP+LKRNLISLG LD++G   KSENG++ VTK S+VKL GT  +        L +LE T+ +     A
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
                     G++ ++ +        ++   L     H+ E G L  +     L  V  V++                    P C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                          + KS+R  F    + TK  LDY+ SDLWGP +  + GG+RYF++ +DDFS K+W+Y LK  DE F 
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
        KF+EWK  VENQT RK+K LRTDNGLE+VN++F   CK+ GI RH T   TP QNGLAERFNRTI+ER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  WTG+ P+L HLRVFGC  YA  +  KL  R  +C+F+GYP G+K YKLW ++ G  +CIIS+DV+F+E  MP+    Q + +  D    E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP
        +  + S   P I   +  P++      Q SE +  ++Q +  ++D     EE                               L+  +L         EP
Subjt:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP

Query:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL
         T+ +A+ S +   W  A+ +EL SL +N TWSLV +P  QK +  KW+YK+K     ++ PR+KARLVAKG+TQ+E VD+ E+FS VVRH+SIR++LS+
Subjt:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL

Query:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK
         V F++ ++QMDVTTAFL+G LEE IYM   PK  EV G ED+VC LHKS+YGLK
Subjt:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK

TrEMBL top hits (e value, % identity, alignment)
A0A2K3N065 Copia LTR rider (e value: 5.5e-122; % identity: 38.17)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +GNN+ C + G+GSV  +L D S+++L EVR+VP+LKRNL+SLG  DK G+ ++ E  +L V K S   L G         GL+    E  S +      
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
           +  P S+ ++                           H+R     +G +S   L ++    +  G ++++      K C                 E
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
          +F                                KS R  F      T   LDYI +DLWGPAR  +H GARYFL+ VDD+S KLWV++ KT DE FE
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER--------------------------------
         F  WK  VENQT RK+K LRTDNGLE+ N+ F   C  SGI RH TTAGTP QNGLAERFNRTILER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER--------------------------------

Query:  ---------KWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK----FSSQTENRDKDL
                  W+G PP+L  LRVFGC  YA  RQDK++PR  +C+F+GYP G+KAY+LW L+PG +RCI S+DV F+E  M +K        TE  D++L
Subjt:  ---------KWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK----FSSQTENRDKDL

Query:  EGFEIDLLPSSSSPQIANSSSQPVLHQPSE--------EEVTETQTDIV---DTSQQ----------AEEEGILLIISLREIDKGGRTEPNTYNQAMKSQ
        E  EI +       ++ +  ++  LH P E        EEV ET  D +   D S++          A+     LI +   +D+    EP  Y + M+S+
Subjt:  EGFEIDLLPSSSSPQIANSSSQPVLHQPSE--------EEVTETQTDIV---DTSQQ----------AEEEGILLIISLREIDKGGRTEPNTYNQAMKSQ

Query:  NSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQ
        N   W+ A+  E+ SL +N+TW L+K+P G + V CKW++K+K  ++     R+KARLVA+GFTQ+E VD+ +VFS VV+H SIR+LL++V QF+LEL+Q
Subjt:  NSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQ

Query:  MDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYGLK
        MDV TAFLYG L+E I M     +VE G ED VC L +S+YGLK
Subjt:  MDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYGLK

A0A445I3R1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-116; % identity: 36.3)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N+ C ++GIGS+  +  DG+ +IL EVR+V ELKRNLISLG  DK G+ +K E G+L V KDS+V + G   N  + V                +  
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
          TTA        GR++        +GL            H+R     +  +S + L +++  ++ CG +M++              C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                            K+ RA F      TK  LDY+ +DLWGP +  +H GARYFL+ VDD+S KLW+Y+ KT DE F+
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
         F  WK  +ENQT RKIK LRTDNGLE+ ++ F   CK +GI RH T AGTP QNGLAERFN TILER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  W+G PP+L+ L+VFGC  YA  +QDKL+PR  +CIFLGYP G+K YKLW L+ G +RC++S+DV F+E  M +K        + D + +E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLHQPSEEEVTETQTDIVDTSQQAEEEGILLIISLREIDKGGRTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLV
         +     +   +A   +   + QP      +                 L+  SL    +    +P T    + S+  + W+SA+ +E+ SL +N+TW L+
Subjt:  IDLLPSSSSPQIANSSSQPVLHQPSEEEVTETQTDIVDTSQQAEEEGILLIISLREIDKGGRTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLV

Query:  KRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFV
        K+P   + V CKW++K K  +      RFKARLVA+GFTQ+E +D+ EVFS VV+H SIRIL+++V +F+L L+Q+DV T FLYG L+E I M     F 
Subjt:  KRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFV

Query:  EVGGEDLVCHLHKSIYGLK
          G ED VC L+KS+YGLK
Subjt:  EVGGEDLVCHLHKSIYGLK

A0A5A7UB25 Putative gag-pol polyprotein (e value: 5.1e-120; % identity: 37.35)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N  C VKG GSV I   DG ++IL  VR+VP+LKRNLISLG LD++G   KSENG++ VTK S+VKL GT  +        L +LE T+ +     A
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
                     G++ ++ +        ++   L     H+ E G L  +     L  V  V++                    P C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                          + KS+R  F    + TK  LDY+ SDLWGP +  + GG+RYF++ +DDFS K+W+Y LK  DE F 
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
        KF+EWK  VENQT RK+K LRTDNGLE+VN++F   CK+ GI RH T   TP QNGLAERFNRTI+ER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  WTG+ P+L HLRVFGC  YA  +  KL  R  +C+F+GYP G+K YKLW ++ G  +CIIS+DV+F+E  MP+    Q + +  D    E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP
        +  + S   P I   +  P++      Q SE +  ++Q +  ++D     EE                               L+  +L         EP
Subjt:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP

Query:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL
         T+ +A+ S +   W  A+ +EL SL +N TWSLV +P  QK +  KW+YK+K     ++ PR+KARLVAKG+TQ+E VD+ E+FS VVRH+SIR++LS+
Subjt:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL

Query:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK
         V F++ ++QMDVTTAFL+G LEE IYM   PK  EV G ED+VC LHKS+YGLK
Subjt:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK

A0A5D3CTV2 Putative polyprotein (e value: 5.1e-120; % identity: 37.35)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N  C VKG GSV I   DG ++IL  VR+VP+LKRNLISLG LD++G   KSENG++ VTK S+VKL GT  +        L +LE T+ +     A
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
                     G++ ++ +        ++   L     H+ E G L  +     L  V  V++                    P C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                          + KS+R  F    + TK  LDY+ SDLWGP +  + GG+RYF++ +DDFS K+W+Y LK  DE F 
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
        KF+EWK  VENQT RK+K LRTDNGLE+VN++F   CK+ GI RH T   TP QNGLAERFNRTI+ER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  WTG+ P+L HLRVFGC  YA  +  KL  R  +C+F+GYP G+K YKLW ++ G  +CIIS+DV+F+E  MP+    Q + +  D    E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP
        +  + S   P I   +  P++      Q SE +  ++Q +  ++D     EE                               L+  +L         EP
Subjt:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP

Query:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL
         T+ +A+ S +   W  A+ +EL SL +N TWSLV +P  QK +  KW+YK+K     ++ PR+KARLVAKG+TQ+E VD+ E+FS VVRH+SIR++LS+
Subjt:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL

Query:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK
         V F++ ++QMDVTTAFL+G LEE IYM   PK  EV G ED+VC LHKS+YGLK
Subjt:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 5.1e-120; % identity: 37.35)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA
        +G+N  C VKG GSV I   DG ++IL  VR+VP+LKRNLISLG LD++G   KSENG++ VTK S+VKL GT  +        L +LE T+ +     A
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFA

Query:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE
                     G++ ++ +        ++   L     H+ E G L  +     L  V  V++                    P C H +        
Subjt:  LATTANPPSQQQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFE

Query:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE
                                          + KS+R  F    + TK  LDY+ SDLWGP +  + GG+RYF++ +DDFS K+W+Y LK  DE F 
Subjt:  DTIFCGVFDGQGPYDRFVARIVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFE

Query:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------
        KF+EWK  VENQT RK+K LRTDNGLE+VN++F   CK+ GI RH T   TP QNGLAERFNRTI+ER                                
Subjt:  KFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERK-------------------------------

Query:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE
                  WTG+ P+L HLRVFGC  YA  +  KL  R  +C+F+GYP G+K YKLW ++ G  +CIIS+DV+F+E  MP+    Q + +  D    E
Subjt:  ----------WTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFE

Query:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP
        +  + S   P I   +  P++      Q SE +  ++Q +  ++D     EE                               L+  +L         EP
Subjt:  IDLLPSSSSPQIANSSSQPVLH-----QPSEEEVTETQTD--IVDTSQQAEEEGI----------------------------LLIISLREIDKGGRTEP

Query:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL
         T+ +A+ S +   W  A+ +EL SL +N TWSLV +P  QK +  KW+YK+K     ++ PR+KARLVAKG+TQ+E VD+ E+FS VVRH+SIR++LS+
Subjt:  NTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSL

Query:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK
         V F++ ++QMDVTTAFL+G LEE IYM   PK  EV G ED+VC LHKS+YGLK
Subjt:  VVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEV-GGEDLVCHLHKSIYGLK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 5.0e-48; % identity: 28.74)
Query:  LSPQLC------KSSRASF---RPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLR
        LS ++C      K +R  F   +  T+I K  L  + SD+ GP    T     YF+ FVD F+     Y++K   +VF  F ++    E   + K+  L 
Subjt:  LSPQLC------KSSRASF---RPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLR

Query:  TDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER-------------------------------------------KWTGRPPNLR
         DNG EY+++     C   GI  HLT   TP  NG++ER  RTI E+                                            W  + P L+
Subjt:  TDNGLEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILER-------------------------------------------KWTGRPPNLR

Query:  HLRVFGCATYA--PTRQDKLQPRVKRCIFLGY-PHGIKAYKLWSLDPGKRRCIISKDVSFDE-----------DVMPWKFSSQTENRD------------
        HLRVFG   Y     +Q K   +  + IF+GY P+G   +KLW  D    + I+++DV  DE           + +  K S ++EN++            
Subjt:  HLRVFGCATYA--PTRQDKLQPRVKRCIFLGY-PHGIKAYKLWSLDPGKRRCIISKDVSFDE-----------DVMPWKFSSQTENRD------------

Query:  --------------KDLEGFEIDLLPSSSSPQI-------------------ANSSSQPVLHQ------------------PSEEEVTETQTDI--VDTS
                      KD +  E    P+ S   I                   +  S++  L++                  P+E   +ET   +  +   
Subjt:  --------------KDLEGFEIDLLPSSSSPQI-------------------ANSSSQPVLHQ------------------PSEEEVTETQTDI--VDTS

Query:  QQAEEEGILLI------------ISLREIDKG-----------GRTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKL
           + +GI +I            IS  E D                 PN++++     +   W  AI  EL++ K NNTW++ KRP  +  VD +WV+ +
Subjt:  QQAEEEGILLI------------ISLREIDKG-----------GRTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKL

Query:  KLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYG
        K N   + + R+KARLVA+GFTQ+  +DY E F+ V R +S R +LSLV+Q+NL++ QMDV TAFL G L+E+IYM L P+ +     D VC L+K+IYG
Subjt:  KLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKSIYG

Query:  LK
        LK
Subjt:  LK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.3e-75; % identity: 36.04)
Query:  KSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITL
        K  R SF+ S+    + LD + SD+ GP  + + GG +YF+TF+DD S KLWVY+LKT D+VF+ F ++   VE +T RK+K LR+DNG EY +  F   
Subjt:  KSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNGLEYVNDRFITL

Query:  CKTSGIDRHLTTAGTPLQNGLAERFNRTIL-----------------------------------------ERKWTGRPPNLRHLRVFGCATYA---PTR
        C + GI    T  GTP  NG+AER NRTI+                                         ER WT +  +  HL+VFGC  +A     +
Subjt:  CKTSGIDRHLTTAGTPLQNGLAERFNRTIL-----------------------------------------ERKWTGRPPNLRHLRVFGCATYA---PTR

Query:  QDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFEIDLLPSSSSPQIANSSSQPVL---HQPSE----
        + KL  +   CIF+GY      Y+LW  DP K++ I S+DV F E  +        + ++  +  F + +  +S++P  A S++  V     QP E    
Subjt:  QDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWKFSSQTENRDKDLEGFEIDLLPSSSSPQIANSSSQPVL---HQPSE----

Query:  EEVTETQTDIVDTSQQAEEEGILLIISLREIDKGGR------------TEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWV
         E  +   + V+   Q EE+   L  S R   +  R             EP +  + +     +  M A+ +E++SL++N T+ LV+ P G++ + CKWV
Subjt:  EEVTETQTDIVDTSQQAEEEGILLIISLREIDKGGR------------TEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWV

Query:  YKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKS
        +KLK + D   L R+KARLV KGF Q++ +D+ E+FS VV+ TSIR +LSL    +LE++Q+DV TAFL+G LEE+IYM     F   G + +VC L+KS
Subjt:  YKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDLVCHLHKS

Query:  IYGLK
        +YGLK
Subjt:  IYGLK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.5e-04; % identity: 35.44)
Query:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTF
        MGN     + GIG + I+   G   +LK+VR VP+L+ NLIS   LD+ G+     N    +TK S+V   G    + +
Subjt:  MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTF

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 2.5e-15; % identity: 43.64)
Query:  RTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRI
        + EP +   A+K      W  A+ +ELD+L  N TW LV  P  Q  + CKWV+K KL+ D   L R KARLVAKGF QEE + + E +S VVR  +IR 
Subjt:  RTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRI

Query:  LLSLVVQFNL
        +L++  Q  +
Subjt:  LLSLVVQFNL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.6e-44; % identity: 27.48)
Query:  KLLSPQLC---KSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNG
        K LS   C   KS++  F  ST  +   L+YI SD+W  + + +H   RY++ FVD F+   W+Y LK   +V E F+ +K+ +EN+   +I    +DNG
Subjt:  KLLSPQLC---KSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNG

Query:  LEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILE-----------------------------------------RKWTGRPPNLRHLRVFG
         E+V           GI    +   TP  NGL+ER +R I+E                                         +K  G  PN   LRVFG
Subjt:  LEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILE-----------------------------------------RKWTGRPPNLRHLRVFG

Query:  CATY---APTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK--------------------------------------
        CA Y    P  Q KL  + ++C+FLGY     AY    L     R  IS+ V FDE+  P+                                       
Subjt:  CATY---APTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK--------------------------------------

Query:  ----------FSSQTENRDKDLEGFEIDLLPSSSSP-------------------------------------------QIANSSSQPVLHQPSEEEVTE
                   S     R+  +    +D   SSS P                                           Q+A S S P     S    T 
Subjt:  ----------FSSQTENRDKDLEGFEIDLLPSSSSP-------------------------------------------QIANSSSQPVLHQPSEEEVTE

Query:  TQT--------------------DIVDTSQQA----------EEEGIL-------LIISLREIDKGGRTEPNTYNQAMKSQNSDLWMSAILKELDSLKEN
        + +                     IV+ + QA           + GI+       L +SL        +EP T  QA+K +    W +A+  E+++   N
Subjt:  TQT--------------------DIVDTSQQA----------EEEGIL-------LIISLREIDKGGRTEPNTYNQAMKSQNSDLWMSAILKELDSLKEN

Query:  NTWSLVKRPHGQ-KFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYM
        +TW LV  P      V C+W++  K N D S L R+KARLVAKG+ Q   +DY E FS V++ TSIRI+L + V  +  ++Q+DV  AFL G L +D+YM
Subjt:  NTWSLVKRPHGQ-KFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYM

Query:  NLSPKFVEVGGEDLVCHLHKSIYGLK
        +  P F++    + VC L K++YGLK
Subjt:  NLSPKFVEVGGEDLVCHLHKSIYGLK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    6.8e-45    28.48
Query:  KLLSPQLC---KSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNG
        KLLS   C   KS +  F  ST  +   L+YI SD+W    +S     RY++ FVD F+   W+Y LK   +V + F+ +K  VEN+   +I  L +DNG
Subjt:  KLLSPQLC---KSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNG

Query:  LEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILE-----------------------------------------RKWTGRPPNLRHLRVFG
         E+V  R        GI    +   TP  NGL+ER +R I+E                                         +K  G+PPN   L+VFG
Subjt:  LEYVNDRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILE-----------------------------------------RKWTGRPPNLRHLRVFG

Query:  CATY---APTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK-----FSSQTENRDKDLE--------------------
        CA Y    P  + KL+ + K+C F+GY     AY    +  G  R   S+ V FDE   P+       S+  E R                         
Subjt:  CATY---APTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVMPWK-----FSSQTENRDKDLE--------------------

Query:  -GFEIDL---------------------------LPSSSSP---------------QIANS-SSQPVLHQP---------------------SEEEVTET
         G  +D                             PSSS P               Q  NS S+ P+L+ P                     S   +   
Subjt:  -GFEIDL---------------------------LPSSSSP---------------QIANS-SSQPVLHQP---------------------SEEEVTET

Query:  QTDIVDTSQ---------------------QAEEEGILLIISLREIDKGG----------------RTEPNTYNQAMKSQNSDLWMSAILKELDSLKENN
         T I + +                      Q   +  +   S+    K G                 +EP T  QAMK    D W  A+  E+++   N+
Subjt:  QTDIVDTSQ---------------------QAEEEGILLIISLREIDKGG----------------RTEPNTYNQAMKSQNSDLWMSAILKELDSLKENN

Query:  TWSLV-KRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMN
        TW LV   P     V C+W++  K N D S L R+KARLVAKG+ Q   +DY E FS V++ TSIRI+L + V  +  ++Q+DV  AFL G L +++YM+
Subjt:  TWSLV-KRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMN

Query:  LSPKFVEVGGEDLVCHLHKSIYGLK
          P FV+    D VC L K+IYGLK
Subjt:  LSPKFVEVGGEDLVCHLHKSIYGLK

Arabidopsis top hits    e value    % identity    Alignment
AT1G03590.1 Protein phosphatase 2C family protein    1.8e-08    65.79
Query:  KDFMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKLLS
        +DFM +D  FCGVFDG GP+   VAR VRD+LP+KLLS
Subjt:  KDFMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKLLS

AT4G03415.1 Protein phosphatase 2C family protein    4.7e-09    72.22
Query:  KDFMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKL
        +DFM ED  FCGVFDG GPY   VAR VRDTLP+KL
Subjt:  KDFMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKL

AT4G03415.2 Protein phosphatase 2C family protein    4.7e-09    72.22
Query:  KDFMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKL
        +DFM ED  FCGVFDG GPY   VAR VRDTLP+KL
Subjt:  KDFMFEDTIFCGVFDGQGPYDRFVARIVRDTLPIKL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    3.0e-32    43.12
Query:  EPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILL
        EP+TYN+A +     +W  A+  E+ +++  +TW +   P  +K + CKWVYK+K N D   + R+KARLVAKG+TQ+E +D+ E FS V + TS++++L
Subjt:  EPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILL

Query:  SLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDL----VCHLHKSIYGLK
        ++   +N  L Q+D++ AFL G L+E+IYM L P +    G+ L    VC+L KSIYGLK
Subjt:  SLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVEVGGEDL----VCHLHKSIYGLK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    1.8e-16    43.64
Query:  RTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRI
        + EP +   A+K      W  A+ +ELD+L  N TW LV  P  Q  + CKWV+K KL+ D   L R KARLVAKGF QEE + + E +S VVR  +IR 
Subjt:  RTEPNTYNQAMKSQNSDLWMSAILKELDSLKENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRI

Query:  LLSLVVQFNL
        +L++  Q  +
Subjt:  LLSLVVQFNL


Sequences
CDS sequence
ATGGGGAACAATCAGCAGTGTGTGGTCAAAGGAATCGGTTCAGTCACCATAAGGTTACAAGATGGATCATTGAAGATTCTTAAAGAGGTGAGATTCGTGCCAGAGCTGAA
AAGAAATTTGATCTCCTTAGGGACCCTCGACAAAGCTGGTTTTGGTTACAAATCTGAAAATGGTCTGCTTACTGTTACTAAAGACAGTGTCGTGAAACTCACTGGCACTA
CAACAAATTCTACCTTTAATGTCGGTCTCTTCCTCTCTCTCCTCGAAAATACCTCCAGCGCCGCCCCTCCCCTCTTTGCTCTCGCAACGACTGCGAATCCTCCATCACAG
CAGCAAGTAGGTCGTATCGTGGAAGTTTGCCTCCCTAAGAAGCCTTCTGGTTTGAATGTTTTTGTTTCTTCTCTTGTTTGTCTTGATTTGCATTTAAGGGAAATTGGAGT
ACTCATGGGGATTTTGAGTTTTTCGACGCTTTCTGATGTAGATGTTGTACAGGTTCCTTGTGGCACTGAGATGGATCAGAAGCCTCCAAAATGTCCTAAACGATGTCCTG
TCTCACCTCTATGCAGTCATAGATTAAGTCAAAAGGATTTCATGTTCGAGGACACAATCTTTTGCGGTGTATTTGATGGTCAGGGACCTTACGATCGTTTCGTTGCTCGA
ATAGTGAGGGATACGTTGCCTATAAAGTTGTTGTCTCCTCAGTTGTGCAAATCATCACGAGCGAGCTTTAGACCTTCAACCTATATTACCAAAGACAAACTTGACTATAT
TCGTTCAGATCTTTGGGGTCCAGCAAGAGTTTCTACACATGGTGGTGCTAGGTACTTTTTGACCTTTGTAGATGACTTCTCTATAAAGCTGTGGGTCTACATGCTAAAGA
CTATAGATGAGGTTTTTGAAAAATTTGTTGAATGGAAAGACTTTGTTGAGAACCAAACCGACAGGAAAATCAAGCTTCTTAGAACAGACAATGGTTTGGAGTACGTGAAT
GATAGATTCATCACTTTGTGCAAGACTTCAGGCATTGATAGACACTTGACAACTGCTGGAACTCCCCTACAAAATGGTCTTGCTGAAAGATTTAACCGGACCATCTTGGA
AAGGAAATGGACTGGTCGTCCACCAAACTTGAGACATCTTAGAGTGTTCGGATGTGCAACATATGCTCCTACTCGACAAGACAAGCTTCAACCAAGGGTAAAGAGATGTA
TCTTCCTAGGATATCCTCATGGAATAAAGGCTTACAAGTTGTGGTCTTTGGATCCTGGAAAAAGAAGATGCATAATAAGCAAGGATGTATCTTTTGATGAGGATGTTATG
CCTTGGAAGTTTTCTTCACAGACTGAGAACAGAGACAAAGATTTAGAAGGCTTTGAAATTGACTTGTTACCTTCGAGTAGCAGTCCACAGATTGCTAATTCAAGCAGTCA
ACCTGTTTTGCATCAACCTTCTGAAGAAGAAGTGACAGAAACACAAACTGACATTGTTGATACTTCTCAACAAGCTGAAGAGGAAGGGATCTTGTTAATTATCAGCTTAC
GAGAGATAGACAAAGGAGGACGCACAGAACCAAACACTTATAATCAGGCTATGAAGTCTCAGAACTCAGATCTTTGGATGAGTGCCATACTGAAAGAGCTAGATTCATTG
AAAGAGAACAACACATGGAGTTTAGTGAAAAGGCCCCATGGACAGAAATTTGTGGACTGCAAATGGGTTTACAAGTTGAAACTGAATGTTGATCCCTCTGCATTACCTAG
ATTTAAAGCTAGGTTAGTGGCTAAAGGCTTCACACAAGAGGAAGTAGTGGATTATACAGAAGTTTTTTCCCTAGTGGTGAGACATACATCAATCAGAATTTTACTATCAT
TGGTAGTTCAGTTCAACTTGGAGCTTCAGCAGATGGATGTCACCACTGCCTTCCTTTATGGCCACTTAGAAGAAGATATCTACATGAATTTGTCGCCCAAATTTGTTGAA
GTTGGAGGGGAAGACTTAGTTTGCCATCTTCACAAGTCCATCTACGGCTTGAAGTAG
mRNA sequence
ATGGGGAACAATCAGCAGTGTGTGGTCAAAGGAATCGGTTCAGTCACCATAAGGTTACAAGATGGATCATTGAAGATTCTTAAAGAGGTGAGATTCGTGCCAGAGCTGAA
AAGAAATTTGATCTCCTTAGGGACCCTCGACAAAGCTGGTTTTGGTTACAAATCTGAAAATGGTCTGCTTACTGTTACTAAAGACAGTGTCGTGAAACTCACTGGCACTA
CAACAAATTCTACCTTTAATGTCGGTCTCTTCCTCTCTCTCCTCGAAAATACCTCCAGCGCCGCCCCTCCCCTCTTTGCTCTCGCAACGACTGCGAATCCTCCATCACAG
CAGCAAGTAGGTCGTATCGTGGAAGTTTGCCTCCCTAAGAAGCCTTCTGGTTTGAATGTTTTTGTTTCTTCTCTTGTTTGTCTTGATTTGCATTTAAGGGAAATTGGAGT
ACTCATGGGGATTTTGAGTTTTTCGACGCTTTCTGATGTAGATGTTGTACAGGTTCCTTGTGGCACTGAGATGGATCAGAAGCCTCCAAAATGTCCTAAACGATGTCCTG
TCTCACCTCTATGCAGTCATAGATTAAGTCAAAAGGATTTCATGTTCGAGGACACAATCTTTTGCGGTGTATTTGATGGTCAGGGACCTTACGATCGTTTCGTTGCTCGA
ATAGTGAGGGATACGTTGCCTATAAAGTTGTTGTCTCCTCAGTTGTGCAAATCATCACGAGCGAGCTTTAGACCTTCAACCTATATTACCAAAGACAAACTTGACTATAT
TCGTTCAGATCTTTGGGGTCCAGCAAGAGTTTCTACACATGGTGGTGCTAGGTACTTTTTGACCTTTGTAGATGACTTCTCTATAAAGCTGTGGGTCTACATGCTAAAGA
CTATAGATGAGGTTTTTGAAAAATTTGTTGAATGGAAAGACTTTGTTGAGAACCAAACCGACAGGAAAATCAAGCTTCTTAGAACAGACAATGGTTTGGAGTACGTGAAT
GATAGATTCATCACTTTGTGCAAGACTTCAGGCATTGATAGACACTTGACAACTGCTGGAACTCCCCTACAAAATGGTCTTGCTGAAAGATTTAACCGGACCATCTTGGA
AAGGAAATGGACTGGTCGTCCACCAAACTTGAGACATCTTAGAGTGTTCGGATGTGCAACATATGCTCCTACTCGACAAGACAAGCTTCAACCAAGGGTAAAGAGATGTA
TCTTCCTAGGATATCCTCATGGAATAAAGGCTTACAAGTTGTGGTCTTTGGATCCTGGAAAAAGAAGATGCATAATAAGCAAGGATGTATCTTTTGATGAGGATGTTATG
CCTTGGAAGTTTTCTTCACAGACTGAGAACAGAGACAAAGATTTAGAAGGCTTTGAAATTGACTTGTTACCTTCGAGTAGCAGTCCACAGATTGCTAATTCAAGCAGTCA
ACCTGTTTTGCATCAACCTTCTGAAGAAGAAGTGACAGAAACACAAACTGACATTGTTGATACTTCTCAACAAGCTGAAGAGGAAGGGATCTTGTTAATTATCAGCTTAC
GAGAGATAGACAAAGGAGGACGCACAGAACCAAACACTTATAATCAGGCTATGAAGTCTCAGAACTCAGATCTTTGGATGAGTGCCATACTGAAAGAGCTAGATTCATTG
AAAGAGAACAACACATGGAGTTTAGTGAAAAGGCCCCATGGACAGAAATTTGTGGACTGCAAATGGGTTTACAAGTTGAAACTGAATGTTGATCCCTCTGCATTACCTAG
ATTTAAAGCTAGGTTAGTGGCTAAAGGCTTCACACAAGAGGAAGTAGTGGATTATACAGAAGTTTTTTCCCTAGTGGTGAGACATACATCAATCAGAATTTTACTATCAT
TGGTAGTTCAGTTCAACTTGGAGCTTCAGCAGATGGATGTCACCACTGCCTTCCTTTATGGCCACTTAGAAGAAGATATCTACATGAATTTGTCGCCCAAATTTGTTGAA
GTTGGAGGGGAAGACTTAGTTTGCCATCTTCACAAGTCCATCTACGGCTTGAAGTAG
Protein sequence
MGNNQQCVVKGIGSVTIRLQDGSLKILKEVRFVPELKRNLISLGTLDKAGFGYKSENGLLTVTKDSVVKLTGTTTNSTFNVGLFLSLLENTSSAAPPLFALATTANPPSQ
QQVGRIVEVCLPKKPSGLNVFVSSLVCLDLHLREIGVLMGILSFSTLSDVDVVQVPCGTEMDQKPPKCPKRCPVSPLCSHRLSQKDFMFEDTIFCGVFDGQGPYDRFVAR
IVRDTLPIKLLSPQLCKSSRASFRPSTYITKDKLDYIRSDLWGPARVSTHGGARYFLTFVDDFSIKLWVYMLKTIDEVFEKFVEWKDFVENQTDRKIKLLRTDNGLEYVN
DRFITLCKTSGIDRHLTTAGTPLQNGLAERFNRTILERKWTGRPPNLRHLRVFGCATYAPTRQDKLQPRVKRCIFLGYPHGIKAYKLWSLDPGKRRCIISKDVSFDEDVM
PWKFSSQTENRDKDLEGFEIDLLPSSSSPQIANSSSQPVLHQPSEEEVTETQTDIVDTSQQAEEEGILLIISLREIDKGGRTEPNTYNQAMKSQNSDLWMSAILKELDSL
KENNTWSLVKRPHGQKFVDCKWVYKLKLNVDPSALPRFKARLVAKGFTQEEVVDYTEVFSLVVRHTSIRILLSLVVQFNLELQQMDVTTAFLYGHLEEDIYMNLSPKFVE
VGGEDLVCHLHKSIYGLK