; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G23920 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G23920
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:21018664..21021915
RNA-Seq ExpressionCSPI03G23920
SyntenyCSPI03G23920
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AFK13856.1 Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris]6.3e-24948.42Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        +EG+AL WY+W++ R    +WE  +  +  +F+P   G  H + +   Q  +V EYRR+  + +  L  + +  L  KF+ GL  E+QSEIR LNP  L+
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIED-DQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMR
          M +A  +E+ ++V   +R      + S+ N+    GPN   N +   V      S   T + + NS++S T++ +     + +      RR+T+ E++
Subjt:  AKMLMAQVIED-DQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMR

Query:  MKKEKGLCFRCDEKFNPGPQ---------------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHN
         K+ KGLCF+CDEK+  G Q                ++L G +   +E      E+I  E+   SL+S++G S+PKT+K+ G I N EVVV++D GA HN
Subjt:  MKKEKGLCFRCDEKFNPGPQ---------------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHN

Query:  FISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTI-VNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNK
        F+S + + +L IPV   + +GV LG G  VR TG+C++V L +   L +  DFLPL LG++DV LGV WLETLG V+ N+K  +M F LG     L G+ 
Subjt:  FISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTI-VNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNK

Query:  SLVKSQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQ
        +L +S+VSLK+M+   +KE  G+ +E + VE GGA   +D+  +    + P +Q ++  F  VFE+   LPP R H+HAI L+ G+  V VRPYRYPQFQ
Subjt:  SLVKSQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQ

Query:  KDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT
        KDEIE+L+K+ML A IIQPS S FSSPV+LVKKKDGSWRFCVDYRALN  T+PDKYPIPV+DELLDEL GAT+FSK+DL++GYH I VR  D HKTAFRT
Subjt:  KDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT

Query:  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADP
        HEGHYEFLVMPFGL NAP TFQS+MN++ RP+LR+FVLVF DDILIYS S EEH+  L MVL  L  H L  N KKC+F   ++ YLGH+IS  GVA D 
Subjt:  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADP

Query:  TKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQ
         K+ A+++W  PKN++ELRGFLGL GYYRKFVANY  IA PLT+ LKK  F+W+ TA +AF+QLKSAM+S PVL +P+F   FV++TDASG G+GAVLMQ
Subjt:  TKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQ

Query:  HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR-LP
          RP+A++S+ L    + K+VYE++LMAI  AVQKW+ +LLG+ FVVRTDQ+SL+++ +QR IG E+Q+W++KL+GYDF I YK G+ N+ ADALSR   
Subjt:  HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR-LP

Query:  PLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
           ELG I  V  +       ++TG+  L  +R  L  G+ TP  ++L    L +  R V+P  S  IP LL E+H+ P+G
Subjt:  PLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

XP_028548251.1 uncharacterized protein LOC110111203 [Dendrobium catenatum]1.4e-24848.98Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L 
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM
          + MA++IED     Q +R    G    + N T G  G        G K  DR A ++                                +RR+T++E+
Subjt:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM

Query:  RMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKVAEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS
        + K+ KGLC+RCDEKF PG + +D           E ++V EE    E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA HNFI+
Subjt:  RMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKVAEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS

Query:  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVK
          +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +
Subjt:  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVK

Query:  SQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEI
        S+V+LK+++   + E +G LIEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ GA  V+VRPYRYP  QK+EI
Subjt:  SQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEI

Query:  EKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH
        EKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDYRALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRTHEGH
Subjt:  EKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH

Query:  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIA
        YEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDDILIYS SL+EHL  L  VL TL  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ 
Subjt:  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIA

Query:  AMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP
        AM  WP PKN++ LRGFLGL GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +PDF Q  V++TDASG+G+GAVLMQ  RP
Subjt:  AMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP

Query:  VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL
        +AF+SQ L    R K+VYER+LMAIV+A+QKWRP+LLG+ F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR     +L
Subjt:  VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL

Query:  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
           SV   ++      +   +E L S R ++  G+  P GY+++ ++L +  RLVLP  S  IPL+L EFH   IG
Subjt:  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

XP_028552250.1 uncharacterized protein LOC114580023 [Dendrobium catenatum]7.2e-25349.79Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        LEG+AL W++W + R  + SWEEF+ LL  RF+ S +G  + + + L QE TV EYR+  E  S+ L+ ++D  LE  F+ GL+  I++ IR ++P GL 
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM
          M  AQ++ED   V+  R  GV + P+        GP                   TI +     +   +TT  +   F          +R+TDSE++ 
Subjt:  AKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM

Query:  KKEKGLCFRCDEKFNPGPQ-----------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEE
        K+ KGLC+RCDEKF  G +           G+    E +  A+E  ++ E  +  +  +SL+S+ G +S  T+K++GEI   +V+VL+D GA HNFI+  
Subjt:  KKEKGLCFRCDEKFNPGPQ-----------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEE

Query:  VVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQ
         V+++ +PV      GV+LGTG   R  G CK V LT+       DFL L LGS DV LG+ WL++LG++  N+K   ME+  G   V LQG+ SL +++
Subjt:  VVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQ

Query:  VSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEK
        V+LKS+    + E +G LIEL   +Q G E  ++        +  E+Q +L    +VF+    LPP R  +HAI L++G   ++V+PYRYP  QK+EIE+
Subjt:  VSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEK

Query:  LVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYE
        LV +ML A++IQPS S FSSPVLLVKKKDGSWRFCVDYRALN  T+ DK+PIPVVDELLDEL GATIFSK+DLKSGYH IR+RA D+ KTAFRTHEGHYE
Subjt:  LVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYE

Query:  FLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAM
        FLVMPFGL NAP+TFQ++MN + +PYLR+FVLVFFDDIL+Y+ SL++HLH L +VL TL+ H+L AN KKC FA  ++EYLGHIIS+DGVAADPTKI AM
Subjt:  FLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAM

Query:  VKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVA
        V WP PK++K LRGFLGL GYYR+F+  YGSIA PLT  LKK  FQW E AE A + LK AM S PVL +PDFTQ FV++TDAS VG+GAVLMQ+ RP+A
Subjt:  VKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVA

Query:  FFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGL
        FFS+ L    R K+VYER+LMAIV A+QKWRP+LLG+ F+VRTDQ+SLK+ LEQR +  E+QRW+AKLLGY+F I+YK G++NKAADALSR+    +L  
Subjt:  FFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGL

Query:  ISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
        +SV   ++    + +    E L  IR ++  G+   +GY L+  +L Y  RLVL  +S  IP+LL E+H+  IG
Subjt:  ISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

XP_028552383.1 uncharacterized protein LOC114580110 [Dendrobium catenatum]1.1e-24848.67Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L 
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM
          + MA++IED     Q +R    G    + N T G  G        G K  DR A ++                                +RR+T++E+
Subjt:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM

Query:  RMKKEKGLCFRCDEKFNPG---------------PQGEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIH
        + K+ KGLC+RCDEKF PG               P+GE++  E+       E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA H
Subjt:  RMKKEKGLCFRCDEKFNPG---------------PQGEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIH

Query:  NFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNK
        NFI+  +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV LG+ WL+TLG++  N+K   +E+  GE  V L G+ 
Subjt:  NFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNK

Query:  SLVKSQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQ
         L +S+V+LK+++   + E +G LIEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ GA  V+VRPYRYP  Q
Subjt:  SLVKSQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQ

Query:  KDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT
        K+EIEKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDYRALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRT
Subjt:  KDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT

Query:  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADP
        HEGHYEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDDILIYS SL+EHL  L  VL TL  H+L  N KKC FA   +EYLGHIIS++GVAADP
Subjt:  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADP

Query:  TKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQ
        +K+ AM  WP PKN++ LRGFLGL GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +PDF Q  V++TDASG+G+GAVLMQ
Subjt:  TKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQ

Query:  HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPP
          RP+AF+SQ L    R K+VYER+LMAIV+A+QKWRP+LLG+ F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR   
Subjt:  HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPP

Query:  LFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
          +L   SV   ++      +   +E L S R ++  G+  P GY+++ ++L +  RLVLP  S  IPL+L EFH   IG
Subjt:  LFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

XP_028552640.1 uncharacterized protein LOC114580166 [Dendrobium catenatum]1.4e-24848.98Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L 
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM
          + MA++IED     Q +R    G    + N T G  G        G K  DR A ++                                +RR+T++E+
Subjt:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM

Query:  RMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKVAEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS
        + K+ KGLC+RCDEKF PG + +D           E ++V EE    E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA HNFI+
Subjt:  RMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKVAEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS

Query:  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVK
          +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +
Subjt:  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVK

Query:  SQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEI
        S+V+LK+++   + E +G LIEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ GA  V+VRPYRYP  QK+EI
Subjt:  SQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEI

Query:  EKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH
        EKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDYRALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRTHEGH
Subjt:  EKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH

Query:  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIA
        YEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDDILIYS SL+EHL  L  VL TL  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ 
Subjt:  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIA

Query:  AMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP
        AM  WP PKN++ LRGFLGL GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +PDF Q  V++TDASG+G+GAVLMQ  RP
Subjt:  AMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP

Query:  VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL
        +AF+SQ L    R K+VYER+LMAIV+A+QKWRP+LLG+ F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR     +L
Subjt:  VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL

Query:  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
           SV   ++      +   +E L S R ++  G+  P GY+++ ++L +  RLVLP  S  IPL+L EFH   IG
Subjt:  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

TrEMBL top hitse value%identityAlignment
A0A087GEK8 Uncharacterized protein5.6e-24348.82Show/hide
Query:  ALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKML
        ALDWY+W+ DR    SW + R  + A++          R + L+Q+  V ++ R     +T+  ++ +  LE  F+ GL+  I+S ++   P  LE  M 
Subjt:  ALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKML

Query:  MAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPN---GSGNQTGSKVMDRVATSR-TITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM
        +A++++              A+    +KT+ +GP    G  N TG      +A ++    + PS N+ S S   T   +    N +  PYRR+T  EM  
Subjt:  MAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPN---GSGNQTGSKVMDRVATSR-TITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM

Query:  KKEKGLCFRCDEKFNPGPQ--------------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFI
        +K  GLCFRCDEK++   Q              G D+  E D   ++  D  +Q  TE+A LSL+S+VG SSP T+K+ G I+  EVVVL+D GA HNF+
Subjt:  KKEKGLCFRCDEKFNPGPQ--------------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFI

Query:  SEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLV
        SE++V  L +      +YGV+ G G  VR  G+C+ + L +  L I  DFLPL LGSADV LG+ WL +LG++  N+    M FSLG    +LQG+    
Subjt:  SEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLV

Query:  KSQVSLKSMIFKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIE
         S +SLKS++   +DQGV +    VE  G  +S D +A   + +   +  ++  F  VFE    LPP R   H I LESGA+AV+VRP+RYPQ QK EIE
Subjt:  KSQVSLKSMIFKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIE

Query:  KLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHY
        K V  ML A IIQ S S FSSPVLLVKKKDGSWRFC+DYRALN  TIPD +PIP++D+LLDEL GAT+FSK+DLKSGYH I V+  +V KTAFRTH+GHY
Subjt:  KLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHY

Query:  EFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIY-SSLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAA
        EFLVMPFGL NAPTTFQ++MN++ R +LRKFVLVFFDDIL+Y SSL+EH   L +VL+ L   +L AN KKCQF    IEYLGH+IS +GV+ADP+K+ A
Subjt:  EFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIY-SSLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAA

Query:  MVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPV
        MV WP PKN+K LRGFLGL GYYR+FV  YGSIA PLT LLKK KFQW+E A  AF++LK AM +VPVL + DF++ FV+++DASG+G+GAVL+Q Q+PV
Subjt:  MVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPV

Query:  AFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELG
        A+FSQAL    + K+VYER+LMAIV A+QKWR +LLG+ F+VRTDQKSLKFLLEQR +  EYQ+W+ K+LG++F I YK G+ENKAADALSR+  L +L 
Subjt:  AFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELG

Query:  LISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
         +SV   +      ++V  N     I+  ++    T  GYS+    L Y+ +LVLP++S  I +LL EFHN  +G
Subjt:  LISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

A0A2I0WN12 Putative mitochondrial protein7.0e-24648.71Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        LEG A  W+K+ D    + +W +F++ +  RF+ S   +   +   L QE TV EYR++ E     ++ +S++ L   F+ GL+ EI+  ++ + P  L 
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM
          M +AQ++E+ +                      N  +   N +G         +RT T   +    S S    S  + +     A  ++++T+ EM+ 
Subjt:  AKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM

Query:  KKEKGLCFRCDEKFNPGPQGED-----LSGEIDKVAEETEDKNEQIN---TEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVK
        K+ KGLCFRC+EKF PG + +D     L+  ID+  EE +D  E+ +    E+A +SL+S++GF+   T+K+KG+I   EVVVL+D GA HNFIS +V +
Subjt:  KKEKGLCFRCDEKFNPGPQGED-----LSGEIDKVAEETEDKNEQIN---TEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVK

Query:  ELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSL
        EL I      +YGV++GTG +  +TG+CK V +++  + +  DFLPL LGS DV LG+ WL+TLG+   N+    ME  +    V L+G   L ++ VSL
Subjt:  ELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSL

Query:  KSM--IFKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEKLVK
        +SM  I  +E  G L+EL ++E     E K        ++   VQ  L  F  VF+    LPP R+ +H I L+ G   ++VRPYRYPQ QKDEIEKLV 
Subjt:  KSM--IFKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEKLVK

Query:  KMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLV
        +ML   IIQPS S FSSPVLLVKKKDGSWRFCVDYRALN  T+PDK+PIPV+DEL+DEL GA +F+KIDLKSGYH IR+R  DV KTAFRTHEGHYEFLV
Subjt:  KMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLV

Query:  MPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKW
        MPFGL NAP TFQ++MN I RP+LR+FVLVFFDDILIYS + EEHL  L +VL  L  H+L AN KKC FA  Q+EYLGH+IS +GVAAD +KI AM+ W
Subjt:  MPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKW

Query:  PAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVAFFS
        P PK +K LRGFLGL GYYR+FV  Y +IA PLT+LLKK  F W E A +AF++LK AM +VPVL +PDF Q FVL+TDASG G+GAVLMQ+ R +A+FS
Subjt:  PAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVAFFS

Query:  QALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLISV
        Q L    R K+VYER+LMAIV A+QKWRP+LLG+ F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I Y+ G+ENKAADALSR    F+   +SV
Subjt:  QALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLISV

Query:  VGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
           ++     ++   NE L  I+  +++ + +  GYS+ GE L Y  R V+P  S  IP LL EFH   IG
Subjt:  VGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

A0A2I0X132 Putative mitochondrial protein5.7e-24848.67Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L 
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM
          + MA++IED     Q +R    G    + N T G  G        G K  DR A ++                                +RR+T++E+
Subjt:  AKMLMAQVIEDD-QVVQLKRIHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEM

Query:  RMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKVAEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS
        + K+ KGLC+RCDEKF PG + +D           E ++V EE    E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA HNFI+
Subjt:  RMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKVAEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS

Query:  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVK
          +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +
Subjt:  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVK

Query:  SQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEI
        S+V+LK+++   + E +G LIEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ GA  V+VRPYRYP  QK+EI
Subjt:  SQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEI

Query:  EKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH
        EKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDYRALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRTHEGH
Subjt:  EKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH

Query:  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIA
        YEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDD LIYS SL+EHL  L  VL TL  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ 
Subjt:  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIA

Query:  AMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP
        AM  WP PKN++ LRGFLGL GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +P+F Q  V++TDASG+G+GAVLMQ  RP
Subjt:  AMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP

Query:  VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL
        +AF+SQ L    + K+VYER+LMAIV+A+QKWRP+LLG+ F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR     +L
Subjt:  VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL

Query:  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
           SV   ++      +   +E L S+R ++  G+  P GY+++ ++L +  RLVLP+ S  IPL+L EFH   IG
Subjt:  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

A0A5D3BD16 Ty3/gypsy retrotransposon protein9.5e-24346.81Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        +EG+ L W++W ++R +  SW+E ++ L+ RF+    G   AR + ++QE +V EY +R E+ S  L +M++  L   F  GL   I++E+  +  VGLE
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM
          M  A++ E+   +  +  HG    P   +  S   P     +T S  +  +A     ++N + NS + +T +    D          +RR TDSE++ 
Subjt:  AKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRM

Query:  KKEKGLCFRCDEKFNPGPQGEDLSGEIDKVAEETED--------KNEQIN-TEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVV
        +++KGLC+RC+E F+ G + ++    +  VA++ ED        + E +  + +  LSL+S+VG ++P T K+KG + N E+V++VD GA HNFIS ++V
Subjt:  KKEKGLCFRCDEKFNPGPQGEDLSGEIDKVAEETED--------KNEQIN-TEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVV

Query:  KELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVS
        + LK+P+     YGV++G+G  V+  G+CK + + +  +SI  DFLPL LG+ D+ LG+ WL+  G +  ++K   M F +G+  VIL+G+ SL + ++S
Subjt:  KELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVS

Query:  LKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEKLV
        LK ++  ++ +DQG L+    +    A+     + D +   + E  ++   FG VFE  + LPP R  DH I+L+ G   +NVRPYRYP  QK+EIE+LV
Subjt:  LKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEKLV

Query:  KKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFL
          ML + II+PS S FSSPV+LVKKKDG WRFCVDYRALN AT+PDK+PIP++DELLDEL GA+IFSKIDLKSGYH IRVR  D+ KTAFRTHEGHYEFL
Subjt:  KKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFL

Query:  VMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVK
        VMPFGL NAP TFQ++MN + RPYLRKF+LVFFDDIL+YS  +E HL  L MV + L  H L AN +KC FA D+IEYLGH +S+ GV AD  KI AM++
Subjt:  VMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVK

Query:  WPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVAFF
        WP PKN++ELRGFLGL GYYR+FVANYG+IA PLT+L KK  F+W+E A KAF+QLK AM+++PVL +PDF   F ++TDASG+G+GAVL Q++RP+A+F
Subjt:  WPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVAFF

Query:  SQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLIS
        SQ L  T R K+VYER+LMAIV AV+KWR +LLG  FVV TDQK+L+ +LEQR I    Q+W+ KL+G+DF I Y+ G ENKAADALSR+P   EL  I+
Subjt:  SQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLIS

Query:  VVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
        V   L+  +   +V  +E L +I   ++        Y+++   L Y  RLV+   S  IP +L  FH+  +G
Subjt:  VVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

J3SDF5 Ty3/gypsy retrotransposon protein3.0e-24948.42Show/hide
Query:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE
        +EG+AL WY+W++ R    +WE  +  +  +F+P   G  H + +   Q  +V EYRR+  + +  L  + +  L  KF+ GL  E+QSEIR LNP  L+
Subjt:  LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLE

Query:  AKMLMAQVIED-DQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMR
          M +A  +E+ ++V   +R      + S+ N+    GPN   N +   V      S   T + + NS++S T++ +     + +      RR+T+ E++
Subjt:  AKMLMAQVIED-DQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMR

Query:  MKKEKGLCFRCDEKFNPGPQ---------------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHN
         K+ KGLCF+CDEK+  G Q                ++L G +   +E      E+I  E+   SL+S++G S+PKT+K+ G I N EVVV++D GA HN
Subjt:  MKKEKGLCFRCDEKFNPGPQ---------------GEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHN

Query:  FISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTI-VNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNK
        F+S + + +L IPV   + +GV LG G  VR TG+C++V L +   L +  DFLPL LG++DV LGV WLETLG V+ N+K  +M F LG     L G+ 
Subjt:  FISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTI-VNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNK

Query:  SLVKSQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQ
        +L +S+VSLK+M+   +KE  G+ +E + VE GGA   +D+  +    + P +Q ++  F  VFE+   LPP R H+HAI L+ G+  V VRPYRYPQFQ
Subjt:  SLVKSQVSLKSMI--FKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQ

Query:  KDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT
        KDEIE+L+K+ML A IIQPS S FSSPV+LVKKKDGSWRFCVDYRALN  T+PDKYPIPV+DELLDEL GAT+FSK+DL++GYH I VR  D HKTAFRT
Subjt:  KDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT

Query:  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADP
        HEGHYEFLVMPFGL NAP TFQS+MN++ RP+LR+FVLVF DDILIYS S EEH+  L MVL  L  H L  N KKC+F   ++ YLGH+IS  GVA D 
Subjt:  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADP

Query:  TKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQ
         K+ A+++W  PKN++ELRGFLGL GYYRKFVANY  IA PLT+ LKK  F+W+ TA +AF+QLKSAM+S PVL +P+F   FV++TDASG G+GAVLMQ
Subjt:  TKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQ

Query:  HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR-LP
          RP+A++S+ L    + K+VYE++LMAI  AVQKW+ +LLG+ FVVRTDQ+SL+++ +QR IG E+Q+W++KL+GYDF I YK G+ N+ ADALSR   
Subjt:  HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR-LP

Query:  PLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG
           ELG I  V  +       ++TG+  L  +R  L  G+ TP  ++L    L +  R V+P  S  IP LL E+H+ P+G
Subjt:  PLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.3e-9244.42Show/hide
Query:  YRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLV-KKKDGS----WRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRV
        Y YPQ  + E+E  ++ ML   II+ S S ++SP+ +V KK+D S    +R  +DYR LN  T+ D++PIP +DE+L +L     F+ IDL  G+H I +
Subjt:  YRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLV-KKKDGS----WRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRV

Query:  RATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLG
            V KTAF T  GHYE+L MPFGLKNAP TFQ  MNDILRP L K  LV+ DDI+++S SL+EHL  L +V E L    L   L KC+F   +  +LG
Subjt:  RATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLG

Query:  HIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGK--FQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLK
        H+++ DG+  +P KI A+ K+P P   KE++ FLGL GYYRKF+ N+  IA P+T+ LKK       N   + AF++LK  +   P+L +PDFT+ F L 
Subjt:  HIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGK--FQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLK

Query:  TDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKG
        TDAS V +GAVL Q   P+++ S+ L       +  E++L+AIV A + +R +LLG+ F + +D + L +L   +    +  RW  KL  +DF I+Y KG
Subjt:  TDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKG

Query:  MENKAADALSRL
         EN  ADALSR+
Subjt:  MENKAADALSRL

P20825 Retrovirus-related Pol polyprotein from transposon 2976.6e-9243.48Show/hide
Query:  RPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKD-----GSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHI
        + Y   Q  + E+E  V++ML   +I+ S S ++SP  +V KK        +R  +DYR LN  TIPD+YPIP +DE+L +L     F+ IDL  G+H I
Subjt:  RPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKD-----GSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHI

Query:  RVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEY
         +    + KTAF T  GHYE+L MPFGL+NAP TFQ  MN+ILRP L K  LV+ DDI+I+S SL EHL+ + +V   L    L   L KC+F   +  +
Subjt:  RVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEY

Query:  LGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAE--KAFQQLKSAMMSVPVLGIPDFTQGFV
        LGHI++ DG+  +P K+ A+V +P P   KE+R FLGL GYYRKF+ NY  IA P+T  LKK      +  E  +AF++LK+ ++  P+L +PDF + FV
Subjt:  LGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAE--KAFQQLKSAMMSVPVLGIPDFTQGFV

Query:  LKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYK
        L TDAS + +GAVL Q+  P++F S+ L       +  E++L+AIV A + +R +LLG+ F++ +D + L++L   +  G + +RW  +L  Y F I+Y 
Subjt:  LKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYK

Query:  KGMENKAADALSRL
        KG EN  ADALSR+
Subjt:  KGMENKAADALSRL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.2e-8939.89Show/hide
Query:  NQLPP-PRDHD-----HAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVV
        N LPP P D +     H IE++ GAR   ++PY   +  + EI K+V+K+L  K I PSKS  SSPV+LV KKDG++R CVDYR LN ATI D +P+P +
Subjt:  NQLPP-PRDHD-----HAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVV

Query:  DELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMV
        D LL  +  A IF+ +DL SGYH I +   D +KTAF T  G YE+ VMPFGL NAP+TF   M D  R    +FV V+ DDILI+S S EEH   L  V
Subjt:  DELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMV

Query:  LETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAF
        LE L    L+   KKC+FA ++ E+LG+ I    +A    K AA+  +P PK VK+ + FLG+  YYR+F+ N   IA P+ QL    K QW E  +KA 
Subjt:  LETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAF

Query:  QQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP------VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLK
        ++LK+A+ + PVL   +    + L TDAS  GIGAVL +          V +FS++L    +     E +L+ I++A+  +R  L GK F +RTD  SL 
Subjt:  QQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP------VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLK

Query:  FLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRL-----------------------PPLFELGLISVVGRLNPLIFIDQVTGNEALNSIR
         L  +       QRW+  L  YDF +EY  G +N  ADA+SR                         PL    LI +       +  + ++   A  S +
Subjt:  FLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRL-----------------------PPLFELGLISVVGRLNPLIFIDQVTGNEALNSIR

Query:  LSLINGQPTPEGYSLQGEVLCYHDRLVLP
          L   +   + YSL+ E++ Y DRLV+P
Subjt:  LSLINGQPTPEGYSLQGEVLCYHDRLVLP

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.0e-8439.86Show/hide
Query:  VNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKK-----DGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGY
        +  + Y YP   + E+E+ + ++L   II+PS S ++SP+ +V KK     +  +R  VD++ LN  TIPD YPIP ++  L  L  A  F+ +DL SG+
Subjt:  VNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKK-----DGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGY

Query:  HHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQ
        H I ++ +D+ KTAF T  G YEFL +PFGLKNAP  FQ +++DILR ++ K   V+ DDI+++S   + H   L +VL +L    L  NL+K  F   Q
Subjt:  HHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQ

Query:  IEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLK------------KGKFQWNETAEKAFQQLKSAMMSV
        +E+LG+I+++DG+ ADP K+ A+ + P P +VKEL+ FLG+  YYRKF+ +Y  +A PLT L +            K     +ETA ++F  LKS + S 
Subjt:  IEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLK------------KGKFQWNETAEKAFQQLKSAMMSV

Query:  PVLGIPDFTQGFVLKTDASGVGIGAVLMQ----HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFV-VRTDQKSLKFLLEQRAIGGE
         +L  P FT+ F L TDAS   IGAVL Q      RP+A+ S++L  T    A  E++++AI+ ++   R +L G   + V TD + L F L  R    +
Subjt:  PVLGIPDFTQGFVLKTDASGVGIGAVLMQ----HQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFV-VRTDQKSLKFLLEQRAIGGE

Query:  YQRWIAKLLGYDFVIEYKKGMENKAADALSRLPP
         +RW A++  Y+  + YK G  N  ADALSR+PP
Subjt:  YQRWIAKLLGYDFVIEYKKGMENKAADALSRLPP

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.2e-8839.89Show/hide
Query:  NQLPP-PRDHD-----HAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVV
        N LPP P D +     H IE++ GAR   ++PY   +  + EI K+V+K+L  K I PSKS  SSPV+LV KKDG++R CVDYR LN ATI D +P+P +
Subjt:  NQLPP-PRDHD-----HAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVV

Query:  DELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMV
        D LL  +  A IF+ +DL SGYH I +   D +KTAF T  G YE+ VMPFGL NAP+TF   M D  R    +FV V+ DDILI+S S EEH   L  V
Subjt:  DELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMV

Query:  LETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAF
        LE L    L+   KKC+FA ++ E+LG+ I    +A    K AA+  +P PK VK+ + FLG+  YYR+F+ N   IA P+ QL    K QW E  +KA 
Subjt:  LETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAF

Query:  QQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP------VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLK
         +LK A+ + PVL   +    + L TDAS  GIGAVL +          V +FS++L    +     E +L+ I++A+  +R  L GK F +RTD  SL 
Subjt:  QQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP------VAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLK

Query:  FLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRL-----------------------PPLFELGLISVVGRLNPLIFIDQVTGNEALNSIR
         L  +       QRW+  L  YDF +EY  G +N  ADA+SR                         PL    LI +       +  + ++   A  S +
Subjt:  FLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRL-----------------------PPLFELGLISVVGRLNPLIFIDQVTGNEALNSIR

Query:  LSLINGQPTPEGYSLQGEVLCYHDRLVLP
          L   +   + YSL+ E++ Y DRLV+P
Subjt:  LSLINGQPTPEGYSLQGEVLCYHDRLVLP

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein6.9e-1229.93Show/hide
Query:  LVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVN--LGV
        ++  +  K ++  G I + +VVV +D GA  NFI  E+   LK+P    +   V+LG    +++ G C  + L +  + IT +FL L L   DV+  LG 
Subjt:  LVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVN--LGV

Query:  TWLETLGKVIFNYKLSEMEFSLG-EFLVILQGNKSL--VKSQVSLKS
         WL  LG+ + N++  +  FS   +++ +   ++ L  V ++V +KS
Subjt:  TWLETLGKVIFNYKLSEMEFSLG-EFLVILQGNKSL--VKSQVSLKS

AT3G30770.1 Eukaryotic aspartyl protease family protein6.5e-1028.89Show/hide
Query:  IANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSA
        I  +   S   F+  K ++  G I   +VVV++D GA +NFIS+E+   LK+P  T +   V+LG    ++  G C  +NL +  + I  +FL L L   
Subjt:  IANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSA

Query:  DVNL-----GVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSL--VKSQVSLKSMIFKKEDQGVLIELSTVEQGGAE
        DV++     G   LE    +  N   S   F   +++ +   +K L  V ++V +KS  +++E     +E   V +GG++
Subjt:  DVNL-----GVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSL--VKSQVSLKSMIFKKEDQGVLIELSTVEQGGAE

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding4.7e-0828.79Show/hide
Query:  FGVPCAPNQREIPLFDMRLRKLEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKF
        FG    P Q  + +       LEG+   W K    +    SW+EF+ ++    K + + +       +QQE +VREYR R E        +    LE+ F
Subjt:  FGVPCAPNQREIPLFDMRLRKLEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKF

Query:  VCGLREEIQSEIRKLNPVGLEAKMLMAQVIED
        + GL+  +Q+ +R+L P G+   M  AQ +E+
Subjt:  VCGLREEIQSEIRKLNPVGLEAKMLMAQVIED

ATMG00850.1 DNA/RNA polymerases superfamily protein5.3e-0453.85Show/hide
Query:  QKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSW
        ++  ++  + +ML A+IIQPS S +SSPVLLV+KKDG W
Subjt:  QKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein2.0e-3555.22Show/hide
Query:  LHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLG--HIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQ
        ++ L MVL+    H+  AN KKC F   QI YLG  HIIS +GV+ADP K+ AMV WP PKN  ELRGFLGL GYYR+FV NYG I  PLT+LLKK   +
Subjt:  LHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLG--HIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQ

Query:  WNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLK
        W E A  AF+ LK A+ ++PVL +PD    FV +
Subjt:  WNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGTGATGGGAAGTGCTTCCAAAGGATCCACGTCTAGCGAAAGAACAATTGCAGACAAGGGAAAGGGTATAAAGGATGAAGAACAAGCTCATGAAGAAAAA
AGCGAAGTGAAAACTAACCCGGTTACGAATGAAGGATTTGGAGTACCTTGTGCACCAAACCAGCGGGAGATACCCCTCTTCGACATGAGATTAAGGAAATTAGAG
GGAGAGGCGTTGGACTGGTATAAGTGGAAAGATGATCGAACGAAGATTAATAGTTGGGAGGAATTCCGGCAGCTTTTATGGGCAAGATTCAAGCCATCTGGTCAG
GGGGATAAGCACGCGAGGTCGATGAAACTGCAGCAAGAAACCACCGTAAGGGAATATCGGCGACGAGTTGAGCAGTTTTCGACCAGCCTTAAGGATATGAGCGAT
GCAGCCCTAGAAAGTAAATTTGTGTGTGGGCTAAGGGAAGAAATCCAAAGTGAGATTCGTAAATTGAATCCAGTGGGCCTGGAAGCAAAAATGTTAATGGCCCAA
GTAATAGAAGATGACCAAGTAGTCCAATTAAAAAGAATACATGGGGTGGGTGCAAACCCTAGTTTGTTAAATAAAACATCTGGTAATGGACCAAATGGGTCAGGA
AATCAAACCGGGTCAAAGGTAATGGATCGGGTTGCAACATCAAGAACCATTACGATTAATCCTAGCCGAAATTCGTCTTCATCATCGACAACAATCACTTCCCCT
CACGATTTTAACGTGAAAAACTCGATGGCTCAACCTTATCGCCGAATGACTGACAGTGAAATGAGAATGAAGAAGGAAAAGGGGCTATGTTTTCGATGTGACGAG
AAATTCAATCCGGGGCCCCAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAGAGGAGACTGAAGATAAGAATGAGCAAATCAATACTGAGATTGCAAAT
TTGTCTTTACATTCGTTGGTAGGTTTTAGTTCTCCTAAAACCATAAAAATAAAAGGCGAAATCAGGAATTGCGAAGTTGTCGTGTTAGTCGATGGGGGAGCTATA
CATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAGTGGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACA
GGAATGTGTAAGAGTGTGAATCTGACAATCGTTAATTTATCGATCACTCATGATTTCCTTCCTCTACCACTTGGGAGTGCAGATGTTAATTTAGGAGTTACATGG
TTGGAAACTTTGGGGAAAGTAATTTTCAATTACAAGTTATCAGAGATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGAACAAAAGCCTCGTAAAA
TCACAGGTGTCACTAAAATCAATGATTTTTAAGAAGGAAGATCAAGGAGTGTTAATCGAGTTGAGTACAGTCGAACAAGGGGGAGCGGAAGAGTCAAAGGATAAT
TTAGCGGACTGTCTGAGTAATCTAAAACCCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCATAAATCAGTTGCCACCGCCTCGCGATCAT
GATCATGCTATTGAATTAGAGTCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCGCTATCCCCAATTCCAGAAAGATGAAATTGAGAAATTAGTAAAGAAAATG
TTGTTAGCCAAAATTATTCAGCCAAGCAAAAGTGCATTTTCAAGTCCGGTGCTCCTTGTAAAGAAGAAAGATGGTAGCTGGCGATTCTGCGTGGATTACCGAGCA
CTGAATCTTGCCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACAATATTTTCAAAAATTGATCTAAAGTCA
GGCTATCACCATATTAGAGTGCGAGCCACCGATGTTCACAAAACGGCGTTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAAT
GCTCCGACCACTTTTCAATCAGTAATGAATGATATTCTCCGCCCGTACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAGCTCACTAGAA
GAACATTTGCACCAATTGGCCATGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTAAAAAAATGTCAATTCGCAGTAGATCAAATTGAATATTTG
GGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAGCGGCTATGGTTAAATGGCCAGCACCCAAGAATGTGAAGGAATTACGAGGTTTCTTG
GGTCTTCCAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTGCCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCA
GAAAAAGCGTTCCAGCAGTTAAAATCTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTACCCAAGGTTTTGTGCTGAAAACTGATGCTTCAGGTGTT
GGTATTGGGGCTGTCCTAATGCAGCATCAACGTCCAGTGGCTTTTTTCAGTCAAGCTTTGCCCATTACCCATAGGTTTAAAGCTGTGTATGAACGGAAACTTATG
GCAATCGTGCGGGCTGTTCAAAAATGGCGCCCACATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATCAGAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCC
ATTGGGGGAGAATATCAAAGATGGATCGCTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAACAAAGCGGCTGATGCTTTATCACGA
TTACCACCGCTATTTGAATTGGGTCTCATAAGTGTTGTGGGCAGGCTCAATCCTTTGATTTTCATTGATCAAGTGACTGGGAATGAAGCTCTGAATAGTATTCGG
TTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCGCTACAAGGAGAGGTTCTATGCTATCACGACCGGTTGGTTCTTCCGGAGGATTCCCCTACTATC
CCCTTATTATTAGCGGAGTTTCATAACTGCCCGATAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGTGATGGGAAGTGCTTCCAAAGGATCCACGTCTAGCGAAAGAACAATTGCAGACAAGGGAAAGGGTATAAAGGATGAAGAACAAGCTCATGAAGAAAAA
AGCGAAGTGAAAACTAACCCGGTTACGAATGAAGGATTTGGAGTACCTTGTGCACCAAACCAGCGGGAGATACCCCTCTTCGACATGAGATTAAGGAAATTAGAG
GGAGAGGCGTTGGACTGGTATAAGTGGAAAGATGATCGAACGAAGATTAATAGTTGGGAGGAATTCCGGCAGCTTTTATGGGCAAGATTCAAGCCATCTGGTCAG
GGGGATAAGCACGCGAGGTCGATGAAACTGCAGCAAGAAACCACCGTAAGGGAATATCGGCGACGAGTTGAGCAGTTTTCGACCAGCCTTAAGGATATGAGCGAT
GCAGCCCTAGAAAGTAAATTTGTGTGTGGGCTAAGGGAAGAAATCCAAAGTGAGATTCGTAAATTGAATCCAGTGGGCCTGGAAGCAAAAATGTTAATGGCCCAA
GTAATAGAAGATGACCAAGTAGTCCAATTAAAAAGAATACATGGGGTGGGTGCAAACCCTAGTTTGTTAAATAAAACATCTGGTAATGGACCAAATGGGTCAGGA
AATCAAACCGGGTCAAAGGTAATGGATCGGGTTGCAACATCAAGAACCATTACGATTAATCCTAGCCGAAATTCGTCTTCATCATCGACAACAATCACTTCCCCT
CACGATTTTAACGTGAAAAACTCGATGGCTCAACCTTATCGCCGAATGACTGACAGTGAAATGAGAATGAAGAAGGAAAAGGGGCTATGTTTTCGATGTGACGAG
AAATTCAATCCGGGGCCCCAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAGAGGAGACTGAAGATAAGAATGAGCAAATCAATACTGAGATTGCAAAT
TTGTCTTTACATTCGTTGGTAGGTTTTAGTTCTCCTAAAACCATAAAAATAAAAGGCGAAATCAGGAATTGCGAAGTTGTCGTGTTAGTCGATGGGGGAGCTATA
CATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAGTGGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACA
GGAATGTGTAAGAGTGTGAATCTGACAATCGTTAATTTATCGATCACTCATGATTTCCTTCCTCTACCACTTGGGAGTGCAGATGTTAATTTAGGAGTTACATGG
TTGGAAACTTTGGGGAAAGTAATTTTCAATTACAAGTTATCAGAGATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGAACAAAAGCCTCGTAAAA
TCACAGGTGTCACTAAAATCAATGATTTTTAAGAAGGAAGATCAAGGAGTGTTAATCGAGTTGAGTACAGTCGAACAAGGGGGAGCGGAAGAGTCAAAGGATAAT
TTAGCGGACTGTCTGAGTAATCTAAAACCCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCATAAATCAGTTGCCACCGCCTCGCGATCAT
GATCATGCTATTGAATTAGAGTCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCGCTATCCCCAATTCCAGAAAGATGAAATTGAGAAATTAGTAAAGAAAATG
TTGTTAGCCAAAATTATTCAGCCAAGCAAAAGTGCATTTTCAAGTCCGGTGCTCCTTGTAAAGAAGAAAGATGGTAGCTGGCGATTCTGCGTGGATTACCGAGCA
CTGAATCTTGCCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACAATATTTTCAAAAATTGATCTAAAGTCA
GGCTATCACCATATTAGAGTGCGAGCCACCGATGTTCACAAAACGGCGTTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAAT
GCTCCGACCACTTTTCAATCAGTAATGAATGATATTCTCCGCCCGTACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAGCTCACTAGAA
GAACATTTGCACCAATTGGCCATGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTAAAAAAATGTCAATTCGCAGTAGATCAAATTGAATATTTG
GGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAGCGGCTATGGTTAAATGGCCAGCACCCAAGAATGTGAAGGAATTACGAGGTTTCTTG
GGTCTTCCAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTGCCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCA
GAAAAAGCGTTCCAGCAGTTAAAATCTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTACCCAAGGTTTTGTGCTGAAAACTGATGCTTCAGGTGTT
GGTATTGGGGCTGTCCTAATGCAGCATCAACGTCCAGTGGCTTTTTTCAGTCAAGCTTTGCCCATTACCCATAGGTTTAAAGCTGTGTATGAACGGAAACTTATG
GCAATCGTGCGGGCTGTTCAAAAATGGCGCCCACATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATCAGAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCC
ATTGGGGGAGAATATCAAAGATGGATCGCTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAACAAAGCGGCTGATGCTTTATCACGA
TTACCACCGCTATTTGAATTGGGTCTCATAAGTGTTGTGGGCAGGCTCAATCCTTTGATTTTCATTGATCAAGTGACTGGGAATGAAGCTCTGAATAGTATTCGG
TTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCGCTACAAGGAGAGGTTCTATGCTATCACGACCGGTTGGTTCTTCCGGAGGATTCCCCTACTATC
CCCTTATTATTAGCGGAGTTTCATAACTGCCCGATAGGATGA
Protein sequenceShow/hide protein sequence
MGVMGSASKGSTSSERTIADKGKGIKDEEQAHEEKSEVKTNPVTNEGFGVPCAPNQREIPLFDMRLRKLEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQ
GDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSG
NQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGEDLSGEIDKVAEETEDKNEQINTEIAN
LSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTW
LETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMIFKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDH
DHAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKS
GYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYSSLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYL
GHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGV
GIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR
LPPLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG