; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014767 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014767
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold3:38312916..38315839
RNA-Seq ExpressionSpg014767
SyntenySpg014767
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.5e-6525.83Show/hide
Query:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI
        RS  I+RK F +  D++ + +   +TE   + + SI +S + L W+  +  +++ +P S++FF + R  E+ +W+ K  N  G   EI +V +   +  I
Subjt:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI

Query:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W
        L+P    K  W SF S+I+  P    +  T+P     S P F+      PP      S    V E  +S+S+ + +                        
Subjt:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W

Query:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW
        +  +VL R   HDDW  I Q+L        + N FHA K ++H      A  LC +  WT +GK+ ++F     AS     + PSYGGW     +P  LW
Subjt:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW

Query:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI
            F+ IG  CGG ++ +  T       EA++K+R N +GF+PA VK+         V +  H  G         + G                S Q +
Subjt:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI

Query:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------
            + I+ ++ N  S   K I  ++    +  I K +K  +SP  + +  V    +H + +  K+ +   IS    +   L+KGK  ++ P        
Subjt:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------

Query:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG
                          F  P+S+    +     R           ST +P L         ++ P         AHD + S + + LT+         
Subjt:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG

Query:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C
        P  +  +H           T   ++   P L+      +   P  + R   H   R   +    + +  TNS     Q L   L ++GL           
Subjt:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C

Query:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH
          +   +   +G+S GGILI+W     S+    +G FSLS +    +N S+WL+ +YGP +  +R   W +LH+L  L    WI+GGD NV R   E + 
Subjt:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH

Query:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK
            + S  + N +I++  LID PL N  YTWS+       S +DRFL        F       L R TSDH+P  C  S   L WGP PFR  +  L  
Subjt:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK

Query:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE
          F+  ME WW  +   G PG  F+ +LK L + I+ W   +  S      +++ ++  +D  E
Subjt:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa]2.6e-7026.7Show/hide
Query:  NHSTRSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGS
        N   R  S+++K F ++ D+  R S   ITE   Y S SI+++  SL+WL ++F  ++++P + +FF + R  ++ LW++ ++N+ G+  EI +V + G 
Subjt:  NHSTRSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGS

Query:  RQRILIPSENNKQGWFSFFSLIS-EYPAEAHRQPT----------QPSPPSFKDILQTKPPTAAITPSLKGPVKEASVSTHAEEWKEIIVLQRCNQHDDW
        +  IL+P   +K GW  F  +++ +  ++    PT          +    S+     ++ P      ++      +S  +   + K    L+R   HDDW
Subjt:  RQRILIPSENNKQGWFSFFSLIS-EYPAEAHRQPT----------QPSPPSFKDILQTKPPTAAITPSLKGPVKEASVSTHAEEWKEIIVLQRCNQHDDW

Query:  PSIHQSLINGLSLRCS---INPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFRFIGDIC
          I   L +    + S     PFHA+KA+L + D+  A  LC +  WT +G   +KF   +  +     + PSYGGW     +P  +W  + F  IG+  
Subjt:  PSIHQSLINGLSLRCS---INPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFRFIGDIC

Query:  GGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAGVDLTV----HIRG---------ISGSPQRIVHINDKINEEIP-----NMASKDIVFK
        GGF++ +  +   +  TEA IKV+ N TGF+PA +++  D  G D  +    H +G         I GS  +    N   NE  P            V  
Subjt:  GGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAGVDLTV----HIRG---------ISGSPQRIVHINDKINEEIP-----NMASKDIVFK

Query:  KREESECSIAKSKMISSPAVMPKISVPVHISPSPPKISVP-EHISPPPPSSDQLNKGKLPLEAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGSDT
        K +  E S   SK +++   M         + +   +++  E  S    SS+++ K             S + +  +   + CG       P   + +  
Subjt:  KREESECSIAKSKMISSPAVMPKISVPVHISPSPPKISVP-EHISPPPPSSDQLNKGKLPLEAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGSDT

Query:  EAFLSSPSTN------HSAHHSAHDPNSPRSMDLTIFNESQTDGPLDNLPNHPYQTFP------------------------SLIVTLPPLQ--------
        +   SSP         HSA        SP S       E+++  P  ++    Y+                            L+V L  +         
Subjt:  EAFLSSPSTN------HSAHHSAHDPNSPRSMDLTIFNESQTDGPLDNLPNHPYQTFP------------------------SLIVTLPPLQ--------

Query:  --EQPNH-NNPPKPLE---------SLRLSPHPSPRFSPPPNTQAPTNSFPHCLQH-LAPLLSKHGLCIMA-LPTVPKSMGASGGILIMWSEPEFSV---
          E P++  +P  P E         S+    H         N +  T       +  L   L ++ L + A   +   S+     I I+   P   V   
Subjt:  --EQPNH-NNPPKPLE---------SLRLSPHPSPRFSPPPNTQAPTNSFPHCLQH-LAPLLSKHGLCIMA-LPTVPKSMGASGGILIMWSEPEFSV---

Query:  -KETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNG
         ++ I G FS+SI +   +  S+WLSAIYGP++  +R  FW EL +L  +    WILGGDFNV RW  E S   P + SM+ FN +I++ +LID PL N 
Subjt:  -KETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNG

Query:  CYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLK-
         +TWS+       S +DRFL +    N F       L R TSDH+P  L    +SWGP PFRF N++LK   ++  +E WW   +  G+ G+ FM +LK 
Subjt:  CYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLK-

Query:  ------------GLKSEIRKWNLS-QPSS
                    GLK  ++ +N++ QP+S
Subjt:  ------------GLKSEIRKWNLS-QPSS

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.0e-6725.65Show/hide
Query:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI
        RS  ++RK F +  D++ + +   +TE   + + SI +S + L W+  +  +++ +P +++FF + R  E  +W+ K  N  G   EI +V     +  I
Subjt:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI

Query:  LIPSENNKQGWFSFFSLIS-EYPAEAHRQPT---------QPSPP------SFKDILQTKPPTAAITPSLKGPVKEASVST--------HAEEWKEIIVL
        L+P   +K GW SF S+I+ +   +A  +PT         + SPP      S+   +    P A    S      ++S S+         ++  +  +V+
Subjt:  LIPSENNKQGWFSFFSLIS-EYPAEAHRQPT---------QPSPP------SFKDILQTKPPTAAITPSLKGPVKEASVST--------HAEEWKEIIVL

Query:  QRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFR
         R   HDDW  I Q+L        + N FHA KA++H      A  LC +  W+ +GK+ ++F   +        + PSYGGW     +P  LW    F+
Subjt:  QRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFR

Query:  FIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISGSPQRIVHIN-DKINEE-----------
         IG  C G ++ +  T       EARIKVR N +GF+PA V++  +      V +  H  G         + G+ +R    + D  N E           
Subjt:  FIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISGSPQRIVHIN-DKINEE-----------

Query:  -IPNMASKDIVFKKREESECSIAKSKMISSP---AVMPKI-------SVPVHISPSPPKISVPEHISP---------------PPPSSDQLNKGKLPLEA
          P+  S     +K    +   A   +I  P   A +P            +H + +  K+ +   IS                 P S+  L+K K  +  
Subjt:  -IPNMASKDIVFKKREESECSIAKSKMISSP---AVMPKI-------SVPVHISPSPPKISVPEHISP---------------PPPSSDQLNKGKLPLEA

Query:  PFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGS-DTEAFLSSPSTNHSAHHS------------AHDPN-SPRSMDLTI-FNESQTDGPLDNLPNHPY
          P  +++I            ++ S  K   +      +   SS   N  A+ +            AHD + + + + LT+   +     P  +L +H  
Subjt:  PFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGS-DTEAFLSSPSTNHSAHHS------------AHDPN-SPRSMDLTI-FNESQTDGPLDNLPNHPY

Query:  Q--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR--FSPPPNTQAPTNSFPHCLQHLAPLLSKHGLCI--------------------
                 T   ++   P ++   N N+      + R   H   R  +      +          + L   L K+GL +                    
Subjt:  Q--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR--FSPPPNTQAPTNSFPHCLQHLAPLLSKHGLCI--------------------

Query:  -----MALPTVPKSM--------------GASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNW
             +    + KS+              G+SGGILI+W     S+    +GLFSLS + +L +N S+WL+ +YGP +  +R  FW ELH+L  L    W
Subjt:  -----MALPTVPKSM--------------GASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNW

Query:  ILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGD
        ILGGD NV R   E +     + + R+ N +I++  LID PL N  +TWS+       S IDRFL   +  N F       L R TSDH+P  C  S   
Subjt:  ILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGD

Query:  LSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKW
        LSWGP PFR  +  L    F+  M  WW  +   G+PG  F+ +LK L + I+ W
Subjt:  LSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKW

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.5e-6525.83Show/hide
Query:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI
        RS  I+RK F +  D++ + +   +TE   + + SI +S + L W+  +  +++ +P S++FF + R  E+ +W+ K  N  G   EI +V +   +  I
Subjt:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI

Query:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W
        L+P    K  W SF S+I+  P    +  T+P     S P F+      PP      S    V E  +S+S+ + +                        
Subjt:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W

Query:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW
        +  +VL R   HDDW  I Q+L        + N FHA K ++H      A  LC +  WT +GK+ ++F     AS     + PSYGGW     +P  LW
Subjt:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW

Query:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI
            F+ IG  CGG ++ +  T       EA++K+R N +GF+PA VK+         V +  H  G         + G                S Q +
Subjt:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI

Query:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------
            + I+ ++ N  S   K I  ++    +  I K +K  +SP  + +  V    +H + +  K+ +   IS    +   L+KGK  ++ P        
Subjt:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------

Query:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG
                          F  P+S+    +     R           ST +P L         ++ P         AHD + S + + LT+         
Subjt:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG

Query:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C
        P  +  +H           T   ++   P L+      +   P  + R   H   R   +    + +  TNS     Q L   L ++GL           
Subjt:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C

Query:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH
          +   +   +G+S GGILI+W     S+    +G FSLS +    +N S+WL+ +YGP +  +R   W +LH+L  L    WI+GGD NV R   E + 
Subjt:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH

Query:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK
            + S  + N +I++  LID PL N  YTWS+       S +DRFL        F       L R TSDH+P  C  S   L WGP PFR  +  L  
Subjt:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK

Query:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE
          F+  ME WW  +   G PG  F+ +LK L + I+ W   +  S      +++ ++  +D  E
Subjt:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]4.6e-6747.1Show/hide
Query:  LLSKHGLCIMALPTVPKSMGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTR
        L S HG+   AL     + G + GILI+W++P+    E I+G+FSL+I+  L+D F FW+S IYGPS       FW EL DL+ L  ++WIL GDFNVTR
Subjt:  LLSKHGLCIMALPTVPKSMGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTR

Query:  WSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFEN
        WSWEKS+GRP+T+SM +FN +I D  LID PL NG +TWS    N   SLID FL+T+ C++K G+    R+ R TSDH+P  L FG  +WG  PFRFEN
Subjt:  WSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFEN

Query:  SWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ----PSSADQLPSLVSQLKLLDDTEDMVP
         WL   +F+  +E WW    + GWPGHG MMKLK LK  I+ W         S  + L +L++ L  L+ ++ + P
Subjt:  SWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ----PSSADQLPSLVSQLKLLDDTEDMVP

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein1.2e-6525.83Show/hide
Query:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI
        RS  I+RK F +  D++ + +   +TE   + + SI +S + L W+  +  +++ +P S++FF + R  E+ +W+ K  N  G   EI +V +   +  I
Subjt:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI

Query:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W
        L+P    K  W SF S+I+  P    +  T+P     S P F+      PP      S    V E  +S+S+ + +                        
Subjt:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W

Query:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW
        +  +VL R   HDDW  I Q+L        + N FHA K ++H      A  LC +  WT +GK+ ++F     AS     + PSYGGW     +P  LW
Subjt:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW

Query:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI
            F+ IG  CGG ++ +  T       EA++K+R N +GF+PA VK+         V +  H  G         + G                S Q +
Subjt:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI

Query:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------
            + I+ ++ N  S   K I  ++    +  I K +K  +SP  + +  V    +H + +  K+ +   IS    +   L+KGK  ++ P        
Subjt:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------

Query:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG
                          F  P+S+    +     R           ST +P L         ++ P         AHD + S + + LT+         
Subjt:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG

Query:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C
        P  +  +H           T   ++   P L+      +   P  + R   H   R   +    + +  TNS     Q L   L ++GL           
Subjt:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C

Query:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH
          +   +   +G+S GGILI+W     S+    +G FSLS +    +N S+WL+ +YGP +  +R   W +LH+L  L    WI+GGD NV R   E + 
Subjt:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH

Query:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK
            + S  + N +I++  LID PL N  YTWS+       S +DRFL        F       L R TSDH+P  C  S   L WGP PFR  +  L  
Subjt:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK

Query:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE
          F+  ME WW  +   G PG  F+ +LK L + I+ W   +  S      +++ ++  +D  E
Subjt:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE

A0A5A7TTA1 DUF4283 domain-containing protein1.3e-7026.7Show/hide
Query:  NHSTRSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGS
        N   R  S+++K F ++ D+  R S   ITE   Y S SI+++  SL+WL ++F  ++++P + +FF + R  ++ LW++ ++N+ G+  EI +V + G 
Subjt:  NHSTRSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGS

Query:  RQRILIPSENNKQGWFSFFSLIS-EYPAEAHRQPT----------QPSPPSFKDILQTKPPTAAITPSLKGPVKEASVSTHAEEWKEIIVLQRCNQHDDW
        +  IL+P   +K GW  F  +++ +  ++    PT          +    S+     ++ P      ++      +S  +   + K    L+R   HDDW
Subjt:  RQRILIPSENNKQGWFSFFSLIS-EYPAEAHRQPT----------QPSPPSFKDILQTKPPTAAITPSLKGPVKEASVSTHAEEWKEIIVLQRCNQHDDW

Query:  PSIHQSLINGLSLRCS---INPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFRFIGDIC
          I   L +    + S     PFHA+KA+L + D+  A  LC +  WT +G   +KF   +  +     + PSYGGW     +P  +W  + F  IG+  
Subjt:  PSIHQSLINGLSLRCS---INPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFRFIGDIC

Query:  GGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAGVDLTV----HIRG---------ISGSPQRIVHINDKINEEIP-----NMASKDIVFK
        GGF++ +  +   +  TEA IKV+ N TGF+PA +++  D  G D  +    H +G         I GS  +    N   NE  P            V  
Subjt:  GGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAGVDLTV----HIRG---------ISGSPQRIVHINDKINEEIP-----NMASKDIVFK

Query:  KREESECSIAKSKMISSPAVMPKISVPVHISPSPPKISVP-EHISPPPPSSDQLNKGKLPLEAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGSDT
        K +  E S   SK +++   M         + +   +++  E  S    SS+++ K             S + +  +   + CG       P   + +  
Subjt:  KREESECSIAKSKMISSPAVMPKISVPVHISPSPPKISVP-EHISPPPPSSDQLNKGKLPLEAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGSDT

Query:  EAFLSSPSTN------HSAHHSAHDPNSPRSMDLTIFNESQTDGPLDNLPNHPYQTFP------------------------SLIVTLPPLQ--------
        +   SSP         HSA        SP S       E+++  P  ++    Y+                            L+V L  +         
Subjt:  EAFLSSPSTN------HSAHHSAHDPNSPRSMDLTIFNESQTDGPLDNLPNHPYQTFP------------------------SLIVTLPPLQ--------

Query:  --EQPNH-NNPPKPLE---------SLRLSPHPSPRFSPPPNTQAPTNSFPHCLQH-LAPLLSKHGLCIMA-LPTVPKSMGASGGILIMWSEPEFSV---
          E P++  +P  P E         S+    H         N +  T       +  L   L ++ L + A   +   S+     I I+   P   V   
Subjt:  --EQPNH-NNPPKPLE---------SLRLSPHPSPRFSPPPNTQAPTNSFPHCLQH-LAPLLSKHGLCIMA-LPTVPKSMGASGGILIMWSEPEFSV---

Query:  -KETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNG
         ++ I G FS+SI +   +  S+WLSAIYGP++  +R  FW EL +L  +    WILGGDFNV RW  E S   P + SM+ FN +I++ +LID PL N 
Subjt:  -KETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNG

Query:  CYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLK-
         +TWS+       S +DRFL +    N F       L R TSDH+P  L    +SWGP PFRF N++LK   ++  +E WW   +  G+ G+ FM +LK 
Subjt:  CYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLK-

Query:  ------------GLKSEIRKWNLS-QPSS
                    GLK  ++ +N++ QP+S
Subjt:  ------------GLKSEIRKWNLS-QPSS

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein1.2e-6525.83Show/hide
Query:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI
        RS  I+RK F +  D++ + +   +TE   + + SI +S + L W+  +  +++ +P S++FF + R  E+ +W+ K  N  G   EI +V +   +  I
Subjt:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI

Query:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W
        L+P    K  W SF S+I+  P    +  T+P     S P F+      PP      S    V E  +S+S+ + +                        
Subjt:  LIPSENNKQGWFSFFSLISEYPAEAHRQPTQP-----SPPSFKDILQTKPPTAAITPSLKGPVKE--ASVSTHAEE-----------------------W

Query:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW
        +  +VL R   HDDW  I Q+L        + N FHA K ++H      A  LC +  WT +GK+ ++F     AS     + PSYGGW     +P  LW
Subjt:  KEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLW

Query:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI
            F+ IG  CGG ++ +  T       EA++K+R N +GF+PA VK+         V +  H  G         + G                S Q +
Subjt:  TEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISG----------------SPQRI

Query:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------
            + I+ ++ N  S   K I  ++    +  I K +K  +SP  + +  V    +H + +  K+ +   IS    +   L+KGK  ++ P        
Subjt:  VHINDKINEEIPNMAS---KDIVFKKREESECSIAK-SKMISSPAVMPKISV---PVHISPSPPKISVPEHISPPPPSSDQLNKGKLPLEAP--------

Query:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG
                          F  P+S+    +     R           ST +P L         ++ P         AHD + S + + LT+         
Subjt:  ------------------FPGPESSIIQITEPTNLRCG------NIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPN-SPRSMDLTI-FNESQTDG

Query:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C
        P  +  +H           T   ++   P L+      +   P  + R   H   R   +    + +  TNS     Q L   L ++GL           
Subjt:  PLDNLPNHPYQ--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR---FSPPPNTQAPTNSFPHCLQHLAPLLSKHGL----------C

Query:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH
          +   +   +G+S GGILI+W     S+    +G FSLS +    +N S+WL+ +YGP +  +R   W +LH+L  L    WI+GGD NV R   E + 
Subjt:  IMALPTVPKSMGAS-GGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSH

Query:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK
            + S  + N +I++  LID PL N  YTWS+       S +DRFL        F       L R TSDH+P  C  S   L WGP PFR  +  L  
Subjt:  GRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFRFENSWLKK

Query:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE
          F+  ME WW  +   G PG  F+ +LK L + I+ W   +  S      +++ ++  +D  E
Subjt:  DSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ-PSSADQLPSLVSQLKLLDDTE

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein2.9e-6725.65Show/hide
Query:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI
        RS  ++RK F +  D++ + +   +TE   + + SI +S + L W+  +  +++ +P +++FF + R  E  +W+ K  N  G   EI +V     +  I
Subjt:  RSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNSGSRQRI

Query:  LIPSENNKQGWFSFFSLIS-EYPAEAHRQPT---------QPSPP------SFKDILQTKPPTAAITPSLKGPVKEASVST--------HAEEWKEIIVL
        L+P   +K GW SF S+I+ +   +A  +PT         + SPP      S+   +    P A    S      ++S S+         ++  +  +V+
Subjt:  LIPSENNKQGWFSFFSLIS-EYPAEAHRQPT---------QPSPP------SFKDILQTKPPTAAITPSLKGPVKEASVST--------HAEEWKEIIVL

Query:  QRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFR
         R   HDDW  I Q+L        + N FHA KA++H      A  LC +  W+ +GK+ ++F   +        + PSYGGW     +P  LW    F+
Subjt:  QRCNQHDDWPSIHQSLINGLSLRCSINPFHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFR

Query:  FIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISGSPQRIVHIN-DKINEE-----------
         IG  C G ++ +  T       EARIKVR N +GF+PA V++  +      V +  H  G         + G+ +R    + D  N E           
Subjt:  FIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIPAAVKLSQDLAG---VDLTVHIRG---------ISGSPQRIVHIN-DKINEE-----------

Query:  -IPNMASKDIVFKKREESECSIAKSKMISSP---AVMPKI-------SVPVHISPSPPKISVPEHISP---------------PPPSSDQLNKGKLPLEA
          P+  S     +K    +   A   +I  P   A +P            +H + +  K+ +   IS                 P S+  L+K K  +  
Subjt:  -IPNMASKDIVFKKREESECSIAKSKMISSP---AVMPKI-------SVPVHISPSPPKISVPEHISP---------------PPPSSDQLNKGKLPLEA

Query:  PFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGS-DTEAFLSSPSTNHSAHHS------------AHDPN-SPRSMDLTI-FNESQTDGPLDNLPNHPY
          P  +++I            ++ S  K   +      +   SS   N  A+ +            AHD + + + + LT+   +     P  +L +H  
Subjt:  PFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGS-DTEAFLSSPSTNHSAHHS------------AHDPN-SPRSMDLTI-FNESQTDGPLDNLPNHPY

Query:  Q--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR--FSPPPNTQAPTNSFPHCLQHLAPLLSKHGLCI--------------------
                 T   ++   P ++   N N+      + R   H   R  +      +          + L   L K+GL +                    
Subjt:  Q--------TFPSLIVTLPPLQEQPNHNNPPKPLESLRLSPHPSPR--FSPPPNTQAPTNSFPHCLQHLAPLLSKHGLCI--------------------

Query:  -----MALPTVPKSM--------------GASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNW
             +    + KS+              G+SGGILI+W     S+    +GLFSLS + +L +N S+WL+ +YGP +  +R  FW ELH+L  L    W
Subjt:  -----MALPTVPKSM--------------GASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNW

Query:  ILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGD
        ILGGD NV R   E +     + + R+ N +I++  LID PL N  +TWS+       S IDRFL   +  N F       L R TSDH+P  C  S   
Subjt:  ILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGD

Query:  LSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKW
        LSWGP PFR  +  L    F+  M  WW  +   G+PG  F+ +LK L + I+ W
Subjt:  LSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKW

A0A6J1E2G6 uncharacterized protein LOC1110254052.2e-6747.1Show/hide
Query:  LLSKHGLCIMALPTVPKSMGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTR
        L S HG+   AL     + G + GILI+W++P+    E I+G+FSL+I+  L+D F FW+S IYGPS       FW EL DL+ L  ++WIL GDFNVTR
Subjt:  LLSKHGLCIMALPTVPKSMGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTR

Query:  WSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFEN
        WSWEKS+GRP+T+SM +FN +I D  LID PL NG +TWS    N   SLID FL+T+ C++K G+    R+ R TSDH+P  L FG  +WG  PFRFEN
Subjt:  WSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFEN

Query:  SWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ----PSSADQLPSLVSQLKLLDDTEDMVP
         WL   +F+  +E WW    + GWPGHG MMKLK LK  I+ W         S  + L +L++ L  L+ ++ + P
Subjt:  SWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQ----PSSADQLPSLVSQLKLLDDTEDMVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCACAGAGATTCATCCATATCTGCCATGGAATCACTCTACTCGATCCATATCCATAGATAGGAAAACCTTCACAATAGCCTTTGATGAACACTTTAGAGGAAG
TAGAGCCAAGATAACCGAATACAGCAGATATTCATCTCATTCGATTTCCCTTTCTTGGAAATCTCTAAAATGGCTTGCCTTATCTTTCAACACAATTGTTCACTCACCAT
GTTCGCACAAATTCTTCTCGGATCTGAGGAGCGAAGAATACACTCTTTGGCTGGAAAAACTGAATAACAAGAATGGTTTTTATGTGGAAATTAACCAGGTGCAAAATTCT
GGTAGCCGACAAAGGATCCTTATCCCCTCGGAAAACAACAAACAAGGTTGGTTCTCTTTTTTTTCGCTCATCTCAGAATACCCTGCTGAAGCTCATCGCCAGCCCACACA
ACCATCACCTCCATCATTCAAGGACATCCTTCAAACAAAACCACCAACAGCCGCCATTACTCCTTCCTTGAAAGGGCCCGTGAAGGAAGCTTCTGTCTCCACACATGCTG
AAGAATGGAAAGAAATTATTGTTCTCCAACGATGCAATCAACATGACGACTGGCCTAGTATCCATCAATCACTAATTAACGGGTTGTCTCTTCGATGTAGCATCAACCCT
TTCCACGCTAACAAAGCCATGCTCCATGTATATGATCAAGGCACTGCTACAAACTTATGTTCTCACTCGGATTGGACCCATATTGGTAAGCATAAATTGAAGTTTTATCC
ATTAACCACTGCCTCTGCTCAACAGGATATTATGACACCATCTTATGGAGGTTGGATTGAGATTTCTCTTCTTCCCCCTACCTTATGGACTGAGCACATATTCCGTTTCA
TTGGAGATATTTGCGGCGGCTTTGTGGAAACATCTAACCTCACTAGTCGGATGATTGTTGCTACTGAAGCTAGGATAAAAGTTCGGCCAAATGTTACAGGTTTCATTCCC
GCAGCCGTCAAACTCTCACAGGACCTTGCCGGCGTTGACCTTACGGTTCATATTCGAGGAATTTCCGGCAGCCCACAGAGAATCGTTCACATTAATGATAAAATTAATGA
GGAAATTCCCAATATGGCATCTAAGGATATTGTTTTTAAGAAGAGAGAGGAATCAGAGTGTTCGATTGCTAAATCGAAAATGATCTCCTCGCCAGCAGTTATGCCTAAAA
TCTCGGTACCAGTTCATATCTCCCCCTCACCGCCTAAAATATCGGTACCGGAACATATTTCCCCTCCTCCGCCGTCATCTGATCAATTGAATAAAGGGAAGCTCCCTCTC
GAAGCGCCTTTCCCTGGGCCTGAATCATCGATTATACAAATCACAGAACCCACAAATCTTAGATGCGGCAATATTGGATCTACATCAAAGCCCAATTTGATCGTAGGATC
CGATACTGAAGCCTTCCTCTCAAGCCCATCTACCAACCATTCGGCCCACCACTCAGCTCATGATCCAAATTCCCCCCGATCCATGGACCTAACCATTTTTAATGAATCCC
AAACCGACGGCCCACTTGATAATTTACCGAACCACCCATATCAGACCTTTCCTTCCCTAATCGTTACCTTACCCCCATTACAGGAGCAGCCAAACCATAATAATCCTCCC
AAACCTCTGGAATCCTTACGGCTTTCTCCTCACCCTTCCCCACGGTTTTCTCCTCCACCAAATACCCAGGCCCCAACAAATTCCTTTCCCCATTGTCTTCAACATTTGGC
TCCTCTATTAAGCAAGCATGGTCTCTGTATCATGGCTCTTCCGACAGTACCAAAGTCAATGGGCGCCTCTGGAGGCATTCTTATTATGTGGAGTGAACCAGAATTTTCAG
TAAAGGAGACTATTCAAGGTCTTTTCTCTCTCTCTATTCATATCGTTCTGGCTGATAATTTCTCTTTTTGGCTATCGGCTATTTATGGCCCTTCTAGACATGCTGATAGA
TCGGAATTCTGGAATGAACTACACGACTTGGCTGGTTTAGGTGGTGACAATTGGATTCTTGGAGGAGATTTTAATGTCACACGTTGGTCTTGGGAAAAATCGCATGGTCG
ACCCGTGACTAGGAGTATGCGTATTTTTAACCAATGGATTGCTGACTACCATCTTATAGACACCCCTTTACAGAATGGCTGCTATACGTGGTCCAGTTGTGGTGAAAATC
ATTATTGCTCATTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGTTTTCTTCGCCTTGATAGGGTTACATCGGATCATTATCCTTGT
ACTCTATCTTTTGGGGATCTCTCTTGGGGCCCTTGCCCCTTTAGATTTGAGAATTCTTGGTTGAAAAAAGACTCTTTTCGTTGTCTTATGGAAAATTGGTGGTCACAAAA
CACCATTCAAGGTTGGCCAGGCCATGGGTTTATGATGAAGCTTAAAGGATTGAAATCTGAAATCAGAAAATGGAATTTATCTCAGCCTTCATCTGCTGATCAACTTCCAT
CTCTGGTCTCACAGTTGAAATTGTTGGATGATACAGAAGACATGGTTCCTTATCTATGGAACAAATATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAACCACAGAGATTCATCCATATCTGCCATGGAATCACTCTACTCGATCCATATCCATAGATAGGAAAACCTTCACAATAGCCTTTGATGAACACTTTAGAGGAAG
TAGAGCCAAGATAACCGAATACAGCAGATATTCATCTCATTCGATTTCCCTTTCTTGGAAATCTCTAAAATGGCTTGCCTTATCTTTCAACACAATTGTTCACTCACCAT
GTTCGCACAAATTCTTCTCGGATCTGAGGAGCGAAGAATACACTCTTTGGCTGGAAAAACTGAATAACAAGAATGGTTTTTATGTGGAAATTAACCAGGTGCAAAATTCT
GGTAGCCGACAAAGGATCCTTATCCCCTCGGAAAACAACAAACAAGGTTGGTTCTCTTTTTTTTCGCTCATCTCAGAATACCCTGCTGAAGCTCATCGCCAGCCCACACA
ACCATCACCTCCATCATTCAAGGACATCCTTCAAACAAAACCACCAACAGCCGCCATTACTCCTTCCTTGAAAGGGCCCGTGAAGGAAGCTTCTGTCTCCACACATGCTG
AAGAATGGAAAGAAATTATTGTTCTCCAACGATGCAATCAACATGACGACTGGCCTAGTATCCATCAATCACTAATTAACGGGTTGTCTCTTCGATGTAGCATCAACCCT
TTCCACGCTAACAAAGCCATGCTCCATGTATATGATCAAGGCACTGCTACAAACTTATGTTCTCACTCGGATTGGACCCATATTGGTAAGCATAAATTGAAGTTTTATCC
ATTAACCACTGCCTCTGCTCAACAGGATATTATGACACCATCTTATGGAGGTTGGATTGAGATTTCTCTTCTTCCCCCTACCTTATGGACTGAGCACATATTCCGTTTCA
TTGGAGATATTTGCGGCGGCTTTGTGGAAACATCTAACCTCACTAGTCGGATGATTGTTGCTACTGAAGCTAGGATAAAAGTTCGGCCAAATGTTACAGGTTTCATTCCC
GCAGCCGTCAAACTCTCACAGGACCTTGCCGGCGTTGACCTTACGGTTCATATTCGAGGAATTTCCGGCAGCCCACAGAGAATCGTTCACATTAATGATAAAATTAATGA
GGAAATTCCCAATATGGCATCTAAGGATATTGTTTTTAAGAAGAGAGAGGAATCAGAGTGTTCGATTGCTAAATCGAAAATGATCTCCTCGCCAGCAGTTATGCCTAAAA
TCTCGGTACCAGTTCATATCTCCCCCTCACCGCCTAAAATATCGGTACCGGAACATATTTCCCCTCCTCCGCCGTCATCTGATCAATTGAATAAAGGGAAGCTCCCTCTC
GAAGCGCCTTTCCCTGGGCCTGAATCATCGATTATACAAATCACAGAACCCACAAATCTTAGATGCGGCAATATTGGATCTACATCAAAGCCCAATTTGATCGTAGGATC
CGATACTGAAGCCTTCCTCTCAAGCCCATCTACCAACCATTCGGCCCACCACTCAGCTCATGATCCAAATTCCCCCCGATCCATGGACCTAACCATTTTTAATGAATCCC
AAACCGACGGCCCACTTGATAATTTACCGAACCACCCATATCAGACCTTTCCTTCCCTAATCGTTACCTTACCCCCATTACAGGAGCAGCCAAACCATAATAATCCTCCC
AAACCTCTGGAATCCTTACGGCTTTCTCCTCACCCTTCCCCACGGTTTTCTCCTCCACCAAATACCCAGGCCCCAACAAATTCCTTTCCCCATTGTCTTCAACATTTGGC
TCCTCTATTAAGCAAGCATGGTCTCTGTATCATGGCTCTTCCGACAGTACCAAAGTCAATGGGCGCCTCTGGAGGCATTCTTATTATGTGGAGTGAACCAGAATTTTCAG
TAAAGGAGACTATTCAAGGTCTTTTCTCTCTCTCTATTCATATCGTTCTGGCTGATAATTTCTCTTTTTGGCTATCGGCTATTTATGGCCCTTCTAGACATGCTGATAGA
TCGGAATTCTGGAATGAACTACACGACTTGGCTGGTTTAGGTGGTGACAATTGGATTCTTGGAGGAGATTTTAATGTCACACGTTGGTCTTGGGAAAAATCGCATGGTCG
ACCCGTGACTAGGAGTATGCGTATTTTTAACCAATGGATTGCTGACTACCATCTTATAGACACCCCTTTACAGAATGGCTGCTATACGTGGTCCAGTTGTGGTGAAAATC
ATTATTGCTCATTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGTTTTCTTCGCCTTGATAGGGTTACATCGGATCATTATCCTTGT
ACTCTATCTTTTGGGGATCTCTCTTGGGGCCCTTGCCCCTTTAGATTTGAGAATTCTTGGTTGAAAAAAGACTCTTTTCGTTGTCTTATGGAAAATTGGTGGTCACAAAA
CACCATTCAAGGTTGGCCAGGCCATGGGTTTATGATGAAGCTTAAAGGATTGAAATCTGAAATCAGAAAATGGAATTTATCTCAGCCTTCATCTGCTGATCAACTTCCAT
CTCTGGTCTCACAGTTGAAATTGTTGGATGATACAGAAGACATGGTTCCTTATCTATGGAACAAATATCCTTGA
Protein sequenceShow/hide protein sequence
MKTTEIHPYLPWNHSTRSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALSFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNS
GSRQRILIPSENNKQGWFSFFSLISEYPAEAHRQPTQPSPPSFKDILQTKPPTAAITPSLKGPVKEASVSTHAEEWKEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINP
FHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVRPNVTGFIP
AAVKLSQDLAGVDLTVHIRGISGSPQRIVHINDKINEEIPNMASKDIVFKKREESECSIAKSKMISSPAVMPKISVPVHISPSPPKISVPEHISPPPPSSDQLNKGKLPL
EAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIVGSDTEAFLSSPSTNHSAHHSAHDPNSPRSMDLTIFNESQTDGPLDNLPNHPYQTFPSLIVTLPPLQEQPNHNNPP
KPLESLRLSPHPSPRFSPPPNTQAPTNSFPHCLQHLAPLLSKHGLCIMALPTVPKSMGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSAIYGPSRHADR
SEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPC
TLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQPSSADQLPSLVSQLKLLDDTEDMVPYLWNKYP