; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy03g008100 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy03g008100
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr03:39743292..39747433
RNA-Seq ExpressionLcy03g008100
SyntenyLcy03g008100
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]5.2e-17733.59Show/hide
Query:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR
        DE+      L +T +E+ VI  E  P       I L +VGKVLT +  N +A +  +   W +  S    +  + +F +QF +  +K +V+   PW+FD+
Subjt:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR

Query:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE
        +L++           ++  +   FW++++N+PM  +T      +G+ +G V EV   +   W     R++V +++ +PLRR   ++          V YE
Subjt:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE

Query:  RLPDFYYNCGRLGHLKRECGHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIGLDSGKDGDKG
        RLP+F Y CG LGH++R+C     E     R  +G W++A    G +  +  EE ++ ++     AR  +        EE V      +   +    D+ 
Subjt:  RLPDFYYNCGRLGHLKRECGHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIGLDSGKDGDKG

Query:  KGILGVGEVADGDGDIGGMEVDGVE------EGQV---------EGSAEVSKSQKGKGWKR------RAPSVLTEITNTGSGSERLGPKRKTS------T
        K ++        D DI   E            G V         +GS    K +K K   R      + P ++         +   G KRK S       
Subjt:  KGILGVGEVADGDGDIGGMEVDGVE------EGQV---------EGSAEVSKSQKGKGWKR------RAPSVLTEITNTGSGSERLGPKRKTS------T

Query:  SASELQGSKRARGDTLRVVWEKETKVHSSRF--DFIRAKLGFDGCFSVDSSGRSGGLALLWDST-ISFSLISFSPNHIDG-WVDWNQFQWRLTGIYGHPQ
        S     G K+ + D   +       V +     + I+++LG+D  F VDS GRSGGL + W S  + FSL+SFS NHI G  V  N  +WR  GIYG P+
Subjt:  SASELQGSKRARGDTLRVVWEKETKVHSSRF--DFIRAKLGFDGCFSVDSSGRSGGLALLWDST-ISFSLISFSPNHIDG-WVDWNQFQWRLTGIYGHPQ

Query:  AELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFP
        A  K +TW LL+ L G    P L GGDFN +L   E EGGR      ++ FRE VD   + DLGF G  ++W  G+  +  + ER+DR L +  W D FP
Subjt:  AELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFP

Query:  QQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAE
        Q  V H+   +SDH PI++ L   ++ R  ++++  RF   WL+    +++V  +W      +S +  +  I      LV W +D +      I     E
Subjt:  QQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAE

Query:  VQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCR
        ++  L + S +  +  L E  S+L+ +L ++E YW  RSR   +  GD+NT++FH+ ASQR++RN I GL DE  +W  D  D+ RV+  Y+ ++++S  
Subjt:  VQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCR

Query:  PSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPR
        PS E ++ V+  V P ++E+MN +L +   +E++  AL Q+HP+KAPGPDGM   FY++ W IVG+D+     G+++    P+ LN T I LIPKV++P 
Subjt:  PSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPR

Query:  RVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFA
         VSEFRPISLCNV +KL++KVL +R+K +L  ++++ QSAF+PGR + DNA++  E  H+++ + KG  G+ ++KLDMSKAYD VEW FL  ++ KMGFA
Subjt:  RVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFA

Query:  PNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEASSPNTGAQTQENVSEVLH---------VNVTLC------
         +WV  +  C+S+VR+SF +NG   G V PSRGLRQGDP+SPYLF+  ++  SA++ +A A     G Q    + ++L+         +N+         
Subjt:  PNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEASSPNTGAQTQENVSEVLH---------VNVTLC------

Query:  ---------------------HQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
                             H  YLG+P    RS+        DRVW+++QGWK KL S  G+
Subjt:  ---------------------HQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]8.1e-17031.39Show/hide
Query:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEA-IDLCVVGKVLTSKKINPDAFRSVMKAAWR-VHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAF
        D L RL  +L LT++E E+     + A  ++E   DL ++G++LT K+IN +A    M   W  VHG  +V+  GD  F   F  + ++ R +  GPW F
Subjt:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEA-IDLCVVGKVLTSKKINPDAFRSVMKAAWR-VHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAF

Query:  DRSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVV
        D++L+V++   +  +   V  + C F+V +  +P   + R MA  +G+ IG  +  +  ++    G ++RLR  +N+ +PLRR   L+    E V   + 
Subjt:  DRSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVV

Query:  YERLPDFYYNCGRLGHLKRECGHEVSEGGDGRR--EQYGEWMKA--------GMML-----GEVGRRWGEEKRDGQN---PSGQG-ARESDVGVEVPRGE
        YERLP+F Y CG + H+   C  + S   + R     YGEW+KA        G++      G+V    G    +GQN   PS     +E D+GV   +G+
Subjt:  YERLPDFYYNCGRLGHLKRECGHEVSEGGDGRR--EQYGEWMKA--------GMML-----GEVGRRWGEEKRDGQN---PSGQG-ARESDVGVEVPRGE

Query:  ESVEGGQSGIGLDSGKD---GDKGKGILGVG-------------------------------EVADG-----------DGDIGGMEVDGVEEGQVEGSAE
              Q  + +    D    D  K I+  G                               E+  G           DG +G  +++G    Q     +
Subjt:  ESVEGGQSGIGLDSGKD---GDKGKGILGVG-------------------------------EVADG-----------DGDIGGMEVDGVEEGQVEGSAE

Query:  VSK--SQKGKGWKRRAPSVLTEIT-NTGSGSERLGPKR---KTSTSASELQGSKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGL
        ++   S K K  + R  +   EI+ +T      +  KR   KTS     ++G K+          E+E            +   F      +++G+SGGL
Subjt:  VSK--SQKGKGWKRRAPSVLTEIT-NTGSGSERLGPKR---KTSTSASELQGSKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGL

Query:  ALLWDSTISFSLISFSPNHIDGWVDWNQFQ--WRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDT
        ALLW   +  SL +FS NHID ++  N     WR TG YG+P   L+ ++W+LL+ L    +  WL  GDFNA+L + EK G       ++  F + +  
Subjt:  ALLWDSTISFSLISFSPNHIDGWVDWNQFQ--WRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDT

Query:  CCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQ
          + DLGF G  F+W N R    T  ER+DR  GN  W +LFP   V HLD   SDH P+L+        +   R R  +FE +WL   E + ++ ++W 
Subjt:  CCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQ

Query:  SGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNH
        +  +  +S+     +E C   L++W R   G   DRI++   ++      +  + ++ ++ +    L+ +L +EEV W+QR++ +W+  GD+NT++FH  
Subjt:  SGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNH

Query:  ASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFY
        AS RR++NTI GL +  G+W +   D+ +++SDYF+ +++S       + +V+  ++P V++ +NR+LL+ +T +++  AL  + P K+PGPDG    F+
Subjt:  ASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFY

Query:  RQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYEC
        ++ W +VG D+ ++ L +LN +E P   N T IVLIPK  NPR +++FRPISL NV YK+ SK +V+R+K  +N +I+  QSAF+P R + DN ++ YE 
Subjt:  RQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYEC

Query:  IHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLH
        +H ++   +      ++KLDMSKAYD +EW FL  +M ++GF  N++DL+  C+STV +SF +NG  +G + P RGLRQGDP+SPYLFLFC+E LSA++ 
Subjt:  IHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLH

Query:  QAEASSPNTG-----------------------------------------------------------AQTQENVSEV---LHVNVTLCHQHYLGLPAF
        Q E      G                                                             T+EN++ +   L + V   H  YLGLP+ 
Subjt:  QAEASSPNTG-----------------------------------------------------------AQTQENVSEV---LHVNVTLCHQHYLGLPAF

Query:  MPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
        + +S+  +    +DRV RR++GWK K  S GG+
Subjt:  MPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]3.0e-18033.33Show/hide
Query:  LTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDRSLLVMRAPA
        + LT EE E I         ++E+ +L ++GK LT K  N  A ++ ++ AW ++ S ++   G N+F  +F S+ +  R++  GPW+FD  LL+++   
Subjt:  LTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDRSLLVMRAPA

Query:  STGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYERLPDFYYNC
           +  ++ F   S W+QI + P    +  +A  +G+R+G+VEEV G +  +     MR+RV L + +P+RRG  +  S+  + W    YERLP F + C
Subjt:  STGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYERLPDFYYNC

Query:  GRLGHLKREC-GH-EVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSG--------IGLDSGKDGDK
        G LGH  R C GH  V + G+    QYG++++A      +G   G  K      SGQ A   +V  +      S +G Q G        + LD  ++   
Subjt:  GRLGHLKREC-GH-EVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSG--------IGLDSGKDGDK

Query:  GKGILGV-GEVADGDGDIGGMEVDGVEEGQVEGSAEVSK-SQKGKGWKRRAPSVLTEITNTGSGSERLGPKRKTS----------TSASELQGSKRARG-
         +    V  + A   G +  + ++ +E+     +   +K + +  G K +    ++   N     E +G   K++          T    L  S  + G 
Subjt:  GKGILGV-GEVADGDGDIGGMEVDGVEEGQVEGSAEVSK-SQKGKGWKRRAPSVLTEITNTGSGSERLGPKRKTS----------TSASELQGSKRARG-

Query:  DTLRVVWEKETKVHSSRFDFIRAKLG-FDGCFSVDSSGR---------------------SGGLALLWDSTISFSLISFSPNHIDGWV-DWNQFQWRLTG
        +    V  K +    +R DF    LG F     +   G+                      GGLA LW + +   +I+F+ NH+   V + + F W LTG
Subjt:  DTLRVVWEKETKVHSSRFDFIRAKLG-FDGCFSVDSSGR---------------------SGGLALLWDSTISFSLISFSPNHIDGWV-DWNQFQWRLTG

Query:  IYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMA
         YG P A+ K+ +W LLKHL+     PW++ GDFNA L   EK   R    S++  FREA+ +C + DLGFKG  ++W N RPG      R+DR + N  
Subjt:  IYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMA

Query:  WQDLFPQQEVSHLDFSRSDHRPILL---SLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSV-GLKVAIERCLASLVKWGRDKVGRY
        W D F    V HL    SDH P+LL   S   PRQ       R  +FE+ WL+  E   V+ ++W +G  +   +  ++  I+ C   L+ WG       
Subjt:  WQDLFPQQEVSHLDFSRSDHRPILL---SLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSV-GLKVAIERCLASLVKWGRDKVGRY

Query:  SDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISD
        +  IKE   ++           S+ +      +++ +L ++E+YW QRSR NWL  GDRNT++FH  ASQRR++N I G+ +  G W ++  +V +V +D
Subjt:  SDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISD

Query:  YFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMI
        YF +++ +   + +++ + +  V   VTEDM   L   FT E++  AL Q+ P KAPGPDGM+  FY++ W IVG+ ++   L  LNN      +N T I
Subjt:  YFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMI

Query:  VLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFL
        VLIPKV+NP R+SEFRPISLCNV YK+ISKVL +R+K +L ++I+  QSAF+PGR + DN +V YE +H +  + KG+ G  +LKLD+SKAYD VEW FL
Subjt:  VLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFL

Query:  EQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEAS----------------------------
        + IM KMGF   W++ +  C++T  FS  +NG  Y  ++PSRG+RQGDP+SPYLFL C+EGL+A+L++AE +                            
Subjt:  EQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEAS----------------------------

Query:  ----------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
                                          S NT    +  + E+L V        YLGLP  + R++ ++    KDRVW+++QGWKG L S  G+
Subjt:  ----------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata]1.1e-17432.38Show/hide
Query:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR
        D + RL K + LTEEE EV++        ++E   L ++G+ LT++  N  A +S++++ W++    R+   G+ +F  +F  +S+   V+ +GPW+FD 
Subjt:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR

Query:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE
        +LLV+R      +A SV F     WVQ+  +P      +    +G  +G+V EV            +R+RV + L +P+RRG  +     ++V     YE
Subjt:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE

Query:  RLPDFYYNCGRLGHLKRECGHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIG-----LDSGK
        RL    Y CG+LGH  ++C   V        + YG+W+KAG    ++G    +       P+ + ++   V +           G  GI       D+G 
Subjt:  RLPDFYYNCGRLGHLKRECGHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIG-----LDSGK

Query:  DGDKGKGILGVGEVADGDGDIGGMEVDGVEEGQVEGSAEVSKSQKG------KGWKRRAPSVL-TEITNTGSGSER-----------LGPKRKTSTSASE
        + +  K  + V +      ++    + G +  + +    VS +Q G       G +  A   L + + N     E+             P+ K+S   + 
Subjt:  DGDKGKGILGVGEVADGDGDIGGMEVDGVEEGQVEGSAEVSKSQKG------KGWKRRAPSVL-TEITNTGSGSER-----------LGPKRKTSTSASE

Query:  LQG-SKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWV-DWNQFQWRLTGIYGHPQAELKKR
        +   S   R    ++++  ETK       +I+  L +   F V S  RSGGLALLW   I   + +F+ NHID  + D     WRLTG YG P+ + K+ 
Subjt:  LQG-SKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWV-DWNQFQWRLTGIYGHPQAELKKR

Query:  TWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSH
        +W LLKHL    S PWL  GDFN IL   EK+GG  KP + +  FREA+  C +VDLG++G+ F+W NGR   + V ER+DR    + W+D F Q +V+H
Subjt:  TWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSH

Query:  LDFSRSDHRPILLSL-LPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVG-LKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSA
        L+ S SDH PIL++  + P      +++   RFE+ W  + + +N++  +W S   + S +  L   I+RC  +LV W R   G    +++E   ++   
Subjt:  LDFSRSDHRPILLSL-LPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVG-LKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSA

Query:  LCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVE
        LC  +R+ +   ++  ++ + +I+ ++E++W+QRSR  WL  GD+NT++FHN ASQRR++N I G+ D    W      + +V   YF  ++S+  P  +
Subjt:  LCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVE

Query:  EVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSE
         +  V+ +VQ  VT  MN  L +P+T +++  AL Q+HP+K+PGPDGMS  F+++ W I+G D+    L  L +    + +N T IVLIPK ++P+ +++
Subjt:  EVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSE

Query:  FRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWV
        +RPISL NV  ++ISKV+ +R+K +L  +I+  QSAF+P R + DN  V YE +H +R + +G+VG  ++KLD+SKAYD VEW FL+ IM K+GF P WV
Subjt:  FRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWV

Query:  DLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQA---------------------------------------------
        +L  + ++T  +S  ING   G +  SRG+RQGDPLSPYLFL C+EGL+A+L++A                                             
Subjt:  DLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQA---------------------------------------------

Query:  ---EAS--------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
           EA+              S NT  + +  + + +   V      YLGLP    +S+V++ K  ++++ +R+ GWK K  S  GR
Subjt:  ---EAS--------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]8.4e-17531.26Show/hide
Query:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR
        +++   WK+L LTEEE + I         S    +LC++ K+   +  N +AFR+ M   W   G    +    N++ I+F   S+K +V+   PW+FDR
Subjt:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR

Query:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE
         L+ M+      S   V F+   FW+Q+HN+P    +++M   +G+ IGRV EV    E    G  +R++  +N+ + L RG  L+ S  ++ W    YE
Subjt:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE

Query:  RLPDFYYNCGRLGHLKRECGHEVSEGGDGRR-EQYGEWMKAGMMLGEV---------------GRRWGEEKRDGQNPSG-QGARESDVGVEVPRGEESVE
        RLP F + CGR  H +  C      G D    +QYG+W++A  +  +                G  W   + +G + SG  G   S  G E P   ES E
Subjt:  RLPDFYYNCGRLGHLKRECGHEVSEGGDGRR-EQYGEWMKAGMMLGEV---------------GRRWGEEKRDGQNPSG-QGARESDVGVEVPRGEESVE

Query:  GGQSGIGLDSGK--------------DGDKGK-------------------------------------GILGVGEVADGDGDI----GGMEVDGVEEGQ
          +  +G   G                 DKG                                       ++ +   A G   I    GG+         
Subjt:  GGQSGIGLDSGK--------------DGDKGK-------------------------------------GILGVGEVADGDGDI----GGMEVDGVEEGQ

Query:  VEGSAEVSKSQKGKGWKRRA---PSVLTEITNT------------------GSGSERLGPKRKTST---SASELQGSKRARGDTLRVVWEKETKVHSSRF
        +E S+  +   KGK WKR+A   P+ L+++TN                    + ++    K+KT     S  EL    +A+   L  V+  ETK ++ R 
Subjt:  VEGSAEVSKSQKGKGWKRRA---PSVLTEITNT------------------GSGSERLGPKRKTST---SASELQGSKRARGDTLRVVWEKETKVHSSRF

Query:  DFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWV--DWNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILF
        D I+  LG++ CFSV+S G+SG LALLW  ++   +I+++  HI   +    +  QW+LTG YGHP +  +  +W LL+ L+   + PWL  GDFN I  
Subjt:  DFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWV--DWNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILF

Query:  HHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRR
          EK G  ++P  ++  FR ++  C + DLGF GDKF+W N R G +   ER+DR  GN  W  LF    VSHLD ++SDH+ +L+       +   ++ 
Subjt:  HHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRR

Query:  RIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEV
        R+ RFE  W    E + ++   W+     +        + +C   L  W R+K       +K    E+   L   ++     ++++    +  I+  E +
Subjt:  RIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEV

Query:  YWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTRED
          +QR+++ WL  GDRNT++FH  +SQR++ N+I  ++ +SG+  QDP+ + + + ++FT +++S  PS   ++  +  +Q  +T+DM   L   FT  +
Subjt:  YWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTRED

Query:  IVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKL
        +  A+  ++P  +PGPDG    F+++ W+ VG ++ +  L VLN  +    LNET+I LIPK  NP  V++FRPISLCNV YK+I+K L +R+K +L  +
Subjt:  IVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKL

Query:  IAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRG
        I+  Q+AF+PGR + DN IV +E +H ++ + KG  G+ +LKLDMSKAYD +EW FL  +MLKMGF   W++L+ KC+S V +S  +NG       P+RG
Subjt:  IAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRG

Query:  LRQGDPLSPYLFLFCSEGLSAMLHQAEAS--------------------------------------------------------------SPNTGAQTQ
        LRQGDP+SPYLF+ CSE LS +L+ AE +                                                              S NT  QT+
Subjt:  LRQGDPLSPYLFLFCSEGLSAMLHQAEAS--------------------------------------------------------------SPNTGAQTQ

Query:  ENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
        E V+ +  +  T  ++ YLGLPA + R R  S K   DRV  ++  WK KL S  G+
Subjt:  ENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

TrEMBL top hitse value%identityAlignment
A0A2N9F6L9 Reverse transcriptase domain-containing protein4.8e-18432.71Show/hide
Query:  MDELVRLWKALTLTEEEAEV--IRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWA
        M++L  LWK L+L E+E  +  + ++A P   ++ A       K  T + IN +A     K  W+      ++  GDN+  I+F   ++  RV+   PW+
Subjt:  MDELVRLWKALTLTEEEAEV--IRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWA

Query:  FDRSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPV
        +D+ L+     ++  +  ++ F++  FW+Q+HN+P+    RK+A  +G  IG+V   S  EE    G  MR+RV +++ +PL RG  + + N  E W   
Subjt:  FDRSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPV

Query:  VYERLPDFYYNCGRLGHLKRECGHEVSEGGDGRRE--QYGEWMKA--------------GMMLGEVGRRWGEEKRDGQNP--------SGQGARESDVGV
         YERLP+F Y CG   H +R+C   +S   + R    +YG W++A              G      G +W         P        S   A      +
Subjt:  VYERLPDFYYNCGRLGHLKRECGHEVSEGGDGRRE--QYGEWMKA--------------GMMLGEVGRRWGEEKRDGQNP--------SGQGARESDVGV

Query:  EVPRG--EESVEGGQSGIGLDSGKD-----------GDKGKGILGVGEVADGDGDI-GGMEVDGVEEGQVEGSAEVSKS-----------------QKGK
        EV      + V+   +G    S ++                G     E+   D  I  G  V  + + Q   S + S++                  K  
Subjt:  EVPRG--EESVEGGQSGIGLDSGKD-----------GDKGKGILGVGEVADGDGDI-GGMEVDGVEEGQVEGSAEVSKS-----------------QKGK

Query:  GWKRRAPS----------VLTEITNTGSGSE-RLGPKRKTSTSASELQGSKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALL
         WK++A S          +LTE   +    E      R+T T+  EL    R +  +   V+  ET  +    + +R  L F     V S+ + GGLAL 
Subjt:  GWKRRAPS----------VLTEITNTGSGSE-RLGPKRKTSTSASELQGSKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALL

Query:  WDSTISFSLISFSPNHIDGWV-DWNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMV
        W++ I  S+ S+S NHID  + D     WR+TG+YG P+   + +TW L++ L G     W   GDFN I+   E  G   +P+ ++  FR  +D C +V
Subjt:  WDSTISFSLISFSPNHIDGWV-DWNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMV

Query:  DLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA
        DLGF G  F+WCN R    T W R+DR + N+ W + FP+  V HLD  +SDH+ + L+  PP   R  +RR+  RFE++W+  +  +  + ++W S  +
Subjt:  DLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA

Query:  DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQR
          +   +   +  C   L  W R+  G    +I+    E++ A     +  S  +LQ    RL S+  +EE  W+QRSR  WL+ GDRNT++FH+ A+QR
Subjt:  DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQR

Query:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSW
         +RN I GL+D +G+ H     +  +   Y+  ++++ +P  +++  V+  V   VTEDMN+ L++ FT  ++  AL Q+ P KAPGPDGM   FY++ W
Subjt:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSW

Query:  EIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHAL
         +VG D+ +  L  LN+    + +N T I LIPK +NP RV+EFRPISLCNV YKLISKVL +R+K +L ++++  QSAF+PGR + DN +V +E +H +
Subjt:  EIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHAL

Query:  RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA
             GR G  +LKLDMSKAYD VEW+FLE+IM K+GF   W+ L+ +CISTV +S  +NG  +G +KPSRGLRQGDPLSPYLFL C+EGL +++ +A  
Subjt:  RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA

Query:  S--------------------------------------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRS
                                                                       S  T   TQ  +   L V +   ++ YLGLP+ + R+
Subjt:  S--------------------------------------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRS

Query:  RVSSLKFTKDRVWRRIQGWKGKLFSVGGR
        R  S    K+RVW++++GWK KL S  GR
Subjt:  RVSSLKFTKDRVWRRIQGWKGKLFSVGGR

A0A2N9GJ35 Uncharacterized protein7.9e-18734.36Show/hide
Query:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR
        +EL  L + + L+E+E   I     P   S +     ++ K+LT+K  + +AF+  ++A W   G   + S   N+F   F  + +  R+    PW FD+
Subjt:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR

Query:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEE----VWCP
         L+ +           V F+  +FW+++ N+P+    R++   +G  IGR+ EV   E     G  +R+RV +++ +PL RG ILQ S+E +     W  
Subjt:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEE----VWCP

Query:  VVYERLPDFYYNCGRLGHLKREC----GHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIGLD
          YE LP F Y CGRLGH   EC    G  +SEG  G  E++G W++A      +  R  + +R     S +G  + D            E G+S +  D
Subjt:  VVYERLPDFYYNCGRLGHLKREC----GHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIGLD

Query:  --SGKDGDKGKGILGVG-EVADGDGDIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRRAPSVLTEITNTGSGSERLGPKRKTSTSASELQGSKRARGDTL
          +  + D    + G G ++ DG      +E    E  Q+E   E      GK     AP V     +       LG       + +EL    +  G   
Subjt:  --SGKDGDKGKGILGVG-EVADGDGDIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRRAPSVLTEITNTGSGSERLGPKRKTSTSASELQGSKRARGDTL

Query:  RVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWVDWNQ-FQWRLTGIYGHPQAELKKRTWSLLKHLRGHPS
         +V+  ET+++    +++R +LG  GC  V+  G+ GGLALLWDS++  ++ S+S +HIDG V  N   +WRLTG YG+P+A L+ R+WSLL+HLR    
Subjt:  RVVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWVDWNQ-FQWRLTGIYGHPQAELKKRTWSLLKHLRGHPS

Query:  TPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILL
         PW++ GDFN I    EK G  D+  +++  FREA+  C + D+GF G +F+W N R   + V  R+DR + + AW  LFP   ++HL  + SDH  +LL
Subjt:  TPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILL

Query:  SLL--PPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDL
              PR     ++RR+ RFE  WL  +  + V+  +W+      +   +   I++C   L++W +  V      I     ++Q              +
Subjt:  SLL--PPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDL

Query:  QEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSV
           +  L  +  + E+ W+QRSR  WL+ GDRNT++FH +ASQR+K NTI GL D+   W  +P +V ++  DYF+ +++S  P    +++V+  V+  V
Subjt:  QEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSV

Query:  TEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKL
        T  MN +L++PFT+E+I  AL Q+HP+K+PGPDGMS  F+++ W IV  D+    L  L N      +N T +VLIPKV  P  +++FRPISLCNV YK+
Subjt:  TEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKL

Query:  ISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFS
        +SKVLV+RMK +L ++I+  QSAF+PGR + DN I+ +E IH L+    G     ++KLDMSKAYD VEW +L+ IM+K+GF   WV L+  C+ T  +S
Subjt:  ISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFS

Query:  FSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEAS-------------------------------------------------------
          +NG   G + P RGLRQGDPLSPYLFL C+EGLSA+L +AE                                                         
Subjt:  FSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEAS-------------------------------------------------------

Query:  -------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
               SPNT  Q++  +      +     + YLGLP  + R++  +    KDRVWRR+QGWK KL S  GR
Subjt:  -------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

A0A2N9H1U1 Reverse transcriptase domain-containing protein1.8e-18333.08Show/hide
Query:  MDELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFD
        MD L  +W+ L+L+E E +    E  P  +S +  +  +  + LT + IN +A     +  WR     R++  GDN+  I F  +++  RV+ +GPW++D
Subjt:  MDELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFD

Query:  RSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVY
        +  ++ +         ++ F+    WVQIH +P       +   +G+ +G++      E+    G  +R+ V +N+ RPL RG  + +   +EV     Y
Subjt:  RSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVY

Query:  ERLPDFYYNCGRLGHLKRECGHEVSEGGD--GRREQYGEWMKA--------------GMMLGEVGRRWGEEKRDGQNPSGQGAR----------ESDVGV
        E+L +F Y CG + H  ++C   +   G     ++Q+G WM+A              G +           ++  +  +G+  R          + D+  
Subjt:  ERLPDFYYNCGRLGHLKRECGHEVSEGGD--GRREQYGEWMKA--------------GMMLGEVGRRWGEEKRDGQNPSGQGAR----------ESDVGV

Query:  EVPRGEESVEGG-------QSGIGLDSGKDGDKGKGILGVGEVAD--------------GDGDIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRR-APSV
         +P  E +   G        +    +S K+    +         D                GD+  ME              V+K    K WKR+ + S+
Subjt:  EVPRGEESVEGG-------QSGIGLDSGKDGDKGKGILGVGEVAD--------------GDGDIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRR-APSV

Query:  LTEITNTGS------GSERLGPKRKTSTSASELQGSKRARG------DTLRVVWEK--------ETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALL
             +T S      G+E   P     T  S +  + R  G      +  R+V E+        ET +  +R + +R KL F     V    + GGL L 
Subjt:  LTEITNTGS------GSERLGPKRKTSTSASELQGSKRARG------DTLRVVWEK--------ETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALL

Query:  WDSTISFSLISFSPNHIDGWVDWNQFQ-WRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMV
        W    + ++ S+S +HID  ++      WR TG YG P+   +  +W+LL+ L    S PWL  GDFN +L   EK+GG  +   ++  FR+A+D C  +
Subjt:  WDSTISFSLISFSPNHIDGWVDWNQFQ-WRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMV

Query:  DLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA
        DLG++G  F+WCN R  S TVWER+DR L  + W + FP+  + HL  + SDH PI L   P      A+ R   RFE+VWL     +  V  +W +   
Subjt:  DLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA

Query:  DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQR
         +    ++  I  C   L KW R K G  S ++K  +A++++A  N  R +S     E +  ++ +L +EE  W QR+R  WL  GDRNTR+FH  ASQR
Subjt:  DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQR

Query:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSW
        R+RN I  L D  G+ H     + R+  +YF  ++ +  P   + N V+  + P VT DMN  L +PF R+++  A+ Q+ P KAPGPDGMS  FY+  W
Subjt:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSW

Query:  EIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHAL
        + VG D+    L  LN+    + +N T I LIPK +NP +V++FRPISLCNV YK++SKVL +R+K +L  +I++ QSAF+PGR + DN +V +E +H +
Subjt:  EIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHAL

Query:  RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA
        +T+  G+ G+ +LKLDMSKAYD VEWVFL+ +MLKMGF   WV L+ +CIS+V +S  INGS  G +KP+RGLRQGDPLSPYLFL C+EGL A     + 
Subjt:  RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA

Query:  S------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
                                       S +T    Q  + ++L V V   ++ YLGLP+F+ RSR  S    K+++W+R+QGWK K+ S  GR
Subjt:  S------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

A0A2N9HW04 Reverse transcriptase domain-containing protein2.8e-18433.08Show/hide
Query:  MDELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFD
        MD L  +W+ L+L+E E +    E  P  +S +  +  +  + LT + IN +A     +  WR     R++  GDN+  I F  +++  RV+ +GPW++D
Subjt:  MDELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFD

Query:  RSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVY
        +  ++ +         ++ F+    WVQIH +P       +   +G+ +G++      E+    G  +R+ V +N+ RPL RG  + +   +EV     Y
Subjt:  RSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVY

Query:  ERLPDFYYNCGRLGHLKRECGHEVSEGGD--GRREQYGEWMKA--------------GMMLGEVGRRWGEEKRDGQNPSGQGAR----------ESDVGV
        E+L +F Y CG + H  ++C   +   G     ++Q+G WM+A              G +           ++  +  +G+  R          + D+  
Subjt:  ERLPDFYYNCGRLGHLKRECGHEVSEGGD--GRREQYGEWMKA--------------GMMLGEVGRRWGEEKRDGQNPSGQGAR----------ESDVGV

Query:  EVPRGEESVEGG----------------QSGIGLDSGKDGDKGKGILGV-----GEVADGDGDIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRR-APSV
         +P  E +   G                +     ++    D  +G + +         +  GD+  ME              V+K    K WKR+ + S+
Subjt:  EVPRGEESVEGG----------------QSGIGLDSGKDGDKGKGILGV-----GEVADGDGDIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRR-APSV

Query:  LTEITNTGS------GSERLGPKRKTSTSASELQGSKRARG------DTLRVVWEK--------ETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALL
             +T S      G+E   P     T  S +  + R  G      +  R+V E+        ET +  +R + +R KL F     V    + GGL L 
Subjt:  LTEITNTGS------GSERLGPKRKTSTSASELQGSKRARG------DTLRVVWEK--------ETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALL

Query:  WDSTISFSLISFSPNHIDGWVDWNQFQ-WRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMV
        W    + ++ S+S +HID  ++      WR TG YG P+   +  +W+LL+ L    S PWL  GDFN +L   EK+GG  +   ++  FR+A+D C  +
Subjt:  WDSTISFSLISFSPNHIDGWVDWNQFQ-WRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMV

Query:  DLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA
        DLG++G  F+WCN R  S TVWER+DR L  + W + FP+  + HL  + SDH PI L   P      A+ R   RFE+VWL     +  V  +W +   
Subjt:  DLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA

Query:  DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQR
         +    ++  I  C   L KW R K    S ++K  +A++Q+A  N  R +S     E +  ++ +L +EE  W QR+R  WL  GDRNTR+FH  ASQR
Subjt:  DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQR

Query:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSW
        R+RN I  L D  G+ H     + R+  +YF  ++ +  P   + N V+  + P VT DMN  L +PF R+++  A+ Q+ P KAPGPDGMS  FY+  W
Subjt:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSW

Query:  EIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHAL
        + VG D+    L  LN+    + +N T I LIPK +NP +VS+FRPISLCNV YK++SKVL +R+K +L  +I++ QSAF+PGR + DN +V +E +H +
Subjt:  EIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHAL

Query:  RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA
        +T+  G+ G+ +LKLDMSKAYD VEWVFL+ +MLKMGF   WV L+ +CIS+V +S  INGS  G +KP+RGLRQGDPLSPYLFL C+EGL A     + 
Subjt:  RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA

Query:  S------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
                                       S +T    Q  + ++L V V   ++ YLGLP+F+ RSR  S    K+++W+R+QGWK K+ S  GR
Subjt:  S------------------------------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

A0A7N2LIH6 Uncharacterized protein6.9e-18332.97Show/hide
Query:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR
        +EL  LWK LT+TE E E I+  +     + E    CVV K+LT + +  +A +  M+  W+     ++   G+++F ++F    +K +V+   PW++++
Subjt:  DELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDR

Query:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE
         L++M+          ++     FWVQI N+P+   TR+    +G +IG+V EV   E+    G  +R+R+  +    L RG  + +   E  W    YE
Subjt:  SLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYE

Query:  RLPDFYYNCGRLGHLKRECGH--EVSEGGDGRREQYGEWMK------AGMMLGEVGRRWGEEKRDGQNPS-------------------GQGARESDVGV
        RLP+F Y CGRL H +++C    +    GD  R+QYG W++      +G   G +G     E+RD    +                    Q   E  +G 
Subjt:  RLPDFYYNCGRLGHLKRECGH--EVSEGGDGRREQYGEWMK------AGMMLGEVGRRWGEEKRDGQNPS-------------------GQGARESDVGV

Query:  EVPRGEESVEG---GQSGIG------------LDSGKDGDKGK-------GILGVGEVADGDGDIGGMEV-DGVEEGQVEGSAEVSKSQKGKG-------
        +    +  VE    GQ G+              DS K+  + K        + G+    D +  +   EV D V   + E  ++  +  KG G       
Subjt:  EVPRGEESVEG---GQSGIG------------LDSGKDGDKGK-------GILGVGEVADGDGDIGGMEV-DGVEEGQVEGSAEVSKSQKGKG-------

Query:  -W--KRRAPSVLTEITNTGSGSERLGPK-----------RKTSTSASELQGSKR-------------------------ARG-----------DTLR---
         W     +P  +T     G  SE LGPK           + +S +A  + GS++                          RG           D ++   
Subjt:  -W--KRRAPSVLTEITNTGSGSERLGPK-----------RKTSTSASELQGSKR-------------------------ARG-----------DTLR---

Query:  --VVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWVD--WNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGH
          +V+  ETK    +    + KLGF     V S GRSGGLALLW         S S +HID  V    +   WR TG YGHP    +  +W LL+ L   
Subjt:  --VVWEKETKVHSSRFDFIRAKLGFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPNHIDGWVD--WNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGH

Query:  PSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPI
           PWL+ GDFN I+   EK G +D+  ++++ FRE +  C ++DLGF G +F+WCNGR G +    R+DR + N AW  +FP+ +V H+  S SDH  +
Subjt:  PSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPI

Query:  LLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQS-ALCNLSRSVSRVD
         L L      R  ++R    FE++W    E K +V  +W     D S++ ++  +ERC   L +W ++  G     IK+    +Q     NL    +  +
Subjt:  LLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQS-ALCNLSRSVSRVD

Query:  LQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPS
        +Q  +  +  +   EEV WKQRSR +WL +GD+N+++FH  ASQRR++N I GL D+ G+WH+D     ++I DYF  +YSS +P+  +V+  +  +   
Subjt:  LQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPS

Query:  VTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYK
        VT +MN  L K F   ++  AL Q+HP KAPGPDGMS  FY++ W+IVG  +    L  LN+   P+ +N+T I LIPK +NP++++EFRPISLCNV YK
Subjt:  VTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYK

Query:  LISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRF
        +ISKVL +R+K +L+ +I + QSAF+PGR + DN IV +E +H++  + KG+ G  ++KLDMSKAYD VEW +LE +M KMGF   W+ LI  C+++V F
Subjt:  LISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRF

Query:  SFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEAS------------------------------------------------------
        S  ING   G   PSRGLRQGDP+SPYLFL C EGLSAM+ + E                                                        
Subjt:  SFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEAS------------------------------------------------------

Query:  --------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR
                S NT  + +E    +    +   H+ YLGLP  + R++  +    KD+V R+I GWKGKL S  GR
Subjt:  --------SPNTGAQTQENVSEVLHVNVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.1e-3125.46Show/hide
Query:  KWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQD
        K  R K+   + ++KE   + Q+     S++  R ++ +  + L+ I  ++ +     SR  +    ++  R       ++R++N I  ++++ G    D
Subjt:  KWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQD

Query:  PRDVLRVISDYFTHMYSSCRPSVEEVNKVIGN-VQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNK
        P ++   I +Y+ H+Y++   ++EE++  +     P + ++    L +P T  +IV  +  +   K+PGPDG +  FY++  E +   +++    +    
Subjt:  PRDVLRVISDYFTHMYSSCRPSVEEVNKVIGN-VQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNK

Query:  ESPEVLNETMIVLIPKV-RNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECI-HALRTKGKGRVGWASLKLD
          P    E  I+LIPK  R+  +   FRPISL N+  K+++K+L +R++  + KLI   Q  FIPG     N       I H  R K K  V    + +D
Subjt:  ESPEVLNETMIVLIPKV-RNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECI-HALRTKGKGRVGWASLKLD

Query:  MSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQ
          KA+D ++  F+ + + K+G    ++ +I         +  +NG +        G RQG PLSP LF    E L+  + Q
Subjt:  MSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQ

P08548 LINE-1 reverse transcriptase homolog1.8e-3121.8Show/hide
Query:  HSSRFDFIRAKL-GFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPN------HIDGWVDWNQFQWRLTGIY--GHPQAELKKRTWSLLKHLRGHPSTP
        H +  D  R K+ G+   F  +   +  G+A+L+   I F       +       + G   +++    +  IY   H   +  + T + + +L    S  
Subjt:  HSSRFDFIRAKL-GFDGCFSVDSSGRSGGLALLWDSTISFSLISFSPN------HIDGWVDWNQFQWRLTGIY--GHPQAELKKRTWSLLKHLRGHPSTP

Query:  WLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDL--GFKGDK-----FSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDH
         ++ GDFN  L   ++   + K   E+      +    + D+   F  +K     FS  +G       + +ID  LG+ +    F + E+    F  SDH
Subjt:  WLMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDL--GFKGDK-----FSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDH

Query:  RPILLSLLPPRQVR-YAQRRRIRR--FEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCL--------ASLVKWGRDKVGRYSDRIKEASAEVQS
          I + L   R +  + +  ++     +D W++    K +     Q+   D +   L    +  L        A L K  R++V      +K+   E  S
Subjt:  RPILLSLLPPRQVR-YAQRRRIRR--FEDVWLMYAEFKNVVSDSWQSGGADASSVGLKVAIERCL--------ASLVKWGRDKVGRYSDRIKEASAEVQS

Query:  ALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSV
              +   R ++ +  + L  I  +  +    +S+  +    ++  +   N   ++R ++ I  + + +     DP ++ +++++Y+  +YS    ++
Subjt:  ALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSV

Query:  EEVNKVIGNVQ-PSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKES----PEVLNETMIVLIPKV-R
        +E+++ +     P +++    +L +P +  +I   +  +   K+PGPDG +  FY    +   E+++   L +  N E     P    E  I LIPK  +
Subjt:  EEVNKVIGNVQ-PSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKES----PEVLNETMIVLIPKV-R

Query:  NPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECI-HALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLK
        +P R   +RPISL N+  K+++K+L +R++  + K+I   Q  FIPG     N       I H  + K K  +    L +D  KA+D ++  F+ + + K
Subjt:  NPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECI-HALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLK

Query:  MGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA
        +G    ++ LI    S    +  +NG +        G RQG PLSP LF    E L+  + + +A
Subjt:  MGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEA

P11369 LINE-1 retrotransposable element ORF2 protein6.5e-2927.33Show/hide
Query:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQ-PSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQS
        R +  I  + +E G    DP ++   I  ++  +YS+   +++E++K +   Q P + +D    L  P + ++I   +  +   K+PGPDG S  FY+  
Subjt:  RKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQ-PSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQS

Query:  WEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPK-VRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIH
         E +   + +    +      P    E  I LIPK  ++P ++  FRPISL N+  K+++K+L +R++  +  +I   Q  FIPG     N       IH
Subjt:  WEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPK-VRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIH

Query:  AL-RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQ
         + + K K  +    + LD  KA+D ++  F+ +++ + G    ++++I    S    +  +NG +   +    G RQG PLSPYLF    E L+  + Q
Subjt:  AL-RTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQ

P14381 Transposon TX1 uncharacterized 149 kDa protein7.4e-3324.14Show/hide
Query:  LMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGD----KFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPIL
        ++GGDFN  L   ++   + +  SE +  RE +    +VD+  + +     F++   R G      RIDR   +           +    FS  +   + 
Subjt:  LMGGDFNAILFHHEKEGGRDKPESELNGFREAVDTCCMVDLGFKGD----KFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPIL

Query:  LSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA------------DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALC
        +S+ P        +     F +  L    F   V D+W+   A            D   V LK+  +       ++ +   G+ +  I+  + EV     
Subjt:  LSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGGA------------DASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALC

Query:  NLSRSVSRVDLQEAESRLESILVEEEVYWK---QRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSV
         LS S  +    E   R E++   E+   +    RSR   L   DR +R+F+    ++  R  I  L  E G   +DP  +      ++ +++S   P  
Subjt:  NLSRSVSRVDLQEAESRLESILVEEEVYWK---QRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSV

Query:  EEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVS
         +  + + +  P V+E     L  P T +++  AL  +  NK+PG DG++  F++  W+ +G D  R         E P      ++ L+PK  + R + 
Subjt:  EEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVS

Query:  EFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNW
         +RP+SL +  YK+++K +  R+K +L ++I   QS  +PGR + DN  +  + +H  R  G   +  A L LD  KA+D V+  +L   +    F P +
Subjt:  EFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNW

Query:  VDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQ
        V  +    ++      IN S    +   RG+RQG PLS  L+    E    +L +
Subjt:  VDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQ

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM7.7e-1426.1Show/hide
Query:  SSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKV
        SSC   V +++  +  V  ++TE   R               +++  + +PGPDG++    + + E+    M+R    +L     P  +     V IPK 
Subjt:  SSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKV

Query:  RNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLK
           +R  +FRPIS+ +V  + ++ +L +R+   +N     +Q  F+P     DNA +       LR   K         LD+SKA+D +    +   +  
Subjt:  RNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLK

Query:  MGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLF
         G    +VD +         S + +G    E  P+RG++QGDPLSP LF
Subjt:  MGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLF

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.6e-2726.61Show/hide
Query:  PESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRP--ILLSLLPPRQVRYAQRRRIRRFEDV
        P   L  F+  +    +VD+  +G  ++W N +  +  +  ++DR + N  W   FP           SDH P  I+L  LP R       ++  R+   
Subjt:  PESELNGFREAVDTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRP--ILLSLLPPRQVRYAQRRRIRRFEDV

Query:  WLMYAEFKNVVSDSWQSG---GADASSVGLKV-AIERCLASLVKWGRDKVGRYSDRIKEASAEVQS-ALCNLSRSVSRVDLQEAESRLESILVEEEVYWK
           +  F   ++ +W+     G+   S+G  + A ++C   L + G   +   +    ++   +QS  L N S S+ RV+   A  +        E +++
Subjt:  WLMYAEFKNVVSDSWQSG---GADASSVGLKV-AIERCLASLVKWGRDKVGRYSDRIKEASAEVQS-ALCNLSRSVSRVDLQEAESRLESILVEEEVYWK

Query:  QRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRP--SVEEVNKVIGNVQPSVTED--MNRLLLKPFTRE
        Q+SR  WL  GD NTR+FH      + +N I  L  +  +  ++   V  +I  Y+TH+  S     + + V + I ++ P    D   +RL   P  +E
Subjt:  QRSRENWLSWGDRNTRWFHNHASQRRKRNTICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRP--SVEEVNKVIGNVQPSVTED--MNRLLLKPFTRE

Query:  DIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLIS
         I  A+  +  NKAPGPD  +  F+ +SW +V +  +             +  N T I LIPKV    ++S FRP+S C V YK+I+
Subjt:  DIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMRFCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLIS

AT3G42140.1 zinc ion binding;nucleic acid binding1.2e-0625.34Show/hide
Query:  FHSKSEKLRVVGSGPWAFDRSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLR
        F S+     ++  GPW+F+  + V++      S    EF    FW+QI  IP+ + T ++   +G R+G   E                    NLGR + 
Subjt:  FHSKSEKLRVVGSGPWAFDRSLLVMRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLR

Query:  RGFILQVSNEEEVWCPVVYERLPDFYYNCGRLGHLKRECGHEVSEG
           +L+            YE+L +F   CG L H   EC    ++G
Subjt:  RGFILQVSNEEEVWCPVVYERLPDFYYNCGRLGHLKRECGHEVSEG

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.2e-1442.05Show/hide
Query:  LVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGK
        +V R+K L+  LI   Q++FIPGR   DN +   E +H++R K KG  GW  LKLD+ KAYD + W +LE  ++  GF   W+  I +
Subjt:  LVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLKLDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)6.9e-1047.62Show/hide
Query:  FSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEASSPNTGAQTQENVSEVLHV
        F ING+  G V PSRGLRQGDPLSPYLF+ C+E LS +  +A+      G +   N   + H+
Subjt:  FSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEASSPNTGAQTQENVSEVLHV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAATTGGTTCGATTGTGGAAGGCGTTGACCCTAACGGAAGAGGAAGCAGAGGTTATAAGGGCTGAAGCGGTTCCGGCGGCTTTGTCTGTTGAAGCTATT
GATCTTTGTGTAGTGGGAAAAGTGTTGACGTCAAAGAAGATCAACCCGGATGCTTTTCGATCGGTCATGAAGGCTGCATGGAGGGTTCATGGGTCGACGCGAGTC
GAATCAGCCGGTGATAATATTTTTGCAATTCAGTTTCATTCCAAAAGTGAGAAGCTTAGAGTTGTGGGCAGTGGGCCGTGGGCTTTTGATAGGTCTCTGCTAGTC
ATGAGGGCACCAGCCTCAACAGGTTCAGCAATCTCTGTTGAGTTTAATGAGTGTTCGTTCTGGGTGCAGATTCACAACATTCCTATGCATTGGCAGACTAGAAAG
ATGGCTGGGTTTCTAGGGAATAGAATAGGTAGGGTGGAGGAGGTGAGTGGTATGGAAGAAAATGAGTGGACCGGGCCGATCATGAGACTCAGAGTATGTTTGAAC
TTGGGTCGACCATTACGACGGGGCTTCATACTCCAAGTTAGCAATGAGGAGGAGGTTTGGTGCCCAGTTGTATATGAGAGGCTCCCCGATTTCTACTACAACTGT
GGACGGTTAGGACATTTGAAGAGGGAGTGTGGGCATGAGGTTTCAGAGGGGGGTGATGGGCGCAGGGAGCAGTATGGGGAATGGATGAAAGCGGGGATGATGCTG
GGGGAAGTGGGTAGGAGATGGGGTGAGGAGAAAAGGGATGGACAGAATCCATCTGGGCAGGGGGCAAGGGAGAGTGATGTGGGGGTGGAGGTTCCTCGGGGGGAG
GAGTCGGTAGAAGGGGGACAGTCTGGGATTGGCTTAGATTCGGGGAAGGATGGTGATAAGGGGAAAGGGATTCTAGGGGTAGGGGAGGTGGCGGATGGGGATGGA
GACATTGGGGGGATGGAGGTGGATGGGGTAGAGGAAGGACAGGTTGAGGGGTCGGCTGAAGTGTCGAAGTCTCAAAAAGGGAAGGGTTGGAAGAGGAGAGCCCCG
AGTGTGTTGACTGAAATCACAAACACAGGTAGTGGCTCCGAGAGGTTAGGTCCCAAACGTAAGACCTCTACTAGTGCATCTGAGCTGCAGGGATCTAAGAGGGCT
AGAGGCGACACTTTGAGGGTGGTGTGGGAGAAGGAAACAAAGGTTCATTCTTCACGATTTGATTTTATTAGAGCCAAGCTGGGCTTTGATGGGTGCTTCAGTGTT
GACAGTTCAGGAAGAAGTGGTGGTTTGGCTTTGTTATGGGATTCTACGATCAGTTTCAGCCTTATTTCTTTCTCCCCGAACCATATAGATGGCTGGGTGGATTGG
AACCAGTTTCAGTGGCGTTTGACAGGTATTTATGGGCACCCTCAGGCGGAGTTGAAGAAGAGAACATGGTCCCTCCTAAAGCATCTCCGTGGGCATCCGTCTACT
CCATGGCTTATGGGAGGGGATTTTAACGCTATTTTGTTCCATCATGAGAAGGAAGGGGGGAGGGATAAACCGGAGTCTGAGCTGAATGGCTTCCGTGAGGCAGTG
GATACTTGTTGTATGGTTGATTTGGGCTTCAAGGGGGATAAGTTCTCCTGGTGTAACGGGCGCCCGGGGAGCGAAACGGTGTGGGAGCGGATTGACAGATGTTTG
GGCAATATGGCCTGGCAGGACCTGTTCCCTCAACAGGAAGTATCCCATCTAGATTTCAGCAGATCTGACCATAGGCCGATTCTGCTTTCACTCCTCCCTCCGCGC
CAGGTAAGGTATGCTCAGAGGCGTAGAATTCGTCGTTTTGAGGATGTCTGGCTGATGTATGCTGAGTTTAAGAATGTGGTGTCGGATTCATGGCAGTCAGGTGGA
GCTGATGCTTCATCAGTGGGTTTGAAGGTGGCTATTGAACGTTGCTTAGCCTCTTTGGTTAAGTGGGGTAGGGATAAGGTGGGACGGTATAGTGACCGTATCAAG
GAAGCTTCAGCCGAGGTGCAGTCTGCTTTATGTAATTTGAGTAGATCGGTTTCGAGGGTCGATTTACAGGAGGCAGAATCGAGGCTTGAGTCTATTCTGGTGGAG
GAGGAAGTTTACTGGAAGCAGAGATCCAGGGAAAACTGGTTGAGTTGGGGTGACAGGAACACTCGGTGGTTCCATAATCATGCATCGCAGAGACGGAAAAGGAAT
ACCATCTGTGGTCTTGAGGATGAGTCAGGTATGTGGCATCAGGATCCGAGAGATGTTCTTAGAGTGATCTCAGACTACTTTACACACATGTACTCCTCTTGTAGG
CCTTCGGTGGAGGAAGTAAATAAGGTCATTGGTAACGTTCAGCCGTCAGTGACAGAGGATATGAATAGGCTCTTGCTCAAACCTTTCACACGAGAGGATATTGTT
GGGGCTCTGGCGCAGATTCACCCGAATAAGGCCCCAGGTCCGGATGGAATGTCAGGTGCTTTCTACCGACAGTCTTGGGAGATAGTTGGAGAGGACATGATGAGG
TTTTGTCTGGGGGTCCTGAATAACAAGGAATCTCCAGAGGTTTTAAACGAGACGATGATTGTGTTGATACCTAAGGTGAGGAATCCTAGGCGGGTGTCTGAGTTC
CGCCCTATATCCTTGTGTAATGTGGCCTACAAGCTGATTTCTAAAGTATTAGTGAGCAGAATGAAGGGGTTGTTGAACAAGCTGATAGCCCAGAAGCAGAGTGCT
TTTATACCAGGAAGGTGTGTGGTAGATAATGCCATTGTGGGATATGAGTGCATTCATGCTCTGAGGACAAAAGGAAAGGGAAGAGTAGGTTGGGCCTCGCTAAAG
CTAGATATGAGCAAGGCATATGATCTGGTGGAGTGGGTGTTTCTGGAACAGATCATGTTGAAAATGGGGTTCGCTCCCAATTGGGTGGACCTGATTGGCAAATGC
ATTTCGACTGTCAGGTTCTCGTTCAGTATTAATGGGTCCCGTTATGGTGAAGTGAAACCGAGTAGGGGCCTTCGACAAGGAGACCCTCTGTCGCCATACCTCTTC
CTATTTTGTAGTGAGGGATTGTCTGCCATGTTGCATCAGGCAGAGGCTTCCAGCCCAAATACTGGTGCTCAGACACAGGAAAATGTGAGTGAGGTTCTCCATGTG
AATGTCACTCTCTGCCATCAGCATTATCTGGGGCTCCCAGCTTTCATGCCTCGTAGTAGAGTGAGCTCGTTGAAGTTCACTAAAGATAGAGTGTGGAGGAGAATA
CAAGGGTGGAAGGGGAAGCTTTTTTCGGTGGGTGGAAGATAA
mRNA sequenceShow/hide mRNA sequence
TGAAAAAAAGCCTTCAAAGACATTTTTACAACAAACTTCAAGGGCTAAAATGAAACTTTTTGAAAGTTCAATGGTATAACTGAAACCAACCAAAAAATATCCACG
TTTCTCTCGTCATGAGAGTTTTCTCTCGCCATGAGAGTTTTCCTCGTCATGGGGATTAGGGTTTTGTTAGGGTTTCTTTAGGGTTTTTTCGCTGGGGTCGCGTAA
GGTTGTGGGTTTTCTTAGGGTGTTGCGAGTGTGTGCAGGATCTGTGGTCTGTTCCGCTGTTGTCATACCGAAGCGTGTGGGTGCGATTGGGTTGCAATTTTCTCC
ATCGAGTAATAGCGAGAGTGTTTGTGGAGTCGGCAGGGTCCTGACCGTCTGCGGGTGAAGGGCTTGTCCCAGGGAGAAGCATCTTGCGTGTGAAGCTTGGCTCGG
GGATAAGCGCCTGCGTGTACTGGTTTTGGTGAGCTTGCGAGGGGTGAGGTTAAGGGGTACGATGGATGAATTGGTTCGATTGTGGAAGGCGTTGACCCTAACGGA
AGAGGAAGCAGAGGTTATAAGGGCTGAAGCGGTTCCGGCGGCTTTGTCTGTTGAAGCTATTGATCTTTGTGTAGTGGGAAAAGTGTTGACGTCAAAGAAGATCAA
CCCGGATGCTTTTCGATCGGTCATGAAGGCTGCATGGAGGGTTCATGGGTCGACGCGAGTCGAATCAGCCGGTGATAATATTTTTGCAATTCAGTTTCATTCCAA
AAGTGAGAAGCTTAGAGTTGTGGGCAGTGGGCCGTGGGCTTTTGATAGGTCTCTGCTAGTCATGAGGGCACCAGCCTCAACAGGTTCAGCAATCTCTGTTGAGTT
TAATGAGTGTTCGTTCTGGGTGCAGATTCACAACATTCCTATGCATTGGCAGACTAGAAAGATGGCTGGGTTTCTAGGGAATAGAATAGGTAGGGTGGAGGAGGT
GAGTGGTATGGAAGAAAATGAGTGGACCGGGCCGATCATGAGACTCAGAGTATGTTTGAACTTGGGTCGACCATTACGACGGGGCTTCATACTCCAAGTTAGCAA
TGAGGAGGAGGTTTGGTGCCCAGTTGTATATGAGAGGCTCCCCGATTTCTACTACAACTGTGGACGGTTAGGACATTTGAAGAGGGAGTGTGGGCATGAGGTTTC
AGAGGGGGGTGATGGGCGCAGGGAGCAGTATGGGGAATGGATGAAAGCGGGGATGATGCTGGGGGAAGTGGGTAGGAGATGGGGTGAGGAGAAAAGGGATGGACA
GAATCCATCTGGGCAGGGGGCAAGGGAGAGTGATGTGGGGGTGGAGGTTCCTCGGGGGGAGGAGTCGGTAGAAGGGGGACAGTCTGGGATTGGCTTAGATTCGGG
GAAGGATGGTGATAAGGGGAAAGGGATTCTAGGGGTAGGGGAGGTGGCGGATGGGGATGGAGACATTGGGGGGATGGAGGTGGATGGGGTAGAGGAAGGACAGGT
TGAGGGGTCGGCTGAAGTGTCGAAGTCTCAAAAAGGGAAGGGTTGGAAGAGGAGAGCCCCGAGTGTGTTGACTGAAATCACAAACACAGGTAGTGGCTCCGAGAG
GTTAGGTCCCAAACGTAAGACCTCTACTAGTGCATCTGAGCTGCAGGGATCTAAGAGGGCTAGAGGCGACACTTTGAGGGTGGTGTGGGAGAAGGAAACAAAGGT
TCATTCTTCACGATTTGATTTTATTAGAGCCAAGCTGGGCTTTGATGGGTGCTTCAGTGTTGACAGTTCAGGAAGAAGTGGTGGTTTGGCTTTGTTATGGGATTC
TACGATCAGTTTCAGCCTTATTTCTTTCTCCCCGAACCATATAGATGGCTGGGTGGATTGGAACCAGTTTCAGTGGCGTTTGACAGGTATTTATGGGCACCCTCA
GGCGGAGTTGAAGAAGAGAACATGGTCCCTCCTAAAGCATCTCCGTGGGCATCCGTCTACTCCATGGCTTATGGGAGGGGATTTTAACGCTATTTTGTTCCATCA
TGAGAAGGAAGGGGGGAGGGATAAACCGGAGTCTGAGCTGAATGGCTTCCGTGAGGCAGTGGATACTTGTTGTATGGTTGATTTGGGCTTCAAGGGGGATAAGTT
CTCCTGGTGTAACGGGCGCCCGGGGAGCGAAACGGTGTGGGAGCGGATTGACAGATGTTTGGGCAATATGGCCTGGCAGGACCTGTTCCCTCAACAGGAAGTATC
CCATCTAGATTTCAGCAGATCTGACCATAGGCCGATTCTGCTTTCACTCCTCCCTCCGCGCCAGGTAAGGTATGCTCAGAGGCGTAGAATTCGTCGTTTTGAGGA
TGTCTGGCTGATGTATGCTGAGTTTAAGAATGTGGTGTCGGATTCATGGCAGTCAGGTGGAGCTGATGCTTCATCAGTGGGTTTGAAGGTGGCTATTGAACGTTG
CTTAGCCTCTTTGGTTAAGTGGGGTAGGGATAAGGTGGGACGGTATAGTGACCGTATCAAGGAAGCTTCAGCCGAGGTGCAGTCTGCTTTATGTAATTTGAGTAG
ATCGGTTTCGAGGGTCGATTTACAGGAGGCAGAATCGAGGCTTGAGTCTATTCTGGTGGAGGAGGAAGTTTACTGGAAGCAGAGATCCAGGGAAAACTGGTTGAG
TTGGGGTGACAGGAACACTCGGTGGTTCCATAATCATGCATCGCAGAGACGGAAAAGGAATACCATCTGTGGTCTTGAGGATGAGTCAGGTATGTGGCATCAGGA
TCCGAGAGATGTTCTTAGAGTGATCTCAGACTACTTTACACACATGTACTCCTCTTGTAGGCCTTCGGTGGAGGAAGTAAATAAGGTCATTGGTAACGTTCAGCC
GTCAGTGACAGAGGATATGAATAGGCTCTTGCTCAAACCTTTCACACGAGAGGATATTGTTGGGGCTCTGGCGCAGATTCACCCGAATAAGGCCCCAGGTCCGGA
TGGAATGTCAGGTGCTTTCTACCGACAGTCTTGGGAGATAGTTGGAGAGGACATGATGAGGTTTTGTCTGGGGGTCCTGAATAACAAGGAATCTCCAGAGGTTTT
AAACGAGACGATGATTGTGTTGATACCTAAGGTGAGGAATCCTAGGCGGGTGTCTGAGTTCCGCCCTATATCCTTGTGTAATGTGGCCTACAAGCTGATTTCTAA
AGTATTAGTGAGCAGAATGAAGGGGTTGTTGAACAAGCTGATAGCCCAGAAGCAGAGTGCTTTTATACCAGGAAGGTGTGTGGTAGATAATGCCATTGTGGGATA
TGAGTGCATTCATGCTCTGAGGACAAAAGGAAAGGGAAGAGTAGGTTGGGCCTCGCTAAAGCTAGATATGAGCAAGGCATATGATCTGGTGGAGTGGGTGTTTCT
GGAACAGATCATGTTGAAAATGGGGTTCGCTCCCAATTGGGTGGACCTGATTGGCAAATGCATTTCGACTGTCAGGTTCTCGTTCAGTATTAATGGGTCCCGTTA
TGGTGAAGTGAAACCGAGTAGGGGCCTTCGACAAGGAGACCCTCTGTCGCCATACCTCTTCCTATTTTGTAGTGAGGGATTGTCTGCCATGTTGCATCAGGCAGA
GGCTTCCAGCCCAAATACTGGTGCTCAGACACAGGAAAATGTGAGTGAGGTTCTCCATGTGAATGTCACTCTCTGCCATCAGCATTATCTGGGGCTCCCAGCTTT
CATGCCTCGTAGTAGAGTGAGCTCGTTGAAGTTCACTAAAGATAGAGTGTGGAGGAGAATACAAGGGTGGAAGGGGAAGCTTTTTTCGGTGGGTGGAAGATAA
Protein sequenceShow/hide protein sequence
MDELVRLWKALTLTEEEAEVIRAEAVPAALSVEAIDLCVVGKVLTSKKINPDAFRSVMKAAWRVHGSTRVESAGDNIFAIQFHSKSEKLRVVGSGPWAFDRSLLV
MRAPASTGSAISVEFNECSFWVQIHNIPMHWQTRKMAGFLGNRIGRVEEVSGMEENEWTGPIMRLRVCLNLGRPLRRGFILQVSNEEEVWCPVVYERLPDFYYNC
GRLGHLKRECGHEVSEGGDGRREQYGEWMKAGMMLGEVGRRWGEEKRDGQNPSGQGARESDVGVEVPRGEESVEGGQSGIGLDSGKDGDKGKGILGVGEVADGDG
DIGGMEVDGVEEGQVEGSAEVSKSQKGKGWKRRAPSVLTEITNTGSGSERLGPKRKTSTSASELQGSKRARGDTLRVVWEKETKVHSSRFDFIRAKLGFDGCFSV
DSSGRSGGLALLWDSTISFSLISFSPNHIDGWVDWNQFQWRLTGIYGHPQAELKKRTWSLLKHLRGHPSTPWLMGGDFNAILFHHEKEGGRDKPESELNGFREAV
DTCCMVDLGFKGDKFSWCNGRPGSETVWERIDRCLGNMAWQDLFPQQEVSHLDFSRSDHRPILLSLLPPRQVRYAQRRRIRRFEDVWLMYAEFKNVVSDSWQSGG
ADASSVGLKVAIERCLASLVKWGRDKVGRYSDRIKEASAEVQSALCNLSRSVSRVDLQEAESRLESILVEEEVYWKQRSRENWLSWGDRNTRWFHNHASQRRKRN
TICGLEDESGMWHQDPRDVLRVISDYFTHMYSSCRPSVEEVNKVIGNVQPSVTEDMNRLLLKPFTREDIVGALAQIHPNKAPGPDGMSGAFYRQSWEIVGEDMMR
FCLGVLNNKESPEVLNETMIVLIPKVRNPRRVSEFRPISLCNVAYKLISKVLVSRMKGLLNKLIAQKQSAFIPGRCVVDNAIVGYECIHALRTKGKGRVGWASLK
LDMSKAYDLVEWVFLEQIMLKMGFAPNWVDLIGKCISTVRFSFSINGSRYGEVKPSRGLRQGDPLSPYLFLFCSEGLSAMLHQAEASSPNTGAQTQENVSEVLHV
NVTLCHQHYLGLPAFMPRSRVSSLKFTKDRVWRRIQGWKGKLFSVGGR