; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G18820 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G18820
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRibonuclease H
Genome locationChr1:14239572..14243162
RNA-Seq ExpressionCSPI01G18820
SyntenyCSPI01G18820
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]0.0e+0054.04Show/hide
Query:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK
        +SSP+VGQTN+S +   GG S   Q     ++NNW+Q  FDPI MSYT+LLP L+KSHQVA+VPQEPLQP YPKWYDP+AKCEYHAGAVGHST+NCFPLK
Subjt:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK

Query:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS
        AKVQSLV AGWL+F+KTG+EP+VNQNPLPNHE P IN V T ++ +K+ V  V TSM+ LFQILH AGY+    ++ + E+    N + CL H  T + S
Subjt:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS

Query:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI
        IE C EF+ EVQKLMD+KIL++ Q ++Q++E+++I +A+S+         + +PL+I YE KP   + IQ PK +TVE                      
Subjt:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI

Query:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA
              +  GG+TRSGRCYTP  LKD+SKEDEVRRRKGKA+EM G DDL+DLSKVFT K TL EK+ + EVVSK+E+ E                     
Subjt:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA

Query:  RISILSLFMYSEPHRKMKKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEP
                          + PPEGTGHTKALHI VKCKDHHVARVLVDN SSLNIMS STLMKLP DPSYLRPSTMVV+AFD A REVIGDID+P KI P
Subjt:  RISILSLFMYSEPHRKMKKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEP

Query:  STFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPTE--------
        STFNV FQ                IHSAGAV SSLHQ+LKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEAL+CSYRSFE ANATIFPTE        
Subjt:  STFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPTE--------

Query:  -------------------------------------ANERFGLGFKPTASDWEK------------------------------TFRLGERLFNTDQRK
                                             A E+FGLG+KP  S+ EK                              TF+LGE LF+T QRK
Subjt:  -------------------------------------ANERFGLGFKPTASDWEK------------------------------TFRLGERLFNTDQRK

Query:  GRNEDSETPIAVISEN------------------------------GSQKGIRVESDISDVVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPY
          NEDSE  IAV+SEN                              GSQK  RVE D+ D VDFEVPICNLEQNI   ESD+S ELLR+IEQEEKKT+ Y
Subjt:  GRNEDSETPIAVISEN------------------------------GSQKGIRVESDISDVVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPY

Query:  QETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLV
        QETL VIN G  EEVK+V+IGTL S+QDQS+LVTLLHEFKDIFAWSYQDM  LDTEIVTHRLP KPECKP+RQKLRKLKPEMLIKIKEEVKKQFD GFL 
Subjt:  QETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLV

Query:  VAKYPDWVANIVSIPKKYGK--------------------------------------------------------------------------------
        VAKYP WVANIV +PKK GK                                                                                
Subjt:  VAKYPDWVANIVSIPKKYGK--------------------------------------------------------------------------------

Query:  -----RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRRHQGGSRQIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEIC
             RAMVTLFH LMHKEIEVYVDDMIAKSR EEKHVVTLR+     R  K  + L P K    V S  G+L  +   +S      +P  K+  K+ + 
Subjt:  -----RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRRHQGGSRQIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEIC

Query:  HWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWTT
          +   Q+   +IKDYLQSPPILV PTPGRPLILYLTVK+GSMGCVLGQ DST +K++AVYYLSKKFTNYESKYSLLEKTCC L WTT
Subjt:  HWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWTT

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa]0.0e+0057.39Show/hide
Query:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK
        +SSP+VGQTN+S +   GG S   Q     ++NNW+Q  FDPI MSYT+LLPQL+KSHQVA+VPQEPLQP YPKWYDP+AKCEYHAGAVGHST+NCFPLK
Subjt:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK

Query:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS
        AKVQSLV AGWL+F+KTG+EP+VNQNPLPNHE P+IN V T ++ +K+ V  V TSM+ LFQILH AGY+    ++ + E+    N + CLFH  T + S
Subjt:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS

Query:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI
        IE C EF+ EVQKLMD+KIL++ Q ++QE+E+++I +A+S+         + +PL+I YE KP   + IQ PK +TVE+ GPF YK+N  VPWKYECQ I
Subjt:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI

Query:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA
        T+NV   T  G+TRSGRCYTP  LKD+SKEDEVRRRKGKA+EM G DDL+DLSKVFT K TL EK+ + EVVSK+E+ EFLKLIK+SEYKVIEQLHRTPA
Subjt:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA

Query:  RISILSLFMYSEPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKL
        RIS+LSLF YSEPHRK+                                    ++ PPEGTGHTKALHI VKCKDHHVARVLVDN SSLNIMS STLMKL
Subjt:  RISILSLFMYSEPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKL

Query:  PKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVL
        P DPSYLRPSTMVV+AFDGA REVIGDID+P KI PSTFNV FQ                IHSAGAV SSLHQ                           
Subjt:  PKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVL

Query:  PYVEATEEALKCSYRSFEIANATIFPTE---------------------------------------------ANERFGLGFKPTASDWEK---------
            +   +  CSYRSFEIANATIFPTE                                             A E+FGLG+KP  S+WEK         
Subjt:  PYVEATEEALKCSYRSFEIANATIFPTE---------------------------------------------ANERFGLGFKPTASDWEK---------

Query:  ---------------------TFRLGERLFNTDQRKGRNEDSETPIAVISEN------------------------------GSQK----GIRVESDISD
                             TF+LGE LF+T QRK  NEDSE  I V+SEN                              GSQK      RVE D+ D
Subjt:  ---------------------TFRLGERLFNTDQRKGRNEDSETPIAVISEN------------------------------GSQK----GIRVESDISD

Query:  VVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTH
         VDFEVPICNLEQNI   ESD+S ELLR+IEQEEKKT+PYQETLKVINLG  EEVK+V+IGTL S+QDQS+LVTLLHEFKDIFAWSYQDMP LDTEIVTH
Subjt:  VVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTH

Query:  RLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK----------------------------------------
        RLP KPECKP+RQKLRKLKPEMLIKIKEEVKKQFD GFL VAKYP WVANIV +PKK GK                                        
Subjt:  RLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK----------------------------------------

Query:  --------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRR--------------------------------HQG---GSRQIKVIVG
                      RAMVTLFH LMHKEIEVYVDDM+AKSR EEKHVVTLR+                                 +G      +I+ IV 
Subjt:  --------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRR--------------------------------HQG---GSRQIKVIVG

Query:  LKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRK
        L+PPKTQ EVRSFLGRLNY ARFISHLTQTCEPILKLLRK+EICHWNEDCQ+AFDKIKDYLQSPPILV PTPGRPLI  LTVK+GSMGCVLGQHDST +K
Subjt:  LKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRK

Query:  KRAVYYLSKKFTNYESKYSLLEKTCCALTWTT
        ++AVYYLSKKFTNYESKYSLLEKTCCAL WTT
Subjt:  KRAVYYLSKKFTNYESKYSLLEKTCCALTWTT

XP_031738857.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116402780 [Cucumis sativus]1.6e-24141.07Show/hide
Query:  KNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGKEPNVNQNPLPNH
        K N     FDPI M+YT+LLPQL+ + Q+A +P  P+QP YPKWYDP+A+C+YHAG VGHST+NC  LK KVQSL+NAGWL F+K G++P+VN NPLPNH
Subjt:  KNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGKEPNVNQNPLPNH

Query:  ENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIV----SQTNI
        EN  +N +   V   K++V+++   M  LF+ L EAGY+ L   D N+  +     K C+FH G     I+ C  FR +VQ+ MD+KIL V     +  +
Subjt:  ENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIV----SQTNI

Query:  QEVEIDVIFDATSSVQEPL------IIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GMTRSGRCYTPYVLKDIS
        ++ +I  +    +  + P       +   E +  S++C   PK + VEV  PF +K+ + VPW+Y+CQ+IT       TG  G+TRSGRCY P  L   S
Subjt:  QEVEIDVIFDATSSVQEPL------IIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GMTRSGRCYTPYVLKDIS

Query:  KEDEV-RRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM----------
            + + RK +   +   D   D+    TAK    +K +  E  +     EFLK++K+SEYK+IEQ+H TPARIS+LSLF+ SEPHRK+          
Subjt:  KEDEV-RRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM----------

Query:  --------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGD
                                   + PPEG GH KALHI VK KD+ +ARVLVDN S+LNIM  STL+KLP D SY++ STMVV+AFDG+ REVIGD
Subjt:  --------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGD

Query:  IDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPT
        I++P KI P TFN+ FQ                IHSAG V S+LHQKLKF V      + GEED  +TK    PYVEATEEAL+CS+RSFEIA+AT+   
Subjt:  IDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPT

Query:  EANE-------------------------------------RFGLGFKPTASD--------------------WEKTFRLGERLFNTDQRKG-----RNE
         A E                                     RFGLG+ P   D                    ++ + +    L++T +  G     R+ 
Subjt:  EANE-------------------------------------RFGLGFKPTASD--------------------WEKTFRLGERLFNTDQRKG-----RNE

Query:  DSETPIAVISENGSQKGIRVESDISDVVDFEVP----------------------------------------------ICNLEQNIESDESD---VSLE
        DS+  +    E+ S   +  E+   D   +  P                                              I  +E + ESD+ +   +S E
Subjt:  DSETPIAVISENGSQKGIRVESDISDVVDFEVP----------------------------------------------ICNLEQNIESDESD---VSLE

Query:  LLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIK
        LLR++E+E+K   P+QE ++VINLG++EE K+V+IGT  + + +  ++ LL E+ DIFAWSYQDMP L+T+IV HR+P KPEC PVRQKLRK+KP++LIK
Subjt:  LLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIK

Query:  IKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK-----------------------------------------------------------------
        +KEEV+KQ + GFL+V+KYP+WVANIV +PKK GK                                                                 
Subjt:  IKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK-----------------------------------------------------------------

Query:  --------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVV-------TLRRHQ----------GGSR------------------Q
                            RAMVTLFH +MHKEIEVYVDDMIAKS+  E H          LR++Q          G +                   +
Subjt:  --------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVV-------TLRRHQ----------GGSR------------------Q

Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQH
        +K I+ +  P+T+ EVR+FLGRLNYI+RFISHLT TCEPI KLLRK+    WNEDC+ AF+KIK YLQSPP+L+ P PGRPLILYLTV +GSMG VLGQH
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQH

Query:  DSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT
        D + +K+ A+YYLSKKFT+YESKYS+LE+TCCAL WT
Subjt:  DSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT

XP_031741698.1 uncharacterized protein LOC116403895 [Cucumis sativus]1.2e-24141.07Show/hide
Query:  KNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGKEPNVNQNPLPNH
        K N     FDPI M+YT+LLPQL+ + Q+A +P  P+QP YPKWYDP+A+C+YHAG VGHST+NC  LK KVQSL+NAGWL F+K G++P+VN NPLPNH
Subjt:  KNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGKEPNVNQNPLPNH

Query:  ENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIV----SQTNI
        EN  +N +   V   K++V+++   M  LF+ L EAGY+ L   D N+  +     K C+FH G     I+ C  FR +VQ+ MD+KIL V     +  +
Subjt:  ENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIV----SQTNI

Query:  QEVEIDVIFDATSSVQEPL------IIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GMTRSGRCYTPYVLKDIS
        ++ +I  +    +  + P       +   E +  S++C   PK + VEV  PF +K+ + VPW+Y+CQ+IT       TG  G+TRSGRCY P  L   S
Subjt:  QEVEIDVIFDATSSVQEPL------IIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GMTRSGRCYTPYVLKDIS

Query:  KEDEV-RRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM----------
            + + RK +   +   D   D+    TAK    +K +  E  +     EFLK++K+SEYK+IEQ+H TPARIS+LSLF+ SEPHRK+          
Subjt:  KEDEV-RRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM----------

Query:  --------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGD
                                   + PPEG GH KALHI VK KD+ +ARVLVDN S+LNIM  STL+KLP D SY++ STMVV+AFDG+ REVIGD
Subjt:  --------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGD

Query:  IDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPT
        I++P KI P TFN+ FQ                IHSAG V S+LHQKLKF V      + GEED  +TK    PYVEATEEAL+CS+RSFEIA+AT+   
Subjt:  IDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPT

Query:  EANE-------------------------------------RFGLGFKPTASD--------------------WEKTFRLGERLFNTDQRKG-----RNE
         A E                                     RFGLG+ P   D                    ++ + +    L++T +  G     R+ 
Subjt:  EANE-------------------------------------RFGLGFKPTASD--------------------WEKTFRLGERLFNTDQRKG-----RNE

Query:  DSETPIAVISENGSQKGIRVESDISDVVDFEVP----------------------------------------------ICNLEQNIESDESD---VSLE
        DS+  +    E+ S   +  E+   D   +  P                                              I  +E + ESD+ +   +S E
Subjt:  DSETPIAVISENGSQKGIRVESDISDVVDFEVP----------------------------------------------ICNLEQNIESDESD---VSLE

Query:  LLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIK
        LLR++E+E+K   P+QE ++VINLG++EE K+V+IGT  + + +  ++ LL E+ DIFAWSYQDMP L+T+IV HR+P KPEC PVRQKLRK+KP++LIK
Subjt:  LLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIK

Query:  IKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK-----------------------------------------------------------------
        +KEEV+KQ + GFL+V+KYP+WVANIV +PKK GK                                                                 
Subjt:  IKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK-----------------------------------------------------------------

Query:  --------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVV-------TLRRHQ----------GGSR------------------Q
                            RAMVTLFH +MHKEIEVYVDDMIAKS+  E H          LR++Q          G +                   +
Subjt:  --------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVV-------TLRRHQ----------GGSR------------------Q

Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQH
        +K I+ +  P+T+ EVR+FLGRLNYI+RFISHLT TCEPI KLLRK+    WNEDC+ AF+KIK YLQSPP+L+ P PGRPLILYLTV +GSMG VLGQH
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQH

Query:  DSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT
        D + +K+ A+YYLSKKFT+YESKYS+LE+TCCAL WT
Subjt:  DSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT

XP_031742360.1 uncharacterized protein LOC116404324 [Cucumis sativus]1.6e-24141.07Show/hide
Query:  KNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGKEPNVNQNPLPNH
        K N     FDPI M+YT+LLPQL+ + Q+A +P  P+QP YPKWYDP+A+C+YHAG VGHST+NC  LK KVQSL+NAGWL F+K G++P+VN NPLPNH
Subjt:  KNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGKEPNVNQNPLPNH

Query:  ENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIV----SQTNI
        EN  +N +   V   K++V+++   M  LF+ L EAGY+ L   D N+  +     K C+FH G     I+ C  FR +VQ+ MD+KIL V     +  +
Subjt:  ENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIV----SQTNI

Query:  QEVEIDVIFDATSSVQEPL------IIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GMTRSGRCYTPYVLKDIS
        ++ +I  +    +  + P       +   E +  S++C   PK + VEV  PF +K+ + VPW+Y+CQ+IT       TG  G+TRSGRCY P  L   S
Subjt:  QEVEIDVIFDATSSVQEPL------IIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GMTRSGRCYTPYVLKDIS

Query:  KEDEV-RRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM----------
            + + RK +   +   D   D+    TAK    +K +  E  +     EFLK++K+SEYK+IEQ+H TPARIS+LSLF+ SEPHRK+          
Subjt:  KEDEV-RRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM----------

Query:  --------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGD
                                   + PPEG GH KALHI VK KD+ +ARVLVDN S+LNIM  STL+KLP D SY++ STMVV+AFDG+ REVIGD
Subjt:  --------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGD

Query:  IDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPT
        I++P KI P TFN+ FQ                IHSAG V S+LHQKLKF V      + GEED  +TK    PYVEATEEAL+CS+RSFEIA+AT+   
Subjt:  IDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPT

Query:  EANE-------------------------------------RFGLGFKPTASD--------------------WEKTFRLGERLFNTDQRKG-----RNE
         A E                                     RFGLG+ P   D                    ++ + +    L++T +  G     R+ 
Subjt:  EANE-------------------------------------RFGLGFKPTASD--------------------WEKTFRLGERLFNTDQRKG-----RNE

Query:  DSETPIAVISENGSQKGIRVESDISDVVDFEVP----------------------------------------------ICNLEQNIESDESD---VSLE
        DS+  +    E+ S   +  E+   D   +  P                                              I  +E + ESD+ +   +S E
Subjt:  DSETPIAVISENGSQKGIRVESDISDVVDFEVP----------------------------------------------ICNLEQNIESDESD---VSLE

Query:  LLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIK
        LLR++E+E+K   P+QE ++VINLG++EE K+V+IGT  + + +  ++ LL E+ DIFAWSYQDMP L+T+IV HR+P KPEC PVRQKLRK+KP++LIK
Subjt:  LLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIK

Query:  IKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK-----------------------------------------------------------------
        +KEEV+KQ + GFL+V+KYP+WVANIV +PKK GK                                                                 
Subjt:  IKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK-----------------------------------------------------------------

Query:  --------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVV-------TLRRHQ----------GGSR------------------Q
                            RAMVTLFH +MHKEIEVYVDDMIAKS+  E H          LR++Q          G +                   +
Subjt:  --------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVV-------TLRRHQ----------GGSR------------------Q

Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQH
        +K I+ +  P+T+ EVR+FLGRLNYI+RFISHLT TCEPI KLLRK+    WNEDC+ AF+KIK YLQSPP+L+ P PGRPLILYLTV +GSMG VLGQH
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQH

Query:  DSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT
        D + +K+ A+YYLSKKFT+YESKYS+LE+TCCAL WT
Subjt:  DSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase0.0e+0054.04Show/hide
Query:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK
        +SSP+VGQTN+S +   GG S   Q     ++NNW+Q  FDPI MSYT+LLP L+KSHQVA+VPQEPLQP YPKWYDP+AKCEYHAGAVGHST+NCFPLK
Subjt:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK

Query:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS
        AKVQSLV AGWL+F+KTG+EP+VNQNPLPNHE P IN V T ++ +K+ V  V TSM+ LFQILH AGY+    ++ + E+    N + CL H  T + S
Subjt:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS

Query:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI
        IE C EF+ EVQKLMD+KIL++ Q ++Q++E+++I +A+S+         + +PL+I YE KP   + IQ PK +TVE                      
Subjt:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI

Query:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA
              +  GG+TRSGRCYTP  LKD+SKEDEVRRRKGKA+EM G DDL+DLSKVFT K TL EK+ + EVVSK+E+ E                     
Subjt:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA

Query:  RISILSLFMYSEPHRKMKKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEP
                          + PPEGTGHTKALHI VKCKDHHVARVLVDN SSLNIMS STLMKLP DPSYLRPSTMVV+AFD A REVIGDID+P KI P
Subjt:  RISILSLFMYSEPHRKMKKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEP

Query:  STFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPTE--------
        STFNV FQ                IHSAGAV SSLHQ+LKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEAL+CSYRSFE ANATIFPTE        
Subjt:  STFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIANATIFPTE--------

Query:  -------------------------------------ANERFGLGFKPTASDWEK------------------------------TFRLGERLFNTDQRK
                                             A E+FGLG+KP  S+ EK                              TF+LGE LF+T QRK
Subjt:  -------------------------------------ANERFGLGFKPTASDWEK------------------------------TFRLGERLFNTDQRK

Query:  GRNEDSETPIAVISEN------------------------------GSQKGIRVESDISDVVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPY
          NEDSE  IAV+SEN                              GSQK  RVE D+ D VDFEVPICNLEQNI   ESD+S ELLR+IEQEEKKT+ Y
Subjt:  GRNEDSETPIAVISEN------------------------------GSQKGIRVESDISDVVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPY

Query:  QETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLV
        QETL VIN G  EEVK+V+IGTL S+QDQS+LVTLLHEFKDIFAWSYQDM  LDTEIVTHRLP KPECKP+RQKLRKLKPEMLIKIKEEVKKQFD GFL 
Subjt:  QETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLV

Query:  VAKYPDWVANIVSIPKKYGK--------------------------------------------------------------------------------
        VAKYP WVANIV +PKK GK                                                                                
Subjt:  VAKYPDWVANIVSIPKKYGK--------------------------------------------------------------------------------

Query:  -----RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRRHQGGSRQIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEIC
             RAMVTLFH LMHKEIEVYVDDMIAKSR EEKHVVTLR+     R  K  + L P K    V S  G+L  +   +S      +P  K+  K+ + 
Subjt:  -----RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRRHQGGSRQIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEIC

Query:  HWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWTT
          +   Q+   +IKDYLQSPPILV PTPGRPLILYLTVK+GSMGCVLGQ DST +K++AVYYLSKKFTNYESKYSLLEKTCC L WTT
Subjt:  HWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWTT

A0A5A7T7R4 Ribonuclease H3.3e-22941.13Show/hide
Query:  SSNPHGGGHSQPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKF
        S  P   G     SS   + WR   FD I M+YT+LLPQL+++ Q+A +P  P+QP YPKWYD +A+C+YHAG VGHST+NC  LK KVQSL+N GWL F
Subjt:  SSNPHGGGHSQPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKF

Query:  QKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKL
        +K+G++ NVN+NPLP+HENP +NVV +LV+  K++V+++   M  LF+ L EAGY+     D N+  +     + C+ H G     ++ C +FR +VQ+ 
Subjt:  QKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKL

Query:  MDAKILIV----SQTNIQEVEIDVIFDATSSVQE-----PLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GM
        MD+KIL V     +  +++ ++  + D  S  ++     PL + Y+   + +T    PK +T++V  PF +K+ + VPW+Y+CQ+IT       TG  G+
Subjt:  MDAKILIV----SQTNIQEVEIDVIFDATSSVQE-----PLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTG--GM

Query:  TRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKIN-QEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYS
        TRSGRCY P  L   S  D +   +G+  E     ++ +  K    +  +  K I  +++V+ +E+ EFLK++K+SEYK+IEQ+H TPARIS+LSLF+ S
Subjt:  TRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKIN-QEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYS

Query:  EPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPST
        EPHRK+                                     + PPEG GHTKALHI VKCKD+ +ARVLVDN S+LNIM  STL+KLP D S+++ ST
Subjt:  EPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPST

Query:  MVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALK
        MVVK FDG+ REV+GDI++P KI P  FN+ FQ                IHSAG V S+LHQKLKF V      V GEED  +TK+  + YVEATEEAL+
Subjt:  MVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALK

Query:  CSYRSFEIANA--------------------------------------TIFPTEANE-RFGLGFKPTASD--------------------WEKTFRLGE
        CS+RSFEIA+A                                      T+  T +N+ RFGLG+KP+  D                    ++ + +   
Subjt:  CSYRSFEIANA--------------------------------------TIFPTEANE-RFGLGFKPTASD--------------------WEKTFRLGE

Query:  RLFNTDQRKG---RNEDSETPIAVISENGS------------QKGIRVESDISDVVD-FEVPICNLEQNIESDESD---VSLELLRMIEQEEKKTLPYQE
         L++  +  G    +++S+  I ++++ GS            +  I    D+S+    F+V I  +E + ESD+ D   +S ELLRM+E+E++   P+QE
Subjt:  RLFNTDQRKG---RNEDSETPIAVISENGS------------QKGIRVESDISDVVD-FEVPICNLEQNIESDESD---VSLELLRMIEQEEKKTLPYQE

Query:  TLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVA
         ++ INLG++EE K                               +DMP L T+IV HR+P KPEC PVRQKLRK+KP++LIKIKEEV+KQ + GFL V+
Subjt:  TLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVA

Query:  KYPDWVANIVSIPKKYGK------------------------------------------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRP
        KYP+WVAN+V +PKK GK                                                      R MVTLFH +MHKEIEVYVDDMIAKS+ 
Subjt:  KYPDWVANIVSIPKKYGK------------------------------------------------------RAMVTLFHYLMHKEIEVYVDDMIAKSRP

Query:  EEKHVVT-------LRRHQ----------GGSR------------------QIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSE
        +E H  T       LR++Q          G +                   +++ I+ +  PKT+ E+R FLGRLNYI+RFISHLT TCEPI KLL K+ 
Subjt:  EEKHVVT-------LRRHQ----------GGSR------------------QIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSE

Query:  ICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT
           WNEDC+ AF+KIK YLQSPP+L+ P PGRPLILYLTV + SMG VLGQHD + +KK  +YYLSKKFTNYES+YS+LE+TCC L WT
Subjt:  ICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT

A0A5A7V681 Retrotrans_gag domain-containing protein9.9e-23459.04Show/hide
Query:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK
        +SSP+VGQTN+S +   GG S   Q     ++NNW+Q  FDPI MSYT+LLPQL+KSHQVA+VPQEPLQP YPKWYDP+AKCEYHAGAVGHST+NCFPLK
Subjt:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK

Query:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS
        AKVQSLV AGWL+F+KTG+EP+VNQNPLPNHE P IN V T ++ +K+ V  V TSM+ LFQILH AGY+    ++ + E+    N + CLFH  T + S
Subjt:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS

Query:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI
        IE C EF+ EVQKLMD+KIL++ Q ++QE+E+++I +A+S+         + +P +I YE KP   + IQ PK +TVE+ GPF YK+N  VPWKYECQ I
Subjt:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI

Query:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA
        T+NV S T GG+TRSGRCYTP  LKD+SKEDEVRRRKGKA+EM G DDL+DLSKVFT K TL EK+ + EVVSK+E+ EFLKLIK+SEYKVIEQLHRTPA
Subjt:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA

Query:  RISILSLFMYSEPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKL
        RIS+LSLFMYSEPHRK+                                    ++ PPEGTGHTKALHI VKCKDHHVARVLVDN SSLNIMS STLMKL
Subjt:  RISILSLFMYSEPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKL

Query:  PKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVL
        P DPSYLRPSTMVV+AFDGA REVIGDID+P KI PSTFNV FQ                IHSAGAV SSLHQ+LKFSVE GQAIVYGEEDMFVTKTSVL
Subjt:  PKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVL

Query:  PYVEATEEALKCSYRSFEIANATIFPTE---------------------------------------------ANERFGLGFKPTASDWEK---------
        PYVEATEEAL+CSYRSFEIANATIFPTE                                             A E FGLG+KP  S+WEK         
Subjt:  PYVEATEEALKCSYRSFEIANATIFPTE---------------------------------------------ANERFGLGFKPTASDWEK---------

Query:  ---------------------TFRLGERLFNTDQRKGRNEDSETPIAVISEN
                             TF+LGE LF+T QRK  NEDSE  IAV+SEN
Subjt:  ---------------------TFRLGERLFNTDQRKGRNEDSETPIAVISEN

A0A5A7VAU5 Uncharacterized protein0.0e+0057.39Show/hide
Query:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK
        +SSP+VGQTN+S +   GG S   Q     ++NNW+Q  FDPI MSYT+LLPQL+KSHQVA+VPQEPLQP YPKWYDP+AKCEYHAGAVGHST+NCFPLK
Subjt:  MSSPLVGQTNYSSNPHGGGHS---QPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLK

Query:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS
        AKVQSLV AGWL+F+KTG+EP+VNQNPLPNHE P+IN V T ++ +K+ V  V TSM+ LFQILH AGY+    ++ + E+    N + CLFH  T + S
Subjt:  AKVQSLVNAGWLKFQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQS

Query:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI
        IE C EF+ EVQKLMD+KIL++ Q ++QE+E+++I +A+S+         + +PL+I YE KP   + IQ PK +TVE+ GPF YK+N  VPWKYECQ I
Subjt:  IETCSEFRFEVQKLMDAKILIVSQTNIQEVEIDVIFDATSS---------VQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLI

Query:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA
        T+NV   T  G+TRSGRCYTP  LKD+SKEDEVRRRKGKA+EM G DDL+DLSKVFT K TL EK+ + EVVSK+E+ EFLKLIK+SEYKVIEQLHRTPA
Subjt:  TNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPA

Query:  RISILSLFMYSEPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKL
        RIS+LSLF YSEPHRK+                                    ++ PPEGTGHTKALHI VKCKDHHVARVLVDN SSLNIMS STLMKL
Subjt:  RISILSLFMYSEPHRKM------------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKL

Query:  PKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVL
        P DPSYLRPSTMVV+AFDGA REVIGDID+P KI PSTFNV FQ                IHSAGAV SSLHQ                           
Subjt:  PKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVL

Query:  PYVEATEEALKCSYRSFEIANATIFPTE---------------------------------------------ANERFGLGFKPTASDWEK---------
            +   +  CSYRSFEIANATIFPTE                                             A E+FGLG+KP  S+WEK         
Subjt:  PYVEATEEALKCSYRSFEIANATIFPTE---------------------------------------------ANERFGLGFKPTASDWEK---------

Query:  ---------------------TFRLGERLFNTDQRKGRNEDSETPIAVISEN------------------------------GSQK----GIRVESDISD
                             TF+LGE LF+T QRK  NEDSE  I V+SEN                              GSQK      RVE D+ D
Subjt:  ---------------------TFRLGERLFNTDQRKGRNEDSETPIAVISEN------------------------------GSQK----GIRVESDISD

Query:  VVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTH
         VDFEVPICNLEQNI   ESD+S ELLR+IEQEEKKT+PYQETLKVINLG  EEVK+V+IGTL S+QDQS+LVTLLHEFKDIFAWSYQDMP LDTEIVTH
Subjt:  VVDFEVPICNLEQNIESDESDVSLELLRMIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTH

Query:  RLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK----------------------------------------
        RLP KPECKP+RQKLRKLKPEMLIKIKEEVKKQFD GFL VAKYP WVANIV +PKK GK                                        
Subjt:  RLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK----------------------------------------

Query:  --------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRR--------------------------------HQG---GSRQIKVIVG
                      RAMVTLFH LMHKEIEVYVDDM+AKSR EEKHVVTLR+                                 +G      +I+ IV 
Subjt:  --------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRR--------------------------------HQG---GSRQIKVIVG

Query:  LKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRK
        L+PPKTQ EVRSFLGRLNY ARFISHLTQTCEPILKLLRK+EICHWNEDCQ+AFDKIKDYLQSPPILV PTPGRPLI  LTVK+GSMGCVLGQHDST +K
Subjt:  LKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRK

Query:  KRAVYYLSKKFTNYESKYSLLEKTCCALTWTT
        ++AVYYLSKKFTNYESKYSLLEKTCCAL WTT
Subjt:  KRAVYYLSKKFTNYESKYSLLEKTCCALTWTT

A0A6P5Y6V7 Ribonuclease H6.2e-22040.36Show/hide
Query:  QPNSS--FVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGK-EP
        +PNS     +N   ++ FDPI M+YTDL P+L++ H +A V  EP++P +PKWYDP+A C+YH G  GHST+NC  LK KVQ L+N G L F  + +  P
Subjt:  QPNSS--FVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLKFQKTGK-EP

Query:  NVNQNPLPNHENPVINVVVT-LVKHYKSDVYKVTTSMRILFQILHEAGYI-PLSADDGNVERKESINK-KTCLFHLGTCEQSIETCSEFRFEVQKLMDAK
        NVN NPLPNH  P IN ++     + K  + +V T M  +F++L +   + P+   +      E IN  ++C +H G    SI+ C +FR EVQKL+D  
Subjt:  NVNQNPLPNHENPVINVVVT-LVKHYKSDVYKVTTSMRILFQILHEAGYI-PLSADDGNVERKESINK-KTCLFHLGTCEQSIETCSEFRFEVQKLMDAK

Query:  ILIVSQT--NIQEVEIDVIFDATSSVQEPLIIQYEGKPDSTTCIQMP---KTITVEVLGPFPYKNNQVVPWKYECQLITN-NVASVT-TGGMTRSGRCYT
         +   Q    + + E+  I      V+ P+ I Y+ KP  T+ ++ P     +T++V GPFPY+N++ VPWKY+  ++ N + A++T  GG+TRSGRCYT
Subjt:  ILIVSQT--NIQEVEIDVIFDATSSVQEPLIIQYEGKPDSTTCIQMP---KTITVEVLGPFPYKNNQVVPWKYECQLITN-NVASVT-TGGMTRSGRCYT

Query:  PYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM---
        P  L+   KE   + + G+  E   G D           TTL  +K  Q+ VS+ E+ EFLKLIK SEY V+EQL++ PARIS+LSL + SE HR     
Subjt:  PYVLKDISKEDEVRRRKGKAMEMIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKM---

Query:  ---------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGA
                                         ++ PPEG G TKALHI VKCK H +A+VL+DN SS+N+M  +TL KLP D SY++PS M+V+AFDG 
Subjt:  ---------------------------------KKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSSLNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGA

Query:  HREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIA
         REV+GDI+VP +I P  FNV FQ                IH AGAV SSLHQK+KF VEG    V GEED+ VT+    PYVE+ +EAL+CSYRSFEIA
Subjt:  HREVIGDIDVPFKIEPSTFNVPFQ----------------IHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSYRSFEIA

Query:  NATIF------------------------------------------PTEANE---RFGLGFKPTASD--------------------------------
        NAT                                            P+  NE   RFGLG+KPT  D                                
Subjt:  NATIF------------------------------------------PTEANE---RFGLGFKPTASD--------------------------------

Query:  ---------------------WEKTFRLGERLFNTDQRKGRNEDSET-PIAVISENGSQKGIRVESDISDVVDFE--VPICNLEQNIESDESDVSLELLR
                              EK   L  ++ +  ++K R+E+    P     E  + K + +      +  FE  + +  L+   +SD   + L+LL+
Subjt:  ---------------------WEKTFRLGERLFNTDQRKGRNEDSET-PIAVISENGSQKGIRVESDISDVVDFE--VPICNLEQNIESDESDVSLELLR

Query:  MIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKE
        +IE E+K   P+QE  + +NLG  E  ++V++GT     ++  L  LL E+ D+FAW+YQDMP L T+IV H+LP K  CKP++QKLR++KPEML+KIKE
Subjt:  MIEQEEKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKE

Query:  EVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK--------------------------------------------------------------------
        EVKKQFD GFL VAKYP+WVANIV +PKK GK                                                                    
Subjt:  EVKKQFDVGFLVVAKYPDWVANIVSIPKKYGK--------------------------------------------------------------------

Query:  -----------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVT-------LRRHQ---------GGSR-------------------QIKV
                         RAMVTLFH +MHKEIEVYVDDMIAKSR  E H+V        LRR Q          G++                   +I+ 
Subjt:  -----------------RAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVT-------LRRHQ---------GGSR-------------------QIKV

Query:  IVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDST
        I  L PP T  EVR FLGRLNYIARFIS LT  C+PI KLLRK +   WN++CQ AFDKIK+YL +PP+L+ P PG+PLILYLTV + SMGCVLGQ D T
Subjt:  IVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDST

Query:  RRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT
         RK+RA+YYLSKKFT+YESKYS LEK CCALTWT
Subjt:  RRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWT

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.1e-1132.59Show/hide
Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-
        I  ++  K PK + E+R FLG +NY+ +FI   +Q   P+  LL+K     W     +A + IK  L SPP+L      + ++L     D ++G VL Q 
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-

Query:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL
        HD    K   V Y S K +  +  YS+ +K   A+
Subjt:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL

P0CT35 Transposon Tf2-2 polyprotein4.1e-1132.59Show/hide
Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-
        I  ++  K PK + E+R FLG +NY+ +FI   +Q   P+  LL+K     W     +A + IK  L SPP+L      + ++L     D ++G VL Q 
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-

Query:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL
        HD    K   V Y S K +  +  YS+ +K   A+
Subjt:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL

P0CT36 Transposon Tf2-3 polyprotein4.1e-1132.59Show/hide
Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-
        I  ++  K PK + E+R FLG +NY+ +FI   +Q   P+  LL+K     W     +A + IK  L SPP+L      + ++L     D ++G VL Q 
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-

Query:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL
        HD    K   V Y S K +  +  YS+ +K   A+
Subjt:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL

P0CT41 Transposon Tf2-12 polyprotein4.1e-1132.59Show/hide
Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-
        I  ++  K PK + E+R FLG +NY+ +FI   +Q   P+  LL+K     W     +A + IK  L SPP+L      + ++L     D ++G VL Q 
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-

Query:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL
        HD    K   V Y S K +  +  YS+ +K   A+
Subjt:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL

Q9UR07 Transposon Tf2-11 polyprotein4.1e-1132.59Show/hide
Query:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-
        I  ++  K PK + E+R FLG +NY+ +FI   +Q   P+  LL+K     W     +A + IK  L SPP+L      + ++L     D ++G VL Q 
Subjt:  IKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSEICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQ-

Query:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL
        HD    K   V Y S K +  +  YS+ +K   A+
Subjt:  HDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCAL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCACCGTTAGTTGGACAAACAAATTACTCTTCTAATCCCCATGGTGGTGGACATAGTCAGCCAAATAGTAGTTTTGTAAAGAATAATTGGCGGCAAATTCACTT
CGACCCCATACTCATGTCATATACTGATTTGTTGCCACAACTACTGAAGAGTCATCAGGTTGCTTTGGTGCCTCAAGAGCCTTTACAACCACACTACCCCAAGTGGTATG
ACCCTAGTGCAAAATGTGAGTATCATGCTGGAGCAGTGGGGCATTCTACGAAAAATTGTTTTCCATTGAAAGCAAAGGTACAAAGTTTGGTAAATGCTGGCTGGTTGAAA
TTTCAGAAGACTGGGAAGGAGCCAAATGTCAATCAAAACCCTCTCCCAAACCATGAAAATCCTGTCATAAACGTCGTTGTTACATTAGTAAAGCATTACAAGAGTGATGT
ATATAAGGTAACTACATCAATGAGGATCCTCTTTCAGATTCTTCATGAAGCTGGATATATACCACTAAGCGCTGATGATGGCAATGTAGAGAGAAAGGAGAGTATTAACA
AGAAGACATGTTTATTCCATTTGGGGACATGTGAGCAGTCCATAGAAACTTGTTCTGAGTTTAGGTTTGAAGTGCAAAAATTGATGGACGCAAAAATTTTAATAGTGAGT
CAGACGAACATACAAGAGGTTGAGATTGATGTGATTTTTGATGCGACATCATCTGTACAAGAACCATTAATCATCCAATATGAGGGAAAGCCGGATAGCACAACTTGCAT
TCAGATGCCTAAGACAATAACTGTAGAAGTACTAGGTCCTTTTCCTTATAAGAACAATCAAGTTGTACCATGGAAATATGAATGCCAATTGATAACAAATAATGTTGCCT
CTGTAACAACGGGAGGGATGACTCGTAGTGGAAGATGTTATACACCATATGTTTTAAAAGATATCTCCAAAGAGGATGAAGTTCGACGACGTAAGGGCAAAGCTATGGAG
ATGATAGGTGGGGATGATTTAAGTGATCTAAGCAAGGTTTTTACTGCGAAAACCACATTAGCTGAGAAAAAGATAAATCAGGAAGTTGTCTCCAAAAAGGAGTCACTTGA
ATTTTTGAAATTGATTAAGCGGAGCGAGTACAAAGTGATTGAGCAATTACATCGTACCCCAGCTCGTATATCGATTTTATCATTATTCATGTACTCTGAGCCACACCGTA
AGATGAAGAAATCCCCCCCTGAAGGTACTGGACATACTAAGGCATTGCATATATTCGTGAAGTGTAAGGATCATCATGTGGCGAGGGTCCTAGTGGATAATAGATCATCT
TTAAATATAATGTCAAGCTCCACTTTGATGAAACTTCCCAAAGATCCATCGTACCTAAGACCAAGTACTATGGTCGTTAAAGCCTTCGACGGTGCTCATAGGGAAGTAAT
CGGAGATATAGATGTCCCTTTCAAAATTGAACCTTCCACTTTCAATGTTCCATTTCAGATTCATTCAGCAGGGGCAGTTCTGTCTTCATTGCATCAAAAATTGAAGTTTA
GTGTAGAAGGTGGTCAAGCTATTGTCTACGGAGAAGAGGACATGTTTGTAACAAAAACATCAGTACTTCCTTATGTTGAAGCAACAGAAGAAGCTCTGAAATGTTCTTAC
AGATCATTCGAAATTGCTAATGCTACTATCTTTCCAACTGAAGCCAATGAGAGGTTCGGGTTGGGCTTTAAGCCAACGGCTTCTGATTGGGAGAAGACATTTAGACTTGG
AGAAAGGTTATTCAATACAGACCAAAGAAAGGGACGTAACGAAGATTCTGAGACCCCAATTGCTGTTATCTCTGAGAATGGATCACAAAAGGGCATCAGGGTTGAAAGCG
ACATCAGTGACGTTGTTGATTTTGAGGTTCCAATATGTAATCTTGAGCAAAATATTGAAAGCGACGAATCTGATGTATCTCTTGAATTACTAAGAATGATAGAGCAAGAG
GAAAAGAAAACTTTACCATATCAAGAGACTTTGAAAGTTATCAATTTGGGAGCACGAGAAGAAGTGAAGAAAGTACAAATTGGCACTTTGCCTTCAAAGCAGGATCAATC
AGATTTGGTAACCCTACTTCACGAGTTCAAAGATATATTTGCATGGTCCTACCAAGATATGCCCAATTTGGATACAGAAATTGTAACTCATCGATTGCCTCGTAAACCAG
AATGTAAGCCTGTACGACAAAAGCTTCGTAAATTGAAACCTGAGATGTTGATTAAAATTAAAGAAGAGGTTAAGAAGCAGTTTGATGTTGGTTTCTTAGTAGTGGCAAAA
TATCCAGATTGGGTAGCAAATATAGTCTCAATCCCAAAGAAATATGGAAAGAGAGCAATGGTTACACTATTCCATTACTTAATGCATAAAGAAATTGAAGTTTATGTTGA
TGATATGATTGCCAAGTCTAGACCTGAAGAAAAGCATGTGGTTACCCTTCGAAGGCATCAAGGTGGATCCAGACAAATTAAGGTGATAGTAGGCTTAAAGCCACCAAAAA
CACAAAATGAGGTTAGAAGTTTTCTAGGGAGGTTGAATTACATTGCACGATTCATTTCACACCTCACTCAAACTTGCGAACCAATTTTAAAACTCCTTCGCAAAAGTGAG
ATTTGTCACTGGAATGAAGATTGTCAAAGAGCTTTTGATAAAATCAAAGACTACTTGCAAAGTCCTCCTATCCTTGTCCTGCCAACTCCAGGACGACCATTAATCTTATA
CTTGACAGTGAAAGATGGGTCAATGGGATGTGTGCTAGGGCAACATGATTCTACTAGAAGGAAGAAACGAGCTGTCTATTATTTAAGCAAGAAGTTCACGAATTATGAAT
CAAAGTACTCACTGTTGGAAAAGACGTGTTGCGCCCTGACATGGACAACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATCACCGTTAGTTGGACAAACAAATTACTCTTCTAATCCCCATGGTGGTGGACATAGTCAGCCAAATAGTAGTTTTGTAAAGAATAATTGGCGGCAAATTCACTT
CGACCCCATACTCATGTCATATACTGATTTGTTGCCACAACTACTGAAGAGTCATCAGGTTGCTTTGGTGCCTCAAGAGCCTTTACAACCACACTACCCCAAGTGGTATG
ACCCTAGTGCAAAATGTGAGTATCATGCTGGAGCAGTGGGGCATTCTACGAAAAATTGTTTTCCATTGAAAGCAAAGGTACAAAGTTTGGTAAATGCTGGCTGGTTGAAA
TTTCAGAAGACTGGGAAGGAGCCAAATGTCAATCAAAACCCTCTCCCAAACCATGAAAATCCTGTCATAAACGTCGTTGTTACATTAGTAAAGCATTACAAGAGTGATGT
ATATAAGGTAACTACATCAATGAGGATCCTCTTTCAGATTCTTCATGAAGCTGGATATATACCACTAAGCGCTGATGATGGCAATGTAGAGAGAAAGGAGAGTATTAACA
AGAAGACATGTTTATTCCATTTGGGGACATGTGAGCAGTCCATAGAAACTTGTTCTGAGTTTAGGTTTGAAGTGCAAAAATTGATGGACGCAAAAATTTTAATAGTGAGT
CAGACGAACATACAAGAGGTTGAGATTGATGTGATTTTTGATGCGACATCATCTGTACAAGAACCATTAATCATCCAATATGAGGGAAAGCCGGATAGCACAACTTGCAT
TCAGATGCCTAAGACAATAACTGTAGAAGTACTAGGTCCTTTTCCTTATAAGAACAATCAAGTTGTACCATGGAAATATGAATGCCAATTGATAACAAATAATGTTGCCT
CTGTAACAACGGGAGGGATGACTCGTAGTGGAAGATGTTATACACCATATGTTTTAAAAGATATCTCCAAAGAGGATGAAGTTCGACGACGTAAGGGCAAAGCTATGGAG
ATGATAGGTGGGGATGATTTAAGTGATCTAAGCAAGGTTTTTACTGCGAAAACCACATTAGCTGAGAAAAAGATAAATCAGGAAGTTGTCTCCAAAAAGGAGTCACTTGA
ATTTTTGAAATTGATTAAGCGGAGCGAGTACAAAGTGATTGAGCAATTACATCGTACCCCAGCTCGTATATCGATTTTATCATTATTCATGTACTCTGAGCCACACCGTA
AGATGAAGAAATCCCCCCCTGAAGGTACTGGACATACTAAGGCATTGCATATATTCGTGAAGTGTAAGGATCATCATGTGGCGAGGGTCCTAGTGGATAATAGATCATCT
TTAAATATAATGTCAAGCTCCACTTTGATGAAACTTCCCAAAGATCCATCGTACCTAAGACCAAGTACTATGGTCGTTAAAGCCTTCGACGGTGCTCATAGGGAAGTAAT
CGGAGATATAGATGTCCCTTTCAAAATTGAACCTTCCACTTTCAATGTTCCATTTCAGATTCATTCAGCAGGGGCAGTTCTGTCTTCATTGCATCAAAAATTGAAGTTTA
GTGTAGAAGGTGGTCAAGCTATTGTCTACGGAGAAGAGGACATGTTTGTAACAAAAACATCAGTACTTCCTTATGTTGAAGCAACAGAAGAAGCTCTGAAATGTTCTTAC
AGATCATTCGAAATTGCTAATGCTACTATCTTTCCAACTGAAGCCAATGAGAGGTTCGGGTTGGGCTTTAAGCCAACGGCTTCTGATTGGGAGAAGACATTTAGACTTGG
AGAAAGGTTATTCAATACAGACCAAAGAAAGGGACGTAACGAAGATTCTGAGACCCCAATTGCTGTTATCTCTGAGAATGGATCACAAAAGGGCATCAGGGTTGAAAGCG
ACATCAGTGACGTTGTTGATTTTGAGGTTCCAATATGTAATCTTGAGCAAAATATTGAAAGCGACGAATCTGATGTATCTCTTGAATTACTAAGAATGATAGAGCAAGAG
GAAAAGAAAACTTTACCATATCAAGAGACTTTGAAAGTTATCAATTTGGGAGCACGAGAAGAAGTGAAGAAAGTACAAATTGGCACTTTGCCTTCAAAGCAGGATCAATC
AGATTTGGTAACCCTACTTCACGAGTTCAAAGATATATTTGCATGGTCCTACCAAGATATGCCCAATTTGGATACAGAAATTGTAACTCATCGATTGCCTCGTAAACCAG
AATGTAAGCCTGTACGACAAAAGCTTCGTAAATTGAAACCTGAGATGTTGATTAAAATTAAAGAAGAGGTTAAGAAGCAGTTTGATGTTGGTTTCTTAGTAGTGGCAAAA
TATCCAGATTGGGTAGCAAATATAGTCTCAATCCCAAAGAAATATGGAAAGAGAGCAATGGTTACACTATTCCATTACTTAATGCATAAAGAAATTGAAGTTTATGTTGA
TGATATGATTGCCAAGTCTAGACCTGAAGAAAAGCATGTGGTTACCCTTCGAAGGCATCAAGGTGGATCCAGACAAATTAAGGTGATAGTAGGCTTAAAGCCACCAAAAA
CACAAAATGAGGTTAGAAGTTTTCTAGGGAGGTTGAATTACATTGCACGATTCATTTCACACCTCACTCAAACTTGCGAACCAATTTTAAAACTCCTTCGCAAAAGTGAG
ATTTGTCACTGGAATGAAGATTGTCAAAGAGCTTTTGATAAAATCAAAGACTACTTGCAAAGTCCTCCTATCCTTGTCCTGCCAACTCCAGGACGACCATTAATCTTATA
CTTGACAGTGAAAGATGGGTCAATGGGATGTGTGCTAGGGCAACATGATTCTACTAGAAGGAAGAAACGAGCTGTCTATTATTTAAGCAAGAAGTTCACGAATTATGAAT
CAAAGTACTCACTGTTGGAAAAGACGTGTTGCGCCCTGACATGGACAACTTAG
Protein sequenceShow/hide protein sequence
MSSPLVGQTNYSSNPHGGGHSQPNSSFVKNNWRQIHFDPILMSYTDLLPQLLKSHQVALVPQEPLQPHYPKWYDPSAKCEYHAGAVGHSTKNCFPLKAKVQSLVNAGWLK
FQKTGKEPNVNQNPLPNHENPVINVVVTLVKHYKSDVYKVTTSMRILFQILHEAGYIPLSADDGNVERKESINKKTCLFHLGTCEQSIETCSEFRFEVQKLMDAKILIVS
QTNIQEVEIDVIFDATSSVQEPLIIQYEGKPDSTTCIQMPKTITVEVLGPFPYKNNQVVPWKYECQLITNNVASVTTGGMTRSGRCYTPYVLKDISKEDEVRRRKGKAME
MIGGDDLSDLSKVFTAKTTLAEKKINQEVVSKKESLEFLKLIKRSEYKVIEQLHRTPARISILSLFMYSEPHRKMKKSPPEGTGHTKALHIFVKCKDHHVARVLVDNRSS
LNIMSSSTLMKLPKDPSYLRPSTMVVKAFDGAHREVIGDIDVPFKIEPSTFNVPFQIHSAGAVLSSLHQKLKFSVEGGQAIVYGEEDMFVTKTSVLPYVEATEEALKCSY
RSFEIANATIFPTEANERFGLGFKPTASDWEKTFRLGERLFNTDQRKGRNEDSETPIAVISENGSQKGIRVESDISDVVDFEVPICNLEQNIESDESDVSLELLRMIEQE
EKKTLPYQETLKVINLGAREEVKKVQIGTLPSKQDQSDLVTLLHEFKDIFAWSYQDMPNLDTEIVTHRLPRKPECKPVRQKLRKLKPEMLIKIKEEVKKQFDVGFLVVAK
YPDWVANIVSIPKKYGKRAMVTLFHYLMHKEIEVYVDDMIAKSRPEEKHVVTLRRHQGGSRQIKVIVGLKPPKTQNEVRSFLGRLNYIARFISHLTQTCEPILKLLRKSE
ICHWNEDCQRAFDKIKDYLQSPPILVLPTPGRPLILYLTVKDGSMGCVLGQHDSTRRKKRAVYYLSKKFTNYESKYSLLEKTCCALTWTT