; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035746 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035746
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:29480758..29489131
RNA-Seq ExpressionLag0035746
SyntenyLag0035746
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]3.1e-19935.97Show/hide
Query:  SLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIK-EADWWWRFTGFYGSPEQSKRKESWRLLESRG-----RWLVFQPW--VVEVEEEAE
        S  F N   VD  G  GGLAL W +   +++ S+S   ID  ++ ++   WR TG YG  E S++  +W LL+         W     +  ++   E+  
Subjt:  SLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIK-EADWWWRFTGFYGSPEQSKRKESWRLLESRG-----RWLVFQPW--VVEVEEEAE

Query:  DVAHVTKLRSVL-ESV-----------DHK-------------EESLAVMI-------ELQLIMPVLLQNLMIDRA----VTNDLSLQL-YRENL-----
           H + + S   ES+           +HK             EE+L  +          Q +    L N + D         D   +L Y++N      
Subjt:  DVAHVTKLRSVL-ESV-----------DHK-------------EESLAVMI-------ELQLIMPVLLQNLMIDRA----VTNDLSLQL-YRENL-----

Query:  ---------------PGGAAFNDEMDILEKELEE-----ELGQKSVEGVVFTWCKNKRTKEATNERIDRFFVNSKMEQKASTM-KVDQNFHHSDHRPILL
                        GG   ND    +  E +E      L     +G  FTW   +       ER+DR   +         +  +      SDH PI+ 
Subjt:  ---------------PGGAAFNDEMDILEKELEE-----ELGQKSVEGVVFTWCKNKRTKEATNERIDRFFVNSKMEQKASTM-KVDQNFHHSDHRPILL

Query:  EIE---RNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTA----TGEITNVNFSEKIQEGLTTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDA
        E++   +  H K+    R    E+ W   E      +  W +    + E     F    +  L  + +W++E  +G  K   Q + I+   +       A
Subjt:  EIE---RNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTA----TGEITNVNFSEKIQEGLTTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDA

Query:  FD--SLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVE
         D   + + E ++  +L +EE YWK RSR DW K GDKNTK FHS+A  R+RKN+I G+ D  G WV++ + I      FFQ L +S++PS   + E ++
Subjt:  FD--SLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVE

Query:  AIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLC
         +  ++S++    L+  FT E+I + +  M PTKAPGPDG  A  +Q  W   GE   K CL ILN   +++ LN T IALIPKV  P+++ EFRPISLC
Subjt:  AIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLC

Query:  NVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCI
        NV Y+IVAKA+ANR+K +LN IISPNQSAF+P R I+DNVI+G+EC+H +   +  + G VA+KLD+SKAYDRVEW F+E+ M  +  S KW  +I++CI
Subjt:  NVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCI

Query:  STISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEAS
        +T  +S+LING P    +P RG+RQG PLSPYLF+LCAE FS LL + E    + G +  +    +THL FADDSL+F KA+  D   LK +   Y +AS
Subjt:  STISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEAS

Query:  GQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGKEVLIKDVAQ-----CSASLTL
        GQ  N EKS+   S     E     + I  +        YLG+P  +G+NK + F  +K +V   +  W  KLFS GGKE+LIK VAQ       +   L
Subjt:  GQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGKEVLIKDVAQ-----CSASLTL

Query:  PNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMS
        P  LC D+ +  ARFWWG  +DK  +HW  W  M   K +GG+GFR + +FNQA++AKQ WRL+R PN+L+ + +K RY+K+  F  A +G++PS  W S
Subjt:  PNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMS

Query:  ICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGI
        I WG ++ KKG RWR+G+G  + + KD WI R  +  P+       +  V  L+D  N W+   +++ F+ +D EAIL +       +DE++WH DKKG 
Subjt:  ICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGI

Query:  FKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR
        + VKS Y LA++     E   SNS      WK  W L++  + KI  W+ + +I+P+  N+ K+R
Subjt:  FKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR

XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia]3.6e-19535.45Show/hide
Query:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEA--DWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDV
        +L + L F +C+ VDS+G+SGGLAL+W+    IS+ +YS+  I   +KE      W  TGFYG PE +KR  SW LL+                      
Subjt:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEA--DWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDV

Query:  AHVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMD-ILEKELEEELGQKSVEGVVFTWCKNKRTKEAT
                                       + L+N +      +   +    E   GG     +M+   E  +E  L Q    G  FTW  N+R ++ T
Subjt:  AHVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMD-ILEKELEEELGQKSVEGVVFTWCKNKRTKEAT

Query:  NERIDRFFVN----SKMEQKASTMKVDQNFHHSDHRPILLEIERNQ--HQKRTFSKRKIKLEESWIEMEGSRMAFKEAWT--ATGEITNVNFSEKIQEGL
         E++DR   N    ++       +    N   SDH P+L+ +   +  +Q + F     + E +W          K AW   A G    V  + ++++  
Subjt:  NERIDRFFVN----SKMEQKASTMKVDQNFHHSDHRPILLEIERNQ--HQKRTFSKRKIKLEESWIEMEGSRMAFKEAWT--ATGEITNVNFSEKIQEGL

Query:  TTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGK
          +  WN      T K AI  K   +  ++ SN  +  +++ Q ++ELE  L E++  WK R+++ W ++GDKNTK FH  A QR++ N+I  +LD    
Subjt:  TTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGK

Query:  WVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGIL
         + E++EIGN+   FF  L +S++P    ++  +E +  ++S      L   FT EE++  V  MK   +PGPDG  A  YQ+ WD  G+   +  L IL
Subjt:  WVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGIL

Query:  NNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKL
        N+  S+E +N+T I LIPKV  PKR+ ++RPISLCNV YKIVAK ++NRMK VL  IIS NQSAFVPGR IS+N++V +E +H + +R K K+G++A+KL
Subjt:  NNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKL

Query:  DMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPK
        D+SKAYDRVEW+F+  +M ++   + W  I+LNCIS+ SYSIL+NG  Q  F+PSRG+RQGDPLSPYLF+LC E  SA++++ + + ++S   + K   +
Subjt:  DMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPK

Query:  LTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKL
        ++HLFFADDSLIF K+   +  +L ++L  YE ASGQ +N EKS+   S+N   E ++    I G+  + S   YLG+PS +G+ K+A F  + ++    
Subjt:  LTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKL

Query:  LQGWKEKLFSLGGKEVLIKDVAQCSAS-----LTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIR
        +  WK    S  GKE+L+K V Q   +       LP ++   LD+LC +FWWG+T  + ++ W+ W  +   K +GG+GFR  ++FN A+LAKQ W ++ 
Subjt:  LQGWKEKLFSLGGKEVLIKDVAQCSAS-----LTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIR

Query:  NPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLI
        NPN+L  + LK +YF   N LEA +G+ PS  W S+  G EL K+G  WR+GNG+ +++ +  W+ +   L   +  D+  ++     + D   W   L+
Subjt:  NPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLI

Query:  KECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR
        +  F  Q+ E I ++     N +D++ W   + G+F V+SAYHL  +I  +  A  S     +S+WKSLW L+V    ++  W+  N+ +P+ AN+ +++
Subjt:  KECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]6.3e-20838.45Show/hide
Query:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKE--ADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDV
        +++  +GF+N + V   G SGG+ALLW  +  + V SY++  ID  I E   D+ WR TGFYG PE  KR +SW LL          PW+          
Subjt:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKE--ADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDV

Query:  AHVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATN
                                 L     +L  N     A  +   +  +R+ +     F+D            LG     G  +TW   +  +    
Subjt:  AHVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATN

Query:  ERIDRFFVNSKMEQKASTMKVDQNFHH-----SDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEI-TNVNFSEKIQEGLTTM
         R+DR         K   MKV    HH      DH  +L+     +H+ R    ++   E  W + E  +   + +W    ++ T    SE ++     +
Subjt:  ERIDRFFVNSKMEQKASTMKVDQNFHH-----SDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEI-TNVNFSEKIQEGLTTM

Query:  HLWNRERLKGTLKGAIQRKEIEISHLEASN-SPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWV
          W+   + G +   IQ K   ++ L       D    + +   E+ ALL++EE YW  R++  W K GD+NTK FH++A +R+++N I GI D  G+W 
Subjt:  HLWNRERLKGTLKGAIQRKEIEISHLEASN-SPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWV

Query:  EEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNN
        + E+ I   A  +F ++ SS+HPS   ++EV EAI  +++E+    L   FT+EE+   +K + P KAPGPDG  A  +Q  W   G N   + L +LN+
Subjt:  EEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNN

Query:  NESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDM
        N  I  LNKT+I+LIPK N+PKRM++FRPISLCNV YK+++K LANR+K +L  IIS NQSAF   R I+DNV+V FE +H L+ +   K G++AIKLDM
Subjt:  NESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDM

Query:  SKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLT
        SKA+DRVEW F+ ++M++M    +W  +++ CI+++SYSILING+      PSRG+RQGDPLSP LFLLCAEG SAL+ +      ++G  IN+ CPK+T
Subjt:  SKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLT

Query:  HLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQ
        HLFFADDS++F KAA ++   L+ +L  YEEASGQ IN +KS+   S N  +E ++E   ILG  +      YLG+PS IG++KS  F+ +K +V   L 
Subjt:  HLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQ

Query:  GWKEKLFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNP
        GWK KL S+GGKE+LIK VAQ   + T     LP  LC D++R+   FWWG    + +M W+SW+RMC +K  GG+GFR + AFN AMLAKQ+WR++ NP
Subjt:  GWKEKLFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNP

Query:  NNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLY-VCDNIKDKNVNWLLD-DNNNWKETLI
        N+L+ + LK RYF  G+ L A +G+SPS +W SI    E+ ++G RWRVGNG  I + +D W+    +   +     N +   V+ L+D D   WK   +
Subjt:  NNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLY-VCDNIKDKNVNWLLD-DNNNWKETLI

Query:  KECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQ-SNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKK
        +  F+  + E IL +       +D++IW  +KKG F VKSAYH+A  I    E  + SN D     WK LW L +  + KI  W+   D +P+  NI K+
Subjt:  KECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQ-SNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKK

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]6.5e-19740.44Show/hide
Query:  KEATNERIDRFFVNSKMEQKAST-----MKVDQNFHH-----SDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNV-NFS
        K A N RI    V  ++++  +T        D   HH     SDH  +L+  +    QK   ++R+ + E  W   E  +   +  W ++ E+ +    +
Subjt:  KEATNERIDRFFVNSKMEQKAST-----MKVDQNFHH-----SDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNV-NFS

Query:  EKIQEGLTTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDAF-DSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIK
         +++     +  WN+  + G +   IQ K+  ++ L  S+   +    +    +E+  LL+ EE  W+ RSR  W   GD+NTK FH++A  R+R+N I 
Subjt:  EKIQEGLTTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDAF-DSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIK

Query:  GILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENT
        GI+D  G W +  + I  +A  +FQ++ SS+ P+   + EV++AI T ++E+    L   FTREEIE  +  M PTKAPGPDG  A  +Q  W+  G + 
Subjt:  GILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENT

Query:  IKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEK
        + + L +LN+N S+  +NKT+I L+PK+ +P +MS+FRPISLCNV YK+++K LANR+K +L  IIS NQSAF+ GR I+DNV+V FE +H L  +++ K
Subjt:  IKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEK

Query:  TGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGF
         G+ AIKLDMSKAYDRVEW F++++M+KM   EKW +++++CI+++SYSIL+NG       P+RG+RQGDP+SPY+FLLCA+GFS+LL        +SG 
Subjt:  TGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGF

Query:  QINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSR
         I + CPK+THLFFADDSL+F KA  ++  +L  +L+ YE+ASGQ IN++KS+   S N   E + E  ++LG  +      YLG+PS IGK+K   F+ 
Subjt:  QINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSR

Query:  IKNRVEKLLQGWKEKLFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLA
        +K RVE+ L GWKEKL S+GG+E+LIK VAQ   + T     +P  LC +++ +  RFWWG    + ++ W+SW+++C  K+ GGMGFR + AFN AMLA
Subjt:  IKNRVEKLLQGWKEKLFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLA

Query:  KQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKD-KNVNWLLD-
        KQ WRLI NPN+L+ +  K RY+  G+  +A +G SPS TW SI  G E+ ++G RWRVGNG  I + +D W+    +   +       D   V+ L+D 
Subjt:  KQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKD-KNVNWLLD-

Query:  DNNNWKETLIKECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNS-DKEASSWKSLWNLEVLPRTKICTWKIINDI
        +   WK+ ++++ F+  +A  IL++       +D+IIW  ++KG F VKSAY++A+ +   +E  +S+S D  +  W+ LW+L + P+ +I  WK+  + 
Subjt:  DNNNWKETLIKECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNS-DKEASSWKSLWNLEVLPRTKICTWKIINDI

Query:  VPSKANILKK
        +P+  N+L+K
Subjt:  VPSKANILKK

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]3.7e-20037.82Show/hide
Query:  ASGGLALLWQNQPQISVNSYSKGQIDVTIKEADWW-WRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAHVTKLRSVLESVDHKEES
        A GGLA LW+N   + V +++   +   + E D + W  TGFYG P   +++ SWRLL+    ++V  PWVV           +    + L    H  E 
Subjt:  ASGGLALLWQNQPQISVNSYSKGQIDVTIKEADWW-WRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAHVTKLRSVLESVDHKEES

Query:  LAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEA-TNERIDRFFVNSKMEQKAST
        ++         P   Q             ++ +RE L                   +L     +G  +TW  NKR  EA T  R+DR   N +   +   
Subjt:  LAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEA-TNERIDRFFVNSKMEQKAST

Query:  MK-VDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVNFS---EKIQEGLTTMHLWNRERLKGTLKGAIQRKE
         + V  + H SDH P+LL ++ +  Q R    R  K EESW+  +      +EAW   G+      +   EKI+     +  W    +     GAI+  +
Subjt:  MK-VDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVNFS---EKIQEGLTTMHLWNRERLKGTLKGAIQRKE

Query:  IEISHLEASNSPDAFDS-LGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSS
         ++  L  +   +A  +      ++++ LL+++E YW  RSR +W + GD+NTK FH++A QR+RKN I+GI +  G+WVE  +E+G +A  +F +L  +
Subjt:  IEISHLEASNSPDAFDS-LGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSS

Query:  NHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNS
           + + ++E ++A+DT+++ED +  L + FT EE++  +  M PTKAPGPDG +A  YQ  W   G++ +   L  LNN   +  +N T+I LIPKV +
Subjt:  NHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNS

Query:  PKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMN
        P+RMSEFRPISLCNV YKI++K LANR+K+VL  IIS  QSAFVPGR I+DNV+V +E +H +++R+K K G VA+KLD+SKAYDRVEW F++ IM+KM 
Subjt:  PKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMN

Query:  LSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLL
            W   +++C++T S+SIL+NG P E  +PSRGIRQGDP+SPYLFLLCAEG +ALL + E    ++G  I +  PK+T+L FADDSL+F +A   +  
Subjt:  LSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLL

Query:  SLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGKEVLIKDVA
        ++ ++L+ YE ASGQ+INLEKS+   S N     K +  +ILG+   +    YLG+P+ IG+ K  TFS +K+RV K LQGWK  L S  GKE+LIK VA
Subjt:  SLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGKEVLIKDVA

Query:  QCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLE
        Q   + T     +P  LCS+L+ LCARFWWG   ++ ++HW SW ++   K++GGMGFR + AFN AMLAKQ WRL++  ++LLY+  K RYF   +FLE
Subjt:  QCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLE

Query:  AHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLL-----DDNNNWKETLIKECFIAQDAEAILNMYT
        A    + S  W S+   + + + G+ WRVGNG  I   KD W+    +     V ++++      L+      + N W    I+  F   +AEAI  +  
Subjt:  AHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLL-----DDNNNWKETLIKECFIAQDAEAILNMYT

Query:  GDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASS-WKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR
          ++  D I W    +G+F VKSAYH+A  I +      ++    A + W ++W L +  + K+  W+  ++I+P+  N+  ++
Subjt:  GDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASS-WKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR

TrEMBL top hitse value%identityAlignment
A0A2N9FN80 Uncharacterized protein1.5e-19936.45Show/hide
Query:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEA-DWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVA
        +LR  L F N    +S+   GGL LLW+    I V+S+S   ID  + EA D  WRFTGFYG+PE  KR+ESW LL          PW         D  
Subjt:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEA-DWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVA

Query:  HVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNE
         + +L         K++                      R   ++  +Q++R             D+L++    +LG     G  FTW  N R  + T E
Subjt:  HVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNE

Query:  RIDRFFVNSKMEQKASTMKVDQ-NFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAW-TATGEITNVNFSEKIQEGLTTMHLWNR
        R+D+     +      + +V   +   SDH+P+ +  E         S++  K EE W   +G       AW      +   +   KI      + +W++
Subjt:  RIDRFFVNSKMEQKASTMKVDQ-NFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAW-TATGEITNVNFSEKIQEGLTTMHLWNR

Query:  ERLKGTLKGAIQRKEIEISHLE-ASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKE
        +   G +K  IQ  E  +   E  S        +   +REL +LL +EE  W+ RSR +W K+GD+NT+ FH RA QR+R+N +  +  P G W     +
Subjt:  ERLKGTLKGAIQRKEIEISHLE-ASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKE

Query:  IGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIE
        +  +   ++ SL  + +P    +++VVE I   ++E     L   FT  E+   +K M P KAPGPDG     Y   W   G+   K  L  LN  + ++
Subjt:  IGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIE

Query:  PLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYD
          N T+I LIPK+ +P+ + +FRPISLCNV YK+++K LANR+K +L  I+S +QSAFVPGR I+DN++V FE +H +  +R  +   +A+KLDMSKAYD
Subjt:  PLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYD

Query:  RVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFA
        +VEW++++ +M++M  + KW  +++ CIS++SYSIL+NG P    +PSRG+RQGDPLSPYLFLLCAEGF +LL++E+   +L G  I++  PK+THLFFA
Subjt:  RVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFA

Query:  DDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEK
        DDSL+F +A   D+  ++ +L  YE ASGQ IN +K+    S++    AK   + +LG+   +    YLG+PS IG+ K ++F++IK RV   L+GWKEK
Subjt:  DDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEK

Query:  LFSLGGKEVLIKDVAQ-----CSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLY
        L S  G+E+LIK VAQ       +   LPN L  +++ L  RFWWG   DK +MHW+ W  +C +K  GG+GFR++ +FN+A+LAKQ WRL+ N ++L Y
Subjt:  LFSLGGKEVLIKDVAQ-----CSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLY

Query:  KTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYV--CDNIKDKNVNWLLD-DNNNWKETLIKECF
        K  K +YF   + LEA + ++ S  W SI   ++L KKG  WRVG+G+ IQ+  D W+      H +    C N     V  L+D  +  WKE LI+E F
Subjt:  KTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYV--CDNIKDKNVNWLLD-DNNNWKETLIKECF

Query:  IAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANI
        +  DA AIL +    ++  D ++W   K G++ V+S YH  M  K+Q  A  S++ + +  W ++W+L++ P+ +   W+  ++ +P++ N+
Subjt:  IAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANI

A0A7N2LIH6 Uncharacterized protein1.0e-20037.39Show/hide
Query:  RRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEADWW--WRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAH
        +  LGF+  I V S G SGGLALLW+    I   S S   IDV +  A     WR TGFYG P+  KR  SW+LLE         PW+  V  +  ++ H
Subjt:  RRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEADWW--WRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAH

Query:  VTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNER
                E +  K+   A M                            +RE             +L K    +LG     G  FTWC  +   + T  R
Subjt:  VTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNER

Query:  IDRFFVNSKMEQKASTMKVDQNFHH-----SDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVNFSEKIQEGLTTMHLW
        +DR   N    +  S M  +   HH     SDH  + L + +  +Q+R   K++   EE W  +E  +   + AW    E + +   E+++     +  W
Subjt:  IDRFFVNSKMEQKASTMKVDQNFHH-----SDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVNFSEKIQEGLTTMHLW

Query:  NRERLKGTLKGAIQRKEIEISHLEASN-SPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEE
        N+       KG I++K+  +  LE+ N   +  + +   ++E+  L   EE  WK RSR  W + GDKN+K FH+ A QR++KN I G++D  G W E++
Subjt:  NRERLKGTLKGAIQRKEIEISHLEASN-SPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEE

Query:  KEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNES
        +    +   +F+ + SSN P++      +EA+D +++ +    L   F   E+ Q ++ M PTKAPGPDG     YQ  WD  G +     L  LN+   
Subjt:  KEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNES

Query:  IEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKA
         + +NKT+I LIPK  +P++++EFRPISLCNV YKI++K LANR+K+VL+ +I   QSAFVPGR I+DNVIV FE +H++N RRK K G +AIKLDMSKA
Subjt:  IEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKA

Query:  YDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLF
        YDRVEW ++E +MKKM   ++W  +I+ C++++S+S+LING P+  F PSRG+RQGDP+SPYLFLLC EG SA++K++E    + G    +  P+++HLF
Subjt:  YDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLF

Query:  FADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWK
        FADDS+IF +A   +   + KVL+ YEE SGQ +N +K++   SRN   E KE  + I G    +    YLG+P  IG+ K   F+RIK++V + + GWK
Subjt:  FADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWK

Query:  EKLFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNL
         KL S  G+EVLIK VAQ + + T     LP++LC++L+ +   FWWG    + +M W+SW+ +C  K  GGMGF+ + AFN A+LAKQ WRL +NPN+L
Subjt:  EKLFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNL

Query:  LYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYV-CDNIKDKNVNWLL-DDNNNWKETLIKEC
         ++ LK +YF + +F+EA +G  PS  W SI   K + K+G RW VG+G  I++    W+    S   +     +++ + V  L+  +   WK TL+++ 
Subjt:  LYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYV-CDNIKDKNVNWLL-DDNNNWKETLIKEC

Query:  FIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDI---KSQMEASQSNSDKEASS--WKSLWNLEVLPRTKICTWKIINDIVPSKANILK
        FI  +AE IL++     N  D ++W     G F VKSAY  A        + EA+   SDK   S  WK++W L+   + K   W+    I+P+K  ++ 
Subjt:  FIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDI---KSQMEASQSNSDKEASS--WKSLWNLEVLPRTKICTWKIINDIVPSKANILK

Query:  KR
        ++
Subjt:  KR

A0A803PBM9 Uncharacterized protein6.1e-20137.45Show/hide
Query:  LRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTI-KEADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAH
        LR SLG+  C  V++ G SGGL LLW N    ++ S+S   ID  I KE   WWRFTGFYG P+ ++R ESW+LL   GR +   PWV+  +        
Subjt:  LRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTI-KEADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAH

Query:  VTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNER
                E + +KE                                +L  +  PG    N  M +    L E       EG  +TWC N R  E   ER
Subjt:  VTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNER

Query:  IDRFFVNSKMEQKASTMKVDQ-NFHHSDHRPILLE--IERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEI-TNVNFSEKIQEGLTTMHLWN
        +DR   N +        KV+  +   SDH P+LL   ++ ++++K      +   E +W + E      KE+W   G   T +   +K+      +  WN
Subjt:  IDRFFVNSKMEQKASTMKVDQ-NFHHSDHRPILLE--IERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEI-TNVNFSEKIQEGLTTMHLWN

Query:  RERLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKE
        + R K  +K  ++  E +I+ L  S +   +  L   E++   LL++EE +W+ RSR  W K GD+NTK FH +A  RKRKN I G+LD   KWV   K 
Subjt:  RERLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKE

Query:  IGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIE
        +G +A ++FQ + +SN  S   L+E    +  +IS +    L + FT+E++   ++N+ P KAPG DG     Y+  W   GE   KVCLGILN    + 
Subjt:  IGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIE

Query:  PLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYD
         +N T I LIPK+  P RM+ FRPISLCNV YKIVAK LA R K  L   IS  QSAFV GR I DN I+GFE +H +  RR      +A+KLDMSKAYD
Subjt:  PLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYD

Query:  RVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFA
        RVEW F+  +M+ +   E W   I+ C++++S+S+LING     F P+RG+RQGD LSPYLFL+C+EG S L++  E   N++G +  K   K++HLFFA
Subjt:  RVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFA

Query:  DDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEK
        DDS +F++  E +  ++  +L+ Y   SGQ INLEKS       +  +        LG++       YLG+PS IG+ K   F  IK++V   L+ WK  
Subjt:  DDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKEK

Query:  LFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLY
        +FS  GKE+LIK V Q   S +     LP  L   L  L A FWWG T++  ++HW +W ++C  KE+GG+GFR +  FNQA+LAKQ WRLI +P++LL 
Subjt:  LFSLGGKEVLIKDVAQCSASLT-----LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLY

Query:  KTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKECFIAQ
        + LK  Y+ + +FL+A      SS W  I WG+++  +G RWRVGNG  +++ +D WI R          D   +  ++ LL +  +WK  +I   F   
Subjt:  KTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKECFIAQ

Query:  DAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKK
        D   IL     D   +D I W L  KG + VKS Y +A ++ +      SN D+  + WK  W+L++ PR K+  WK+ ++ +P+K+N++ +
Subjt:  DAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKK

A0A803QC75 Uncharacterized protein1.8e-20537.57Show/hide
Query:  SLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIK-EADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAHVTK
        S  FSN + V   G SGG+ LLW ++  +++ +      D  +K +    + FTGFYG+P+ + +  SW L +  G      PW+            +  
Subjt:  SLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIK-EADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAHVTK

Query:  LRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEE-ELGQKSVEGVVFTWCKNKRTKEATNERID
           +L + D    SL                                RE+          MD     L++  L +    G  FTW KN+ T     ER+D
Subjt:  LRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEE-ELGQKSVEGVVFTWCKNKRTKEATNERID

Query:  RFFVNSKMEQKASTMKVDQ-NFHHSDHRPILLEIE-RNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTA---TGEITNVNFSEKIQEGLTTMHLWNR
          F+N+K         VD  ++  SDHR + +     +Q +    SK + + E+ W+    S     ++W +   +  IT V   + +    T +  W+ 
Subjt:  RFFVNSKMEQKASTMKVDQ-NFHHSDHRPILLEIE-RNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTA---TGEITNVNFSEKIQEGLTTMHLWNR

Query:  ERLKGTLKGAIQRKEIEISHLEASN--SPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEK
         +  G +K  I   + +++ L  SN  S D +  L  AE  LE LLE+EE YW+ RSR DW   GD+NTK FH++A  RK  N+IK + +  G+ V  + 
Subjt:  ERLKGTLKGAIQRKEIEISHLEASN--SPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEK

Query:  EIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESI
        +I  +   F+  L SS     E L   +  I T +S +    L   FT +E+   + +M P K+PG DG  A  YQ  W   G+   KV L ILN+N   
Subjt:  EIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESI

Query:  EPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAY
          LNKT I LIPKV  P+ + E+RPISLCNV  K+V K L +R K  L  +IS  QSAF+P R I+DNV+V FE +HA+ ++   + G  + KLDMSKA+
Subjt:  EPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAY

Query:  DRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFF
        DRVEW+F+EE+M+KM  +E+W  +I++C++T ++S +ING       PSRG+RQG PLSPYLFL+C+EGFS LL+ E+  +NL GF++ +H P +THLFF
Subjt:  DRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFF

Query:  ADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKE
        ADDSL+F +A E+  L++K+VL +Y +ASGQ +NL+KS    S N    A+    + L +   E    YLG+PS  G++K   FS IK R+ KL+  W E
Subjt:  ADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKLLQGWKE

Query:  KLFSLGGKEVLIKDVAQ-----CSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLL
        K+FS GGKE+L+K V Q       +   LP   C  ++ + A FWWG T +  R+HW SW  +C +K +GGMGFR    FNQA+LAKQ+WR+   P++LL
Subjt:  KLFSLGGKEVLIKDVAQ-----CSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLL

Query:  YKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKECFIA
         + LK RYF + NFLEA +G+SPS TW  I W +EL  KG RW+VG+G  IQ   DPWI    +  P +      +  V+ L+ D   W   L+K+ F A
Subjt:  YKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKECFIA

Query:  QDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR
         D E IL++     +S+D +IWH    G++ VKS YHLA D+    ++S SN D  +  WK  W+L++ P+ KI  W+ I+D +P   +++K++
Subjt:  QDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR

A0A803QH07 Uncharacterized protein2.7e-20436.55Show/hide
Query:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEADW-WWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVA
        + R  L FS+ + V   G SGGL LLW ++  +++N+++    D  +   +     FT FYG+P    R  SW LL+         PW+           
Subjt:  KLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEADW-WWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVA

Query:  HVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFND-EMDILEKELEE-ELGQKSVEGVVFTWCKNKRTKEAT
                                                 +  D +  LY  N  GG+   + +M+     L+   L +    G  FTW KN+   E  
Subjt:  HVTKLRSVLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFND-EMDILEKELEE-ELGQKSVEGVVFTWCKNKRTKEAT

Query:  NERIDRFFVNSKME---QKASTMKVDQNFHHSDHRPILLEI-ERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAW--TATGEITNVNFSEKIQEGLTT
         ER+D  F N       Q  ST  +D  F+ SDHR I + I   N HQ++T  K + + E+ W++ E +    K+ W   A+G+IT  +F   +      
Subjt:  NERIDRFFVNSKME---QKASTMKVDQNFHHSDHRPILLEI-ERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAW--TATGEITNVNFSEKIQEGLTT

Query:  MHLWNRERLKGTLKGAIQRKEIEISHLE--ASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGK
        +  W+  +  G +K  I + +  ++ L   A  +      L Q+E+ L+ LL  EE YW  RSR DW + GDKNTK FH++A  RK  N IK + +  G 
Subjt:  MHLWNRERLKGTLKGAIQRKEIEISHLE--ASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGK

Query:  WVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGIL
         V  ++ +  +   +++ L +S    ++ L+ V+ AI + I     + L + F+  E+   +++M P K+PG DG  A  YQN WD  G +  ++ LGIL
Subjt:  WVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGIL

Query:  NNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKL
        N+   +  LN + I LIPKV +P  M ++RPISLCNV YK+++K +  R ++VL  +IS  QSAF+  R I+DN++V FE IH L  + + + GY A+KL
Subjt:  NNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKL

Query:  DMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPK
        DMSKA+DRVEW ++E +M KM  + +W  +I++CI+T S+S  +NG      +P RG+RQGDPLSPYLFL+C+EG S LL+ EE   +L G ++ ++ P 
Subjt:  DMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPK

Query:  LTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKL
        ++HL FADDSL+F +A E+  L+LK+ L +Y +ASGQ +N +KS    S N    A+      L +  T+    YLG+PS  G++K   FS IK +V KL
Subjt:  LTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLGMPSRIGKNKSATFSRIKNRVEKL

Query:  LQGWKEKLFSLGGKEVLIKDVAQ-----CSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIR
        L  W EK+FS+GGKEVL+K V Q       +   L    C+ L+ + A FWWG  ++  ++HW  W+ +C +K +GGMGFR    FNQA+LAKQ+WR+  
Subjt:  LQGWKEKLFSLGGKEVLIKDVAQ-----CSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIR

Query:  NPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLI
         P++LL + LK RYF   +FL+A IG+SPS TW SICWG+EL  KG R++VGNG+ I  +KDPWI   +S  P+  C       V+ L++DN  W  TL+
Subjt:  NPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLI

Query:  KECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR
         + F   D E IL++      ++D +IWH    G + VKS +HLA  ++ Q+++S S+++++   WK  WNL + P+ +I  WK+I  I+P  A + K++
Subjt:  KECFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.4e-3823.7Show/hide
Query:  TNERIDRFFVNSKMEQKASTMKVDQNFHHSDHRPILLEIE-RNQHQKR--TFSKRKIKLEESWI--EMEGSRMAFKEAWTATGEITNVNFSEKIQEGLTT
        T  +ID    +  +  K    ++  N+  SDH  I LE+  +N  Q R  T+    + L + W+  EM+     F E        TN N      +  T 
Subjt:  TNERIDRFFVNSKMEQKASTMKVDQNFHHSDHRPILLEIE-RNQHQKR--TFSKRKIKLEESWI--EMEGSRMAFKEAWTATGEITNVNFSEKIQEGLTT

Query:  MHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPK--------SGDKNTKRFH-SRAW----------
         +LW+        K   + K I ++  +        D+L    +ELE   ++E+ + K   R++  K           K  ++ + SR+W          
Subjt:  MHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPK--------SGDKNTKRFH-SRAW----------

Query:  -------QRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDT----QISEDQKRRLDSSFTREEIEQVVKNMKPTKAPG
               +++ KN+I  I +  G    +  EI      +++ L ++     E L+E+   +DT    ++++++   L+   T  EI  ++ ++   K+PG
Subjt:  -------QRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDT----QISEDQKRRLDSSFTREEIEQVVKNMKPTKAPG

Query:  PDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSE-FRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQI
        PDG  A+ YQ   +      +K+   I           +  I LIPK        E FRPISL N+  KI+ K LANR+++ +  +I  +Q  F+PG Q 
Subjt:  PDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSE-FRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQI

Query:  SDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLL
          N+      I  +N R K+K  +V I +D  KA+D+++  F+ + + K+ +   + +II       + +I++NG   E F    G RQG PLSP LF +
Subjt:  SDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLL

Query:  CAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKS-AFMTSRNMGREAK--EECEKILGINR
          E  +  +++E+    + G Q+ K   KL+   FADD +++++       +L K++ ++ + SG  IN++KS AF+ + N   E++   E    +   R
Subjt:  CAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKS-AFMTSRNMGREAK--EECEKILGINR

Query:  TESLGTYL--GMPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGK------EVLIKDVAQCSA-SLTLPNALCSDLDRLCARFWWGATRDKDRMHWM
         + LG  L   +     +N       IK    K    WK    S  G+       +L K + + +A  + LP    ++L++   +F W   R +     +
Subjt:  TESLGTYL--GMPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGK------EVLIKDVAQCSA-SLTLPNALCSDLDRLCARFWWGATRDKDRMHWM

Query:  SWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDG--NFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKD
        S +      + GG+       + +A + K +W   +N +   +   +         N+L   I + P        WGK+     + W        +L  D
Subjt:  SWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDG--NFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKD

Query:  PWIRREESLHPLYVCD-NIKDKNVNWL
        P++     ++  ++ D N+K K +  L
Subjt:  PWIRREESLHPLYVCD-NIKDKNVNWL

P11369 LINE-1 retrotransposable element ORF2 protein8.1e-4126.44Show/hide
Query:  IKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDT-QISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAG
        I  I +  G    + +EI N    F++ L S+   + + + + ++     ++++DQ   L+S  + +EIE V+ ++   K+PGPDG  A+ YQ    T  
Subjt:  IKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDT-QISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAG

Query:  ENTIKVCLGILNNNESIEPLNKTH----IALIPK-VNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHA
        E+ I +   + +  E    L  +     I LIPK    P ++  FRPISL N+  KI+ K LANR++  + AII P+Q  F+PG Q   N+      IH 
Subjt:  ENTIKVCLGILNNNESIEPLNKTH----IALIPK-VNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHA

Query:  LNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREE
        +N + K+K  ++ I LD  KA+D+++  F+ +++++  +   +  +I    S    +I +NG   E      G RQG PLSPYLF +  E  +  +++++
Subjt:  LNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREE

Query:  TLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKS-AFMTSRNMGREAKEECEKILGINRTESLGTYLG--MPSR
            + G QI K   K++ L  ADD +++I   +     L  ++ S+ E  G  IN  KS AF+ ++N  ++A++E  +    +   +   YLG  +   
Subjt:  TLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKS-AFMTSRNMGREAKEECEKILGINRTESLGTYLG--MPSR

Query:  IGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGK------EVLIKDVAQCSA-SLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMG
        +       F  +K  +++ L+ WK+   S  G+       +L K + + +A  + +P    ++L+    +F W   + +     +  +R       GG+ 
Subjt:  IGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGK------EVLIKDVAQCSA-SLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMG

Query:  FRQIAAFNQAMLAKQSWRLIRN
           +  + +A++ K +W   R+
Subjt:  FRQIAAFNQAMLAKQSWRLIRN

P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-3324.11Show/hide
Query:  QKSVEGVVFTWCKNKRTKEATNERIDRFFVNSKMEQKASTMKVDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEI
        +++ E V FT+ +  R    +  RIDR +++S +  +A +  + +    SDH  + L +       +           S +E EG   + ++ W      
Subjt:  QKSVEGVVFTWCKNKRTKEATNERIDRFFVNSKMEQKASTMKVDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEI

Query:  TN--VNFSEKIQEGLTTMHLWNRERLK---GTLKGAIQRKEIEISHLE--ASNSPDAFDSLGQAERELEALLEEEEGYWKW---RSREDWPKSGDKNTKR
         +     ++    G   + L  +E  K   G     I+    E+  LE   S S D        ER+ EAL   E+   +    RSR       D+ ++ 
Subjt:  TN--VNFSEKIQEGLTTMHLWNRERLK---GTLKGAIQRKEIEISHLE--ASNSPDAFDSLGQAERELEALLEEEEGYWKW---RSREDWPKSGDKNTKR

Query:  FHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAH
        F++   ++  + +I  +    G  +E+ + I + A  F+Q+L S +  S +  +E+ + +   +SE +K RL++  T +E+ Q ++ M   K+PG DG  
Subjt:  FHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAH

Query:  AKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIV
         + +Q  WDT G +  +V        E      +  ++L+PK    + +  +RP+SL +  YKIVAKA++ R+K VL  +I P+QS  VPGR I DNV +
Subjt:  AKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAIISPNQSAFVPGRQISDNVIV

Query:  GFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFS
          + +H     R+       + LD  KA+DRV+ +++   ++  +   ++   +    ++    + IN          RG+RQG PLS  L+ L  E F 
Subjt:  GFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFS

Query:  ALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLG
         LL++      L+G  + +   ++    +ADD +I +     DL   ++  + Y  AS   IN  KS+ +   ++  +      +   I+    +  YLG
Subjt:  ALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLG

Query:  M-PSRIGKNKSATFSRIKNRVEKLLQGWK--EKLFSLGGKEVLIKDVAQCS-----ASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKE
        +  S      S  F  ++  V   L  WK   K+ S+ G+ ++I  +           L+      + + R    F W         HW+S        +
Subjt:  M-PSRIGKNKSATFSRIKNRVEKLLQGWK--EKLFSLGGKEVLIKDVAQCS-----ASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKE

Query:  KGGMG
        +GG G
Subjt:  KGGMG

P92555 Uncharacterized mitochondrial protein AtMg012501.7e-1451.47Show/hide
Query:  LINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDS
        +ING PQ    PSRG+RQGDPLSPYLF+LC E  S L +R +    L G +++ + P++ HL FADD+
Subjt:  LINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003106.6e-2739.72Show/hide
Query:  LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKE-KGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTW
        L   LC  L      FWW +  +K ++ W++W+++C +KE  GG+GFR +  FNQA+LAKQS+R+I  P+ LL + L+ RYF   + +E  +G  PS  W
Subjt:  LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKE-KGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTW

Query:  MSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPL
         SI  G+EL  +G    +G+G   ++  D WI  E  L PL
Subjt:  MSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.6e-1424.02Show/hide
Query:  GVVFTWCKNKRTKEATNERIDRFFVNSK-MEQKASTMKVDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVN
        GV +TW  N +       ++DR   N        S + V +    SDH P ++ +E    +    SK+  +            ++   AW     + +  
Subjt:  GVVFTWCKNKRTKEATNERIDRFFVNSK-MEQKASTMKVDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVN

Query:  FS--EKIQEGLTTMHLWNRE---RLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRK
        FS  E ++       L NR+    ++   K A+   E   S L  + S   F     A ++        E +++ +SR  W + GD NT+ FH      +
Subjt:  FS--EKIQEGLTTMHLWNRE---RLKGTLKGAIQRKEIEISHLEASNSPDAFDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRK

Query:  RKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHP--STEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNM
         KN IK +       VE   ++  +   ++  LL S+    + + ++ + +    + ++    RL +  + +EI   V  M   KAPGPD   A+ +   
Subjt:  RKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHP--STEVLKEVVEAIDTQISEDQKRRLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNM

Query:  WDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIV
        W    ++TI            ++  N T I LIPKV    ++S FRP+S C V YKI+
Subjt:  WDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.1e-1134.15Show/hide
Query:  LANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKW
        +  R+K ++  +I P Q++F+PGR  +DN++   E +H++  R+K   G++ +KLD+ KAYDR+ W ++E+ +      E W
Subjt:  LANRMKRVLNAIISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKW

AT4G29090.1 Ribonuclease H-like superfamily protein6.4e-3328.04Show/hide
Query:  ASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPS
        A   LP  +C  +  + A FWW   ++   MHW +W  +   K +GG+GF+ I AFN A+L KQ WR++  P +L+ K  K RYF   + L A +G+ PS
Subjt:  ASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPS

Query:  STWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWI--------RREESLHPLYVCDNIKDKNVNWLLDDN-NNWKETLIKECFIAQDAEAILNMYTGDKN
          W SI   +E+ ++G R  VGNG  I + +  W+         R + + P           V+ L+D++   W++ +I+  F   + + I  +  G + 
Subjt:  STWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWI--------RREESLHPLYVCDNIKDKNVNWLLDDN-NNWKETLIKECFIAQDAEAILNMYTGDKN

Query:  SKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASS-WKSLWNLEVLPRTKICTWKIINDIVP
          D   W     G + VKS Y +   I ++  + Q  S+   +  ++ +W  +  P+ +   WK +++ +P
Subjt:  SKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASS-WKSLWNLEVLPRTKICTWKIINDIVP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.7e-2839.72Show/hide
Query:  LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKE-KGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTW
        L   LC  L      FWW +  +K ++ W++W+++C +KE  GG+GFR +  FNQA+LAKQS+R+I  P+ LL + L+ RYF   + +E  +G  PS  W
Subjt:  LPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKE-KGGMGFRQIAAFNQAMLAKQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTW

Query:  MSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPL
         SI  G+EL  +G    +G+G   ++  D WI  E  L PL
Subjt:  MSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.2e-1551.47Show/hide
Query:  LINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDS
        +ING PQ    PSRG+RQGDPLSPYLF+LC E  S L +R +    L G +++ + P++ HL FADD+
Subjt:  LINGIPQEGFRPSRGIRQGDPLSPYLFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGGACAACCCTAGGGAGGATGAGGAAAAGACAAAAAAAACGGCTAGTTTGGAGGCAAGAAAACTAGTCGACCGAATGATTACTGATGAGGGTCCCAAGAACAC
AGTAACCAAGGAAGATCGTGGAAACGCAGAGCTAGAGAGGAACAGAAACCCAAGACGTTTGTGCAGGAAATGGGAAGCAAAGGAAAAAATGGGAAACACGACAGAGAAGA
GAGTGAAGAGGAAAGAGTTAACAAAAAACAATGTTATGAAAACTTCGGTAACACCGTTTGGATATCGGCGGAGGCTGCAGAGCAGCCCCGCCGAACGCCATGAAAATCCT
AAGTTGGAACATAAGCTCAGAAGGAGTTTGGGTTTCAGCAACTGTATTAATGTGGATAGTCAAGGTGCTAGTGGAGGTCTTGCTCTTCTTTGGCAAAATCAACCCCAAAT
ATCAGTGAATTCCTACTCAAAAGGCCAAATTGATGTCACTATCAAAGAGGCTGACTGGTGGTGGAGGTTCACGGGTTTCTATGGGAGCCCGGAGCAAAGCAAGAGGAAAG
AGTCTTGGAGACTTCTCGAAAGCCGTGGGCGGTGGTTGGTGTTTCAGCCGTGGGTGGTGGAAGTTGAGGAAGAGGCAGAAGATGTAGCTCATGTTACAAAGTTGCGATCA
GTTCTTGAGTCAGTTGATCATAAAGAAGAAAGTTTGGCAGTGATGATTGAACTTCAGCTTATAATGCCAGTTCTGCTTCAAAATTTGATGATTGATAGAGCTGTTACAAA
TGATCTGAGCCTTCAACTTTACAGGGAGAACCTACCTGGTGGAGCAGCATTTAATGATGAAATGGATATTCTCGAAAAGGAGCTCGAAGAGGAACTTGGCCAGAAGAGCG
TCGAAGGAGTTGTGTTTACTTGGTGTAAAAACAAAAGGACCAAAGAAGCTACAAATGAGAGAATTGATCGCTTTTTTGTCAATTCCAAAATGGAACAAAAGGCTTCGACC
ATGAAGGTGGACCAAAATTTCCACCATTCAGATCATCGTCCTATTCTCCTGGAGATCGAACGGAATCAACATCAAAAGAGGACCTTTTCTAAAAGAAAAATAAAGTTGGA
GGAAAGCTGGATAGAAATGGAAGGGAGCAGGATGGCTTTTAAAGAAGCTTGGACAGCCACGGGTGAAATCACAAATGTGAACTTTAGTGAGAAAATCCAAGAGGGGCTTA
CGACTATGCACTTGTGGAACAGAGAAAGATTGAAGGGTACCTTAAAGGGTGCTATTCAAAGAAAGGAAATCGAAATTAGCCATCTGGAAGCTTCAAACAGTCCTGATGCG
TTCGATAGTCTGGGTCAAGCGGAGAGAGAGCTGGAGGCCTTGCTTGAGGAGGAGGAGGGCTATTGGAAGTGGAGATCGAGAGAAGATTGGCCCAAGAGTGGGGACAAGAA
CACTAAGAGGTTCCATTCAAGAGCTTGGCAAAGGAAAAGGAAGAACGAAATAAAAGGCATCCTTGACCCTTTCGGTAAGTGGGTAGAGGAGGAGAAAGAAATTGGCAATA
TTGCGAACATCTTCTTCCAAAGCCTTCTCAGTTCCAACCATCCTTCTACGGAGGTTTTAAAGGAAGTTGTAGAGGCCATTGACACTCAAATATCAGAAGACCAAAAAAGA
AGGTTGGATTCCTCTTTCACGAGAGAAGAAATAGAGCAAGTTGTCAAGAATATGAAGCCTACAAAAGCTCCAGGTCCTGATGGAGCTCATGCTAAGCTTTATCAAAATAT
GTGGGACACGGCTGGGGAGAATACGATCAAGGTGTGCCTAGGTATCCTAAACAATAATGAGAGTATAGAGCCCCTAAACAAAACGCACATAGCTCTCATCCCTAAAGTTA
ATTCCCCAAAGAGGATGAGTGAGTTTAGGCCTATCAGCCTTTGTAATGTTACATACAAGATCGTTGCTAAGGCGTTAGCGAACAGAATGAAAAGGGTCCTAAATGCTATT
ATTTCTCCAAACCAATCAGCTTTTGTCCCTGGTAGACAAATTTCTGACAACGTCATTGTTGGGTTCGAATGCATTCACGCTTTGAATAGCAGAAGAAAAGAAAAGACGGG
GTACGTTGCTATCAAACTAGATATGAGCAAGGCGTACGACCGAGTCGAATGGAGATTTGTGGAAGAGATCATGAAGAAAATGAACTTAAGTGAGAAGTGGACCAGGATTA
TATTGAACTGCATTTCAACTATTAGCTACTCGATACTCATCAACGGGATTCCCCAAGAAGGTTTCAGGCCGTCGAGAGGAATCAGACAAGGGGACCCTTTATCCCCTTAT
CTATTCCTCTTGTGTGCGGAAGGATTCTCAGCTCTATTAAAAAGGGAGGAAACTCTTTCTAATCTTAGTGGTTTCCAAATTAATAAGCATTGCCCTAAGCTCACTCACTT
ATTTTTTGCAGATGACAGCCTCATTTTCATCAAAGCAGCAGAGAAGGATCTTCTCTCTCTCAAGAAAGTTCTGAAGAGCTACGAGGAAGCATCGGGCCAAACCATAAACT
TGGAGAAATCTGCCTTTATGACGAGCAGGAACATGGGCAGGGAAGCTAAAGAGGAGTGTGAAAAGATCCTGGGGATTAATAGAACTGAATCCCTAGGGACCTACCTAGGG
ATGCCATCCCGGATTGGCAAAAACAAGAGCGCAACTTTCTCCAGAATCAAAAATAGAGTTGAAAAGTTGCTCCAGGGGTGGAAAGAAAAGCTCTTCTCACTCGGGGGAAA
AGAAGTCCTAATCAAAGATGTGGCTCAATGTTCCGCCTCCCTAACGCTCCCTAACGCTCTATGTTCTGACCTTGATAGGTTATGTGCGAGATTCTGGTGGGGAGCTACAA
GAGATAAAGATAGGATGCATTGGATGAGCTGGAGAAGAATGTGCACCACTAAGGAGAAGGGCGGGATGGGGTTCAGACAAATTGCAGCTTTCAACCAAGCTATGTTGGCC
AAGCAAAGCTGGAGACTTATAAGGAATCCTAATAACCTCCTTTACAAAACCTTGAAGGGCAGATACTTTAAAGACGGGAACTTCCTCGAAGCCCATATAGGCAATTCTCC
GTCCTCGACGTGGATGAGTATCTGTTGGGGCAAAGAACTATTCAAAAAAGGTTTCAGATGGAGAGTAGGGAACGGGAACATGATCCAGCTAGCCAAAGACCCATGGATTC
GACGAGAAGAAAGTCTACATCCCCTCTATGTTTGCGATAACATAAAGGACAAGAATGTCAACTGGTTGCTAGATGACAACAATAACTGGAAGGAAACCCTCATTAAAGAA
TGTTTCATCGCCCAGGATGCCGAGGCTATTTTGAACATGTATACAGGGGACAAAAACTCAAAGGATGAAATTATCTGGCACTTAGACAAAAAAGGCATCTTTAAAGTCAA
AAGTGCGTATCATTTAGCTATGGACATCAAATCTCAAATGGAAGCATCCCAATCTAACAGTGACAAAGAAGCTTCGAGCTGGAAGAGCCTCTGGAACCTTGAAGTTCTTC
CTAGAACGAAAATTTGCACTTGGAAGATCATCAATGACATTGTCCCAAGCAAGGCCAATATCCTTAAAAAAAGGGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGGACAACCCTAGGGAGGATGAGGAAAAGACAAAAAAAACGGCTAGTTTGGAGGCAAGAAAACTAGTCGACCGAATGATTACTGATGAGGGTCCCAAGAACAC
AGTAACCAAGGAAGATCGTGGAAACGCAGAGCTAGAGAGGAACAGAAACCCAAGACGTTTGTGCAGGAAATGGGAAGCAAAGGAAAAAATGGGAAACACGACAGAGAAGA
GAGTGAAGAGGAAAGAGTTAACAAAAAACAATGTTATGAAAACTTCGGTAACACCGTTTGGATATCGGCGGAGGCTGCAGAGCAGCCCCGCCGAACGCCATGAAAATCCT
AAGTTGGAACATAAGCTCAGAAGGAGTTTGGGTTTCAGCAACTGTATTAATGTGGATAGTCAAGGTGCTAGTGGAGGTCTTGCTCTTCTTTGGCAAAATCAACCCCAAAT
ATCAGTGAATTCCTACTCAAAAGGCCAAATTGATGTCACTATCAAAGAGGCTGACTGGTGGTGGAGGTTCACGGGTTTCTATGGGAGCCCGGAGCAAAGCAAGAGGAAAG
AGTCTTGGAGACTTCTCGAAAGCCGTGGGCGGTGGTTGGTGTTTCAGCCGTGGGTGGTGGAAGTTGAGGAAGAGGCAGAAGATGTAGCTCATGTTACAAAGTTGCGATCA
GTTCTTGAGTCAGTTGATCATAAAGAAGAAAGTTTGGCAGTGATGATTGAACTTCAGCTTATAATGCCAGTTCTGCTTCAAAATTTGATGATTGATAGAGCTGTTACAAA
TGATCTGAGCCTTCAACTTTACAGGGAGAACCTACCTGGTGGAGCAGCATTTAATGATGAAATGGATATTCTCGAAAAGGAGCTCGAAGAGGAACTTGGCCAGAAGAGCG
TCGAAGGAGTTGTGTTTACTTGGTGTAAAAACAAAAGGACCAAAGAAGCTACAAATGAGAGAATTGATCGCTTTTTTGTCAATTCCAAAATGGAACAAAAGGCTTCGACC
ATGAAGGTGGACCAAAATTTCCACCATTCAGATCATCGTCCTATTCTCCTGGAGATCGAACGGAATCAACATCAAAAGAGGACCTTTTCTAAAAGAAAAATAAAGTTGGA
GGAAAGCTGGATAGAAATGGAAGGGAGCAGGATGGCTTTTAAAGAAGCTTGGACAGCCACGGGTGAAATCACAAATGTGAACTTTAGTGAGAAAATCCAAGAGGGGCTTA
CGACTATGCACTTGTGGAACAGAGAAAGATTGAAGGGTACCTTAAAGGGTGCTATTCAAAGAAAGGAAATCGAAATTAGCCATCTGGAAGCTTCAAACAGTCCTGATGCG
TTCGATAGTCTGGGTCAAGCGGAGAGAGAGCTGGAGGCCTTGCTTGAGGAGGAGGAGGGCTATTGGAAGTGGAGATCGAGAGAAGATTGGCCCAAGAGTGGGGACAAGAA
CACTAAGAGGTTCCATTCAAGAGCTTGGCAAAGGAAAAGGAAGAACGAAATAAAAGGCATCCTTGACCCTTTCGGTAAGTGGGTAGAGGAGGAGAAAGAAATTGGCAATA
TTGCGAACATCTTCTTCCAAAGCCTTCTCAGTTCCAACCATCCTTCTACGGAGGTTTTAAAGGAAGTTGTAGAGGCCATTGACACTCAAATATCAGAAGACCAAAAAAGA
AGGTTGGATTCCTCTTTCACGAGAGAAGAAATAGAGCAAGTTGTCAAGAATATGAAGCCTACAAAAGCTCCAGGTCCTGATGGAGCTCATGCTAAGCTTTATCAAAATAT
GTGGGACACGGCTGGGGAGAATACGATCAAGGTGTGCCTAGGTATCCTAAACAATAATGAGAGTATAGAGCCCCTAAACAAAACGCACATAGCTCTCATCCCTAAAGTTA
ATTCCCCAAAGAGGATGAGTGAGTTTAGGCCTATCAGCCTTTGTAATGTTACATACAAGATCGTTGCTAAGGCGTTAGCGAACAGAATGAAAAGGGTCCTAAATGCTATT
ATTTCTCCAAACCAATCAGCTTTTGTCCCTGGTAGACAAATTTCTGACAACGTCATTGTTGGGTTCGAATGCATTCACGCTTTGAATAGCAGAAGAAAAGAAAAGACGGG
GTACGTTGCTATCAAACTAGATATGAGCAAGGCGTACGACCGAGTCGAATGGAGATTTGTGGAAGAGATCATGAAGAAAATGAACTTAAGTGAGAAGTGGACCAGGATTA
TATTGAACTGCATTTCAACTATTAGCTACTCGATACTCATCAACGGGATTCCCCAAGAAGGTTTCAGGCCGTCGAGAGGAATCAGACAAGGGGACCCTTTATCCCCTTAT
CTATTCCTCTTGTGTGCGGAAGGATTCTCAGCTCTATTAAAAAGGGAGGAAACTCTTTCTAATCTTAGTGGTTTCCAAATTAATAAGCATTGCCCTAAGCTCACTCACTT
ATTTTTTGCAGATGACAGCCTCATTTTCATCAAAGCAGCAGAGAAGGATCTTCTCTCTCTCAAGAAAGTTCTGAAGAGCTACGAGGAAGCATCGGGCCAAACCATAAACT
TGGAGAAATCTGCCTTTATGACGAGCAGGAACATGGGCAGGGAAGCTAAAGAGGAGTGTGAAAAGATCCTGGGGATTAATAGAACTGAATCCCTAGGGACCTACCTAGGG
ATGCCATCCCGGATTGGCAAAAACAAGAGCGCAACTTTCTCCAGAATCAAAAATAGAGTTGAAAAGTTGCTCCAGGGGTGGAAAGAAAAGCTCTTCTCACTCGGGGGAAA
AGAAGTCCTAATCAAAGATGTGGCTCAATGTTCCGCCTCCCTAACGCTCCCTAACGCTCTATGTTCTGACCTTGATAGGTTATGTGCGAGATTCTGGTGGGGAGCTACAA
GAGATAAAGATAGGATGCATTGGATGAGCTGGAGAAGAATGTGCACCACTAAGGAGAAGGGCGGGATGGGGTTCAGACAAATTGCAGCTTTCAACCAAGCTATGTTGGCC
AAGCAAAGCTGGAGACTTATAAGGAATCCTAATAACCTCCTTTACAAAACCTTGAAGGGCAGATACTTTAAAGACGGGAACTTCCTCGAAGCCCATATAGGCAATTCTCC
GTCCTCGACGTGGATGAGTATCTGTTGGGGCAAAGAACTATTCAAAAAAGGTTTCAGATGGAGAGTAGGGAACGGGAACATGATCCAGCTAGCCAAAGACCCATGGATTC
GACGAGAAGAAAGTCTACATCCCCTCTATGTTTGCGATAACATAAAGGACAAGAATGTCAACTGGTTGCTAGATGACAACAATAACTGGAAGGAAACCCTCATTAAAGAA
TGTTTCATCGCCCAGGATGCCGAGGCTATTTTGAACATGTATACAGGGGACAAAAACTCAAAGGATGAAATTATCTGGCACTTAGACAAAAAAGGCATCTTTAAAGTCAA
AAGTGCGTATCATTTAGCTATGGACATCAAATCTCAAATGGAAGCATCCCAATCTAACAGTGACAAAGAAGCTTCGAGCTGGAAGAGCCTCTGGAACCTTGAAGTTCTTC
CTAGAACGAAAATTTGCACTTGGAAGATCATCAATGACATTGTCCCAAGCAAGGCCAATATCCTTAAAAAAAGGGATTAA
Protein sequenceShow/hide protein sequence
MEEDNPREDEEKTKKTASLEARKLVDRMITDEGPKNTVTKEDRGNAELERNRNPRRLCRKWEAKEKMGNTTEKRVKRKELTKNNVMKTSVTPFGYRRRLQSSPAERHENP
KLEHKLRRSLGFSNCINVDSQGASGGLALLWQNQPQISVNSYSKGQIDVTIKEADWWWRFTGFYGSPEQSKRKESWRLLESRGRWLVFQPWVVEVEEEAEDVAHVTKLRS
VLESVDHKEESLAVMIELQLIMPVLLQNLMIDRAVTNDLSLQLYRENLPGGAAFNDEMDILEKELEEELGQKSVEGVVFTWCKNKRTKEATNERIDRFFVNSKMEQKAST
MKVDQNFHHSDHRPILLEIERNQHQKRTFSKRKIKLEESWIEMEGSRMAFKEAWTATGEITNVNFSEKIQEGLTTMHLWNRERLKGTLKGAIQRKEIEISHLEASNSPDA
FDSLGQAERELEALLEEEEGYWKWRSREDWPKSGDKNTKRFHSRAWQRKRKNEIKGILDPFGKWVEEEKEIGNIANIFFQSLLSSNHPSTEVLKEVVEAIDTQISEDQKR
RLDSSFTREEIEQVVKNMKPTKAPGPDGAHAKLYQNMWDTAGENTIKVCLGILNNNESIEPLNKTHIALIPKVNSPKRMSEFRPISLCNVTYKIVAKALANRMKRVLNAI
ISPNQSAFVPGRQISDNVIVGFECIHALNSRRKEKTGYVAIKLDMSKAYDRVEWRFVEEIMKKMNLSEKWTRIILNCISTISYSILINGIPQEGFRPSRGIRQGDPLSPY
LFLLCAEGFSALLKREETLSNLSGFQINKHCPKLTHLFFADDSLIFIKAAEKDLLSLKKVLKSYEEASGQTINLEKSAFMTSRNMGREAKEECEKILGINRTESLGTYLG
MPSRIGKNKSATFSRIKNRVEKLLQGWKEKLFSLGGKEVLIKDVAQCSASLTLPNALCSDLDRLCARFWWGATRDKDRMHWMSWRRMCTTKEKGGMGFRQIAAFNQAMLA
KQSWRLIRNPNNLLYKTLKGRYFKDGNFLEAHIGNSPSSTWMSICWGKELFKKGFRWRVGNGNMIQLAKDPWIRREESLHPLYVCDNIKDKNVNWLLDDNNNWKETLIKE
CFIAQDAEAILNMYTGDKNSKDEIIWHLDKKGIFKVKSAYHLAMDIKSQMEASQSNSDKEASSWKSLWNLEVLPRTKICTWKIINDIVPSKANILKKRD