; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007715 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007715
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:3612015..3616816
RNA-Seq ExpressionLag0007715
SyntenyLag0007715
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]3.1e-19032.62Show/hide
Query:  DELSTVLQWAEQTQLKHGDNLPFKSPVSSSFSSQVQIIANELGELKAIWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFD
        D    V QW+E  QL+ G + P K  V    + Q+  I N L ELK++WE L PE+R  F K Y  I DL+Y  +N  T+QAL  FWDP  +CF F  FD
Subjt:  DELSTVLQWAEQTQLKHGDNLPFKSPVSSSFSSQVQIIANELGELKAIWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFD

Query:  LVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVD
        L PTIEEY  + ++    G  +Y ++ +LT +R+L KF+  +H  E++K +K K     I  +YL  LAR  +  +KGL L+ALCIYG V++PR+KGYV+
Subjt:  LVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVD

Query:  GDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQHVESN----KEYNCRIWHGPMELKFSEERSM
         +V+K+F GIE+GV+P IP++AET R+LN+C+    GK   C P+L+IW+ SH ++P +FG  + +            +E+N   W  P + +F      
Subjt:  GDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQHVESN----KEYNCRIWHGPMELKFSEERSM

Query:  GVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSRPLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPI
        G   +   +          AP    L+  E  + S+                     +N+K  + + +W+ V+++    H EG T +Y  WR  R+ I  
Subjt:  GVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSRPLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPI

Query:  MTRDNVGESSSRAVD--KPSQLATEWKELVGRNQTLKLENERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENT
         T + + ES    ++  K      E K L   N  L  ENE+L++EVK+ + Q  +    L+E +R                   +RL            
Subjt:  MTRDNVGESSSRAVD--KPSQLATEWKELVGRNQTLKLENERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENT

Query:  TLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQL
                            ET +  +  L  +I + K+QLIE E  N  L++ +                     +S E QL   ++A E ++ +  QL
Subjt:  TLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQL

Query:  EEKYSSL-------REDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLRENTSPMASNAEELFEFLGMI------RRDLGEKVDAIIAALE-
         EKY  +       +++Y  +R      +G++      ++ MAR A  F+  AR LR+   P     +EL  FLG+I      R+D+    D +   LE 
Subjt:  EEKYSSL-------REDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLRENTSPMASNAEELFEFLGMI------RRDLGEKVDAIIAALE-

Query:  ----KGKMIADTTAP-DTPIGNPQAGLP-FPPSFASHVQCQVPQIPMASQAGASYFKPEFSKIPFAVPN--------LDDP------EVKKELTAP--SN
            +GK +  T++  +  +      +P +PP F        PQ   + + G   +   F   P   PN         ++P      E  K+++    S 
Subjt:  ----KGKMIADTTAP-DTPIGNPQAGLP-FPPSFASHVQCQVPQIPMASQAGASYFKPEFSKIPFAVPN--------LDDP------EVKKELTAP--SN

Query:  EKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVT
         +LE LEERLR +E                            +FEKY+GT+CPKSH+ MYCRKM+ + H+DKLLIHCFQDSL  P +RWYMQLD S V  
Subjt:  EKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVT

Query:  WKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHG
        WK+L DSFLKQYK+NID+APDRLDLQRMEKK+ E FK+YAQRWR+  AQVQPPLTDKEL+AMFINTLR+P+Y+ MVG ASTNFSD++TIGERIE+G+K+G
Subjt:  WKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHG

Query:  RIAKTVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNNASTQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAP
        RI+    +S T++  + KKKEGEV  +    +   ++  P+    +N+ P+Y          +   SPF GQ+T                   Q+N R  
Subjt:  RIAKTVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNNASTQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAP

Query:  YKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQELIKAGW-----------------------
        +KQ +FDPIPM+YTELLP LI++HQ+   P  PLQPPYP+WYDPNA+C+YHAG VGHSTENC  LK +VQ L+KAGW                       
Subjt:  YKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQELIKAGW-----------------------

Query:  ---------------------------------------------------------------------------RVQELVDSKFLMVTQAHHQEDEIDV
                                                                                    VQ+L+DSK L++ Q   Q+ E+++
Subjt:  ---------------------------------------------------------------------------RVQELVDSKFLMVTQAHHQEDEIDV

Query:  VEELMPKE--SLNSSFKLKPLTIYYREKTT----THDPKSITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTGGVTRSGKCYTPDQLLKC
        +      E  S  ++F  KPL I+Y EK +       PK++T+++                             PV      GG+TRSG+CYTPD L   
Subjt:  VEELMPKE--SLNSSFKLKPLTIYYREKTT----THDPKSITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTGGVTRSGKCYTPDQLLKC

Query:  SK--NTIEEKNEKVE--GNAD----SNVKPKETTVMKKSRGKQTVSDEETQEFLK------------LVKQSEYKVVEQLAQTPAKISILSLLLSSGSHR
        SK       K + +E  G  D    S V  ++ T+++K    + VS EE  E +              VK  ++ V   L    + ++I+S        R
Subjt:  SK--NTIEEKNEKVE--GNAD----SNVKPKETTVMKKSRGKQTVSDEETQEFLK------------LVKQSEYKVVEQLAQTPAKISILSLLLSSGSHR

Query:  DTILVKGIKVPTQR------KRIDWTLRRILINIKIPLR
         T++   I     R      +  D   R ++ +I IPL+
Subjt:  DTILVKGIKVPTQR------KRIDWTLRRILINIKIPLR

KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.0e-23342.61Show/hide
Query:  IWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFDLVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEV
        IWE LTP+RRF+F K+Y HIA+LMYIPVN F ++A++ F DPAY CF F   DL+PTIEEY  M ++ +K  E++Y FNP+ T KRTL KFL  VH  E+
Subjt:  IWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFDLVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEV

Query:  QKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKG---------GK
        QK +K KG EE++  DYL ++ + +I+E+KGL LLALCIYG V++P+ +GYVD  V+KLFF +E+GV+P IP+LAET R+LNYC+ NKG         GK
Subjt:  QKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKG---------GK

Query:  LRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQ--HVESN--KEYNCRIWHGPMELKFSEERSMGVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSR
        L CCVPLLYIW+HSH KFP +F   +       ++  N   E+   +W    +  +  + +   F  +        +      K++  +  +        
Subjt:  LRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQ--HVESN--KEYNCRIWHGPMELKFSEERSMGVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSR

Query:  PLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSK-IPIMTRDNVGESSSRAVDKPSQLATEWKELVGRNQTLKLE
        P G      ++ L           +Q +     + +I    H EGVT  Y  W+  R K I  ++R+ V      + ++P+Q   +  EL  +N+ L+ E
Subjt:  PLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSK-IPIMTRDNVGESSSRAVDKPSQLATEWKELVGRNQTLKLE

Query:  NERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKT
        NE+L++E  + MD  T+   +LE+ +   KN+DK E D   LD+E RR+ K N +++NE TTL+ T                                  
Subjt:  NERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKT

Query:  QLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRA
                          L + M  +SEE +  K+Y+  L  QL A Q +S++++ E   L   Y  ++ DY +   D Q+++ +V++T+  ++++++RA
Subjt:  QLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRA

Query:  RGFAEWARNLRE--------NTSPMASNAEELFEFLGMIRRD---LGEKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVQCQVPQIP
         GFAEWA             NT   +   EE  + +  +R++   LGE+V  I+  L   KGK   DTT    PI +    + +PP F  +    VP++ 
Subjt:  RGFAEWARNLRE--------NTSPMASNAEELFEFLGMIRRD---LGEKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVQCQVPQIP

Query:  MASQAGASYFKPEFSKIPFAVPNLDDPEVKKELTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCR
                                  P     L   S +KL+VLEERLRA+E T V+GN+DATQLCLVP ++IP KFK+PEF KYDG+TCP+SH+ MYCR
Subjt:  MASQAGASYFKPEFSKIPFAVPNLDDPEVKKELTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCR

Query:  KMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAM
        KMAVH++NDKLL+HCFQDSL  P +RWY+QLD++H+  WK+L D+FLKQYK NID+APDRLDLQRMEKKS+E FK+YAQRWRD  A+VQPPLTDKE+++M
Subjt:  KMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAM

Query:  FINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAK-TVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNN
        F+NTLR+PFYE M+G ASTNFSDI+ IGERIEYGIKHGR+A+ T +  G KKG  SKKKEGEV  I   +    +    + +Y  N       +P  +NN
Subjt:  FINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAK-TVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNN

Query:  AS-TQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTE
         S   Y+ +   +T  +  +   S   +P     Q ++      +FDPIPMTYTELLPQLIQN QL   P+ P+QPPYP+WYD NARCDYHAG VGHSTE
Subjt:  AS-TQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTE

Query:  NCTALKYRVQELIKAGW
        NC ALK  VQ LI AGW
Subjt:  NCTALKYRVQELIKAGW

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]4.0e-19052.88Show/hide
Query:  KPEFSKIPFAVPNLDDPEVKKE---LTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVH
        +PE    P  V NL D   K +     APSNEK EVL+ERLRA+E T VFGN+DA+QLC V  +VIPPK K+PEFEKY+G++CPK+H+ MYCRKMA +V 
Subjt:  KPEFSKIPFAVPNLDDPEVKKE---LTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVH

Query:  NDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRS
        NDKLLIHCFQDSL GP +RWYMQLDSSHV +WKNL DSFLKQYKHNID+APDRLDLQRMEKKST+ FK+YAQRWRDT AQVQPPL DKELSAMFINTL+ 
Subjt:  NDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRS

Query:  PFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNN
        PFY+ M+G ASTNFSDIMTIGERIEYG++HGRI  T  +    KK + SKKKEGEVQM+      WK     +Q ++   QY   Y PT YGY    VNN
Subjt:  PFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNN

Query:  ASTQYSPFYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCD
        A++ Y P+  QN RP  +QNF+    S+  QP         T  QQNNR   KQ QFDPIPMTYTELLPQL QN+QL P PV P+QPPYPRWYD NARCD
Subjt:  ASTQYSPFYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCD

Query:  YHAGVVGHSTENCTALKYRVQELIKAGW------------------------------------------------------------------------
        YHAG + HSTENCT LKYRVQ LIKAGW                                                                        
Subjt:  YHAGVVGHSTENCTALKYRVQELIKAGW------------------------------------------------------------------------

Query:  -----------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREK--TTTHDPKSITIQVTAPFKYKSSKA
                               +VQEL+DSK L    +H ++   +VVE+++  E  + S K KPLTI+YREK    +   K   I V  PF+YKSSKA
Subjt:  -----------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREK--TTTHDPKSITIQVTAPFKYKSSKA

Query:  IPWSYEYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVE----GNA----DSNVKPKETTVMKKSRGKQTV
        +PW YE KVT+  ++  P LPVDNI+  GG+TR+G+CYTPD LLK    T  EKN         EKVE    G A    D+  +  E  V+K    KQ +
Subjt:  IPWSYEYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVE----GNA----DSNVKPKETTVMKKSRGKQTV

Query:  SDEETQEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL
        S+EETQEFLKLVKQSEYKV+EQL +TPA ISILSLLLSS +H++ +L
Subjt:  SDEETQEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]1.3e-19654.18Show/hide
Query:  PFAVPNLDDPEVKKEL---TAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIH
        P  V NL     K +L    APSNEK EVLEERLRA+EGT VFGN+DA+QLCLV  +VIPPKFK+PEFEKYDG++CPK+H+ MYCRKMA +V NDKLLIH
Subjt:  PFAVPNLDDPEVKKEL---TAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIH

Query:  CFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMV
        CFQDSL GP +RWYMQLDSS+V +WKNL DSFLKQYKHNID+APDRLDLQRMEKKSTE FK+YAQRWRDT AQVQPPLTDKELSAMFINTL+ PFY+ M+
Subjt:  CFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMV

Query:  GRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNNASTQYSP
        G ASTNFSDIMTIGERIEYG++HGRI  TV +    KK + SKKKEGEVQM+      WK     +Q +    +Y   Y PT YGY    VNNA++ YSP
Subjt:  GRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNNASTQYSP

Query:  FYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVG
        +  QN RP  +QNF+    S+  QP         T  QQ NR   KQ QFDPIPMTYTELLPQL QN+QL P PV P+QPPYPRWYD NARCDYHAG +G
Subjt:  FYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVG

Query:  HSTENCTALKYRVQELIKAGW-------------------------------------------------------------------------------
        HSTENCTALKYRVQ LIKAGW                                                                               
Subjt:  HSTENCTALKYRVQELIKAGW-------------------------------------------------------------------------------

Query:  ------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREKTTTHD--PKSITIQVTAPFKYKSSKAIPWSY
                          +VQEL+DSK L V  + HQ+  I++VE++   E  + + K K LTI+Y EK    +   K ITI V APF+YKSSKA+PW Y
Subjt:  ------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREKTTTHD--PKSITIQVTAPFKYKSSKAIPWSY

Query:  EYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVEGNADSNVKPK--------ETTVMKKSRGKQTVSDEET
        + KVT+  ++  PPLP+DNI+  GG+TR+G+CYTPD LLKC   T  EKN         EKVE +     K          E  V+K    KQ +S+EET
Subjt:  EYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVEGNADSNVKPK--------ETTVMKKSRGKQTVSDEET

Query:  QEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL
        QE LKLVKQSEYKV+EQL +TPAKISILSLLLSS +HR+ +L
Subjt:  QEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]7.2e-21641.39Show/hide
Query:  EDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPIMT-RDNVGESSSRAVDKPSQLATEWKELVGRNQTLKLENERLQQEVKRLMDQVTH
        ED   KK + + +W++VR+I    H EGVT  Y  W+  R K  I T R+ V      + ++P Q   +  +L  +N+ L+ ENE+LQ+E  + +D  T 
Subjt:  EDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPIMT-RDNVGESSSRAVDKPSQLATEWKELVGRNQTLKLENERLQQEVKRLMDQVTH

Query:  ASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQLIEFEEANTALRRTLD
           +LE+ +   KN+DK E++  +LDEE RR+ K N +L+NE T L+ T  SQ++ IKD+   KE  LELV +L  +I K++ Q+++ E  N +LR+T+D
Subjt:  ASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQLIEFEEANTALRRTLD

Query:  NLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLRENTSPM
        +L V M   SE+ +  K+Y+ SL  QL AFQ +SE++  E   L+  Y  ++ DY + R D Q ++ +V++T+  ++I++RRA GFAEWA +LR N   +
Subjt:  NLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLRENTSPM

Query:  ASNAEELFEFLGMIRRDLGEKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSFA-SHVQCQVPQIPMASQAGASY--FKPEFSKIPFAVPNLDDP
          ++++L  FL MI R+LG           KGK++ +T     P+ +    + +PP F   H+          SQ    Y    P F  +P  VP+++  
Subjt:  ASNAEELFEFLGMIRRDLGEKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSFA-SHVQCQVPQIPMASQAGASY--FKPEFSKIPFAVPNLDDP

Query:  EVKKEL-------TAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSL
        E + ++         P+ +KL+VLEERLRA+EGT V+GN+DATQLCLVP ++IP KFK+P F+KYDG++CP+SH+ MYCRKMA H+ NDKLLIHCFQDSL
Subjt:  EVKKEL-------TAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSL

Query:  KGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTN
         GP  RWY+QLD++H+  WK+L D+FLKQYKHNID+APDRLDLQRMEKKS+E FK+YAQRWRD  A+VQPPLTDKE++ MF+NTLR+PFY+ M+G A+TN
Subjt:  KGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTN

Query:  FSDIMTIGERIEYGIKHGRIAKT-VKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPS--NYLPTYGY------YPHQVNNAS----TQYSPF
        FSDI+ IGERIEYGIKHGR+ +T  +  G KKG + KKKEGEV  I                +P+  N+  T+G       +P  ++N +      Y P 
Subjt:  FSDIMTIGERIEYGIKHGRIAKT-VKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPS--NYLPTYGY------YPHQVNNAS----TQYSPF

Query:  YGQNTRPQM-NQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYR
        +  +  P++ N NF     Q      Q ++   +  +FDPIPMTYTELLPQL+ N QL P P+ P+QPPYP+WYDPNARCDYHAG VGHSTENC ALK +
Subjt:  YGQNTRPQM-NQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYR

Query:  VQELIKAGW-------------------------------------RVQEL---VDSKFLMVTQA-------------------------------HHQE
        VQ LI AGW                                      V E+   +++ F  + +A                               H   
Subjt:  VQELIKAGW-------------------------------------RVQEL---VDSKFLMVTQA-------------------------------HHQE

Query:  DEIDVVEELMPKESLNSS---------------FKLKPLTIYYRE---KTTTHDPKSITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTG
        D I  V E   K+ +  +               F  +PLT++Y+E   K+++ +PK + ++V +PFK+K  KA+PW Y+ +V          VDNI+   
Subjt:  DEIDVVEELMPKESLNSS---------------FKLKPLTIYYRE---KTTTHDPKSITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTG

Query:  GVTRSGKCYTPDQLLKCSKNTI---EEKNEKVEGNADSNVKPKETTVMKKSRGKQTVSDEETQEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHR
        G+TRSG+CY PD L   S +       K+EK   N     +  E +V K    K+ V+DE   EFLK+VKQSEYK++EQ+  TPA+IS+LSL L+S  HR
Subjt:  GVTRSGKCYTPDQLLKCSKNTI---EEKNEKVEGNADSNVKPKETTVMKKSRGKQTVSDEETQEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHR

Query:  DTIL
          +L
Subjt:  DTIL

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase1.5e-19032.62Show/hide
Query:  DELSTVLQWAEQTQLKHGDNLPFKSPVSSSFSSQVQIIANELGELKAIWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFD
        D    V QW+E  QL+ G + P K  V    + Q+  I N L ELK++WE L PE+R  F K Y  I DL+Y  +N  T+QAL  FWDP  +CF F  FD
Subjt:  DELSTVLQWAEQTQLKHGDNLPFKSPVSSSFSSQVQIIANELGELKAIWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFD

Query:  LVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVD
        L PTIEEY  + ++    G  +Y ++ +LT +R+L KF+  +H  E++K +K K     I  +YL  LAR  +  +KGL L+ALCIYG V++PR+KGYV+
Subjt:  LVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVD

Query:  GDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQHVESN----KEYNCRIWHGPMELKFSEERSM
         +V+K+F GIE+GV+P IP++AET R+LN+C+    GK   C P+L+IW+ SH ++P +FG  + +            +E+N   W  P + +F      
Subjt:  GDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQHVESN----KEYNCRIWHGPMELKFSEERSM

Query:  GVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSRPLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPI
        G   +   +          AP    L+  E  + S+                     +N+K  + + +W+ V+++    H EG T +Y  WR  R+ I  
Subjt:  GVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSRPLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPI

Query:  MTRDNVGESSSRAVD--KPSQLATEWKELVGRNQTLKLENERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENT
         T + + ES    ++  K      E K L   N  L  ENE+L++EVK+ + Q  +    L+E +R                   +RL            
Subjt:  MTRDNVGESSSRAVD--KPSQLATEWKELVGRNQTLKLENERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENT

Query:  TLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQL
                            ET +  +  L  +I + K+QLIE E  N  L++ +                     +S E QL   ++A E ++ +  QL
Subjt:  TLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQL

Query:  EEKYSSL-------REDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLRENTSPMASNAEELFEFLGMI------RRDLGEKVDAIIAALE-
         EKY  +       +++Y  +R      +G++      ++ MAR A  F+  AR LR+   P     +EL  FLG+I      R+D+    D +   LE 
Subjt:  EEKYSSL-------REDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLRENTSPMASNAEELFEFLGMI------RRDLGEKVDAIIAALE-

Query:  ----KGKMIADTTAP-DTPIGNPQAGLP-FPPSFASHVQCQVPQIPMASQAGASYFKPEFSKIPFAVPN--------LDDP------EVKKELTAP--SN
            +GK +  T++  +  +      +P +PP F        PQ   + + G   +   F   P   PN         ++P      E  K+++    S 
Subjt:  ----KGKMIADTTAP-DTPIGNPQAGLP-FPPSFASHVQCQVPQIPMASQAGASYFKPEFSKIPFAVPN--------LDDP------EVKKELTAP--SN

Query:  EKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVT
         +LE LEERLR +E                            +FEKY+GT+CPKSH+ MYCRKM+ + H+DKLLIHCFQDSL  P +RWYMQLD S V  
Subjt:  EKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVT

Query:  WKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHG
        WK+L DSFLKQYK+NID+APDRLDLQRMEKK+ E FK+YAQRWR+  AQVQPPLTDKEL+AMFINTLR+P+Y+ MVG ASTNFSD++TIGERIE+G+K+G
Subjt:  WKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHG

Query:  RIAKTVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNNASTQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAP
        RI+    +S T++  + KKKEGEV  +    +   ++  P+    +N+ P+Y          +   SPF GQ+T                   Q+N R  
Subjt:  RIAKTVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNNASTQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAP

Query:  YKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQELIKAGW-----------------------
        +KQ +FDPIPM+YTELLP LI++HQ+   P  PLQPPYP+WYDPNA+C+YHAG VGHSTENC  LK +VQ L+KAGW                       
Subjt:  YKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQELIKAGW-----------------------

Query:  ---------------------------------------------------------------------------RVQELVDSKFLMVTQAHHQEDEIDV
                                                                                    VQ+L+DSK L++ Q   Q+ E+++
Subjt:  ---------------------------------------------------------------------------RVQELVDSKFLMVTQAHHQEDEIDV

Query:  VEELMPKE--SLNSSFKLKPLTIYYREKTT----THDPKSITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTGGVTRSGKCYTPDQLLKC
        +      E  S  ++F  KPL I+Y EK +       PK++T+++                             PV      GG+TRSG+CYTPD L   
Subjt:  VEELMPKE--SLNSSFKLKPLTIYYREKTT----THDPKSITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTGGVTRSGKCYTPDQLLKC

Query:  SK--NTIEEKNEKVE--GNAD----SNVKPKETTVMKKSRGKQTVSDEETQEFLK------------LVKQSEYKVVEQLAQTPAKISILSLLLSSGSHR
        SK       K + +E  G  D    S V  ++ T+++K    + VS EE  E +              VK  ++ V   L    + ++I+S        R
Subjt:  SK--NTIEEKNEKVE--GNAD----SNVKPKETTVMKKSRGKQTVSDEETQEFLK------------LVKQSEYKVVEQLAQTPAKISILSLLLSSGSHR

Query:  DTILVKGIKVPTQR------KRIDWTLRRILINIKIPLR
         T++   I     R      +  D   R ++ +I IPL+
Subjt:  DTILVKGIKVPTQR------KRIDWTLRRILINIKIPLR

A0A5A7T1W2 Retrotrans_gag domain-containing protein4.9e-23442.61Show/hide
Query:  IWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFDLVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEV
        IWE LTP+RRF+F K+Y HIA+LMYIPVN F ++A++ F DPAY CF F   DL+PTIEEY  M ++ +K  E++Y FNP+ T KRTL KFL  VH  E+
Subjt:  IWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRCFVFQDFDLVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFL-VVHLKEV

Query:  QKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKG---------GK
        QK +K KG EE++  DYL ++ + +I+E+KGL LLALCIYG V++P+ +GYVD  V+KLFF +E+GV+P IP+LAET R+LNYC+ NKG         GK
Subjt:  QKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPTIPLLAETVRALNYCKSNKG---------GK

Query:  LRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQ--HVESN--KEYNCRIWHGPMELKFSEERSMGVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSR
        L CCVPLLYIW+HSH KFP +F   +       ++  N   E+   +W    +  +  + +   F  +        +      K++  +  +        
Subjt:  LRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQ--HVESN--KEYNCRIWHGPMELKFSEERSMGVFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSR

Query:  PLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSK-IPIMTRDNVGESSSRAVDKPSQLATEWKELVGRNQTLKLE
        P G      ++ L           +Q +     + +I    H EGVT  Y  W+  R K I  ++R+ V      + ++P+Q   +  EL  +N+ L+ E
Subjt:  PLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSK-IPIMTRDNVGESSSRAVDKPSQLATEWKELVGRNQTLKLE

Query:  NERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKT
        NE+L++E  + MD  T+   +LE+ +   KN+DK E D   LD+E RR+ K N +++NE TTL+ T                                  
Subjt:  NERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKT

Query:  QLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRA
                          L + M  +SEE +  K+Y+  L  QL A Q +S++++ E   L   Y  ++ DY +   D Q+++ +V++T+  ++++++RA
Subjt:  QLIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRA

Query:  RGFAEWARNLRE--------NTSPMASNAEELFEFLGMIRRD---LGEKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVQCQVPQIP
         GFAEWA             NT   +   EE  + +  +R++   LGE+V  I+  L   KGK   DTT    PI +    + +PP F  +    VP++ 
Subjt:  RGFAEWARNLRE--------NTSPMASNAEELFEFLGMIRRD---LGEKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVQCQVPQIP

Query:  MASQAGASYFKPEFSKIPFAVPNLDDPEVKKELTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCR
                                  P     L   S +KL+VLEERLRA+E T V+GN+DATQLCLVP ++IP KFK+PEF KYDG+TCP+SH+ MYCR
Subjt:  MASQAGASYFKPEFSKIPFAVPNLDDPEVKKELTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCR

Query:  KMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAM
        KMAVH++NDKLL+HCFQDSL  P +RWY+QLD++H+  WK+L D+FLKQYK NID+APDRLDLQRMEKKS+E FK+YAQRWRD  A+VQPPLTDKE+++M
Subjt:  KMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAM

Query:  FINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAK-TVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNN
        F+NTLR+PFYE M+G ASTNFSDI+ IGERIEYGIKHGR+A+ T +  G KKG  SKKKEGEV  I   +    +    + +Y  N       +P  +NN
Subjt:  FINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAK-TVKSSGTKKGNSSKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNN

Query:  AS-TQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTE
         S   Y+ +   +T  +  +   S   +P     Q ++      +FDPIPMTYTELLPQLIQN QL   P+ P+QPPYP+WYD NARCDYHAG VGHSTE
Subjt:  AS-TQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTE

Query:  NCTALKYRVQELIKAGW
        NC ALK  VQ LI AGW
Subjt:  NCTALKYRVQELIKAGW

A0A6J1D099 Ribonuclease H1.9e-19052.88Show/hide
Query:  KPEFSKIPFAVPNLDDPEVKKE---LTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVH
        +PE    P  V NL D   K +     APSNEK EVL+ERLRA+E T VFGN+DA+QLC V  +VIPPK K+PEFEKY+G++CPK+H+ MYCRKMA +V 
Subjt:  KPEFSKIPFAVPNLDDPEVKKE---LTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVH

Query:  NDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRS
        NDKLLIHCFQDSL GP +RWYMQLDSSHV +WKNL DSFLKQYKHNID+APDRLDLQRMEKKST+ FK+YAQRWRDT AQVQPPL DKELSAMFINTL+ 
Subjt:  NDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRS

Query:  PFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNN
        PFY+ M+G ASTNFSDIMTIGERIEYG++HGRI  T  +    KK + SKKKEGEVQM+      WK     +Q ++   QY   Y PT YGY    VNN
Subjt:  PFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNN

Query:  ASTQYSPFYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCD
        A++ Y P+  QN RP  +QNF+    S+  QP         T  QQNNR   KQ QFDPIPMTYTELLPQL QN+QL P PV P+QPPYPRWYD NARCD
Subjt:  ASTQYSPFYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCD

Query:  YHAGVVGHSTENCTALKYRVQELIKAGW------------------------------------------------------------------------
        YHAG + HSTENCT LKYRVQ LIKAGW                                                                        
Subjt:  YHAGVVGHSTENCTALKYRVQELIKAGW------------------------------------------------------------------------

Query:  -----------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREK--TTTHDPKSITIQVTAPFKYKSSKA
                               +VQEL+DSK L    +H ++   +VVE+++  E  + S K KPLTI+YREK    +   K   I V  PF+YKSSKA
Subjt:  -----------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREK--TTTHDPKSITIQVTAPFKYKSSKA

Query:  IPWSYEYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVE----GNA----DSNVKPKETTVMKKSRGKQTV
        +PW YE KVT+  ++  P LPVDNI+  GG+TR+G+CYTPD LLK    T  EKN         EKVE    G A    D+  +  E  V+K    KQ +
Subjt:  IPWSYEYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVE----GNA----DSNVKPKETTVMKKSRGKQTV

Query:  SDEETQEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL
        S+EETQEFLKLVKQSEYKV+EQL +TPA ISILSLLLSS +H++ +L
Subjt:  SDEETQEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222312.1e-18950.12Show/hide
Query:  DLGEKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSF----------------------------ASHVQCQVPQIPMASQAGASYFKPEFSKIP
        +L EK+DAI+ ALEKGK IA+T+ P      PQ    F PSF                            A      + + P  + AG  + +PE    P
Subjt:  DLGEKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSF----------------------------ASHVQCQVPQIPMASQAGASYFKPEFSKIP

Query:  FAVPNLDDPEVKKE---LTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHC
          V NL  P+  K+     APS+EKLEVLEERLRAVEGT VFGN+DA+QLCL   +VIPPKFK+PEFEKYDG++CPK+H+ MYCRKMA ++ NDKLLIHC
Subjt:  FAVPNLDDPEVKKE---LTAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHC

Query:  FQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVG
        FQDSL GP + WYM LDS HV +WKNL DSFLKQYKHNID+  DRLDLQ MEKK+ E FK+Y QRWRDT AQ QPP TDKELS+MFINTL+ PFY+ M+G
Subjt:  FQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVG

Query:  RASTNFSDIMTIGERIEYGIKHGRIAKTVKSSGT-KKGNSSKKKEGEVQMIWKVDKSHRQ-IHQPVAQYPSNYLPT-YGYYPHQVNNASTQYSPFYGQNT
         AST+FSDI+TIGERIEYG+ HGRI  TV  S T K   SSKKKEGEVQM+       RQ  +     Y   Y P+ YGY    VN A+ QY+    QN 
Subjt:  RASTNFSDIMTIGERIEYGIKHGRIAKTVKSSGT-KKGNSSKKKEGEVQMIWKVDKSHRQ-IHQPVAQYPSNYLPT-YGYYPHQVNNASTQYSPFYGQNT

Query:  RPQMNQNFKSRRQQP---VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQE
        RP  +Q F+ R  Q     T  QQNNR+  KQ+QFDPIPMTYTELLPQL QN+QL P PV P+QPPYP WYD N RCDYHAG +GHSTENCTALKYRVQ 
Subjt:  RPQMNQNFKSRRQQP---VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQE

Query:  LIKAG-----------------------------------------------------W-----------------------------------------
        LIKAG                                                     W                                         
Subjt:  LIKAG-----------------------------------------------------W-----------------------------------------

Query:  ---RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREK--TTTHDPKSITIQVTAPFKYKSSKAIPWSYEYKVTI--NSEMPPL
           +VQEL+D K L VTQ+ HQE  IDVVE +   ES ++++K KPLT+ YREK    ++  + ITIQV APF+Y SSKA+PW YE KVT+   ++   L
Subjt:  ---RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREK--TTTHDPKSITIQVTAPFKYKSSKAIPWSYEYKVTI--NSEMPPL

Query:  PVDNISRTGGVTRSGKCYTPDQLLKCS---------KNTIEEKNEKVEGNADSNVKPKETTV--MKKSRG-KQTVSDEETQEFLKLVKQSEYKVVEQLAQ
        PVDNI+R GG+TR+G+CYTP+ LLK +         +   E+KNEKVE +     K  E      +++ G KQ VS+EETQEFLKL+K SEYK++EQL +
Subjt:  PVDNISRTGGVTRSGKCYTPDQLLKCS---------KNTIEEKNEKVEGNADSNVKPKETTV--MKKSRG-KQTVSDEETQEFLKLVKQSEYKVVEQLAQ

Query:  TPAKISILSLLLSS
        TPA+ISILSLLLSS
Subjt:  TPAKISILSLLLSS

A0A6J1E2J7 Ribonuclease H6.2e-19754.18Show/hide
Query:  PFAVPNLDDPEVKKEL---TAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIH
        P  V NL     K +L    APSNEK EVLEERLRA+EGT VFGN+DA+QLCLV  +VIPPKFK+PEFEKYDG++CPK+H+ MYCRKMA +V NDKLLIH
Subjt:  PFAVPNLDDPEVKKEL---TAPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIH

Query:  CFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMV
        CFQDSL GP +RWYMQLDSS+V +WKNL DSFLKQYKHNID+APDRLDLQRMEKKSTE FK+YAQRWRDT AQVQPPLTDKELSAMFINTL+ PFY+ M+
Subjt:  CFQDSLKGPTARWYMQLDSSHVVTWKNLVDSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMV

Query:  GRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNNASTQYSP
        G ASTNFSDIMTIGERIEYG++HGRI  TV +    KK + SKKKEGEVQM+      WK     +Q +    +Y   Y PT YGY    VNNA++ YSP
Subjt:  GRASTNFSDIMTIGERIEYGIKHGRIAKTV-KSSGTKKGNSSKKKEGEVQMI------WKVDKSHRQIHQPVAQYPSNYLPT-YGYYPHQVNNASTQYSP

Query:  FYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVG
        +  QN RP  +QNF+    S+  QP         T  QQ NR   KQ QFDPIPMTYTELLPQL QN+QL P PV P+QPPYPRWYD NARCDYHAG +G
Subjt:  FYGQNTRPQMNQNFK----SRRQQP--------VTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLTPKPVAPLQPPYPRWYDPNARCDYHAGVVG

Query:  HSTENCTALKYRVQELIKAGW-------------------------------------------------------------------------------
        HSTENCTALKYRVQ LIKAGW                                                                               
Subjt:  HSTENCTALKYRVQELIKAGW-------------------------------------------------------------------------------

Query:  ------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREKTTTHD--PKSITIQVTAPFKYKSSKAIPWSY
                          +VQEL+DSK L V  + HQ+  I++VE++   E  + + K K LTI+Y EK    +   K ITI V APF+YKSSKA+PW Y
Subjt:  ------------------RVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREKTTTHD--PKSITIQVTAPFKYKSSKAIPWSY

Query:  EYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVEGNADSNVKPK--------ETTVMKKSRGKQTVSDEET
        + KVT+  ++  PPLP+DNI+  GG+TR+G+CYTPD LLKC   T  EKN         EKVE +     K          E  V+K    KQ +S+EET
Subjt:  EYKVTINSEM--PPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKN---------EKVEGNADSNVKPK--------ETTVMKKSRGKQTVSDEET

Query:  QEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL
        QE LKLVKQSEYKV+EQL +TPAKISILSLLLSS +HR+ +L
Subjt:  QEFLKLVKQSEYKVVEQLAQTPAKISILSLLLSSGSHRDTIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCCAACGTACCCATAGAGTCGATACGGTCATCTTCCTCTGAGTATGATGAGTTGAGTACTGTGCTACAATGGGCAGAGCAGACACAACTAAAACATGGGGATAA
TCTGCCTTTTAAAAGCCCCGTGTCGTCATCCTTCTCAAGTCAAGTTCAAATCATTGCTAACGAGCTAGGGGAGCTGAAGGCCATTTGGGAAGGTTTAACGCCAGAAAGAA
GATTTTTGTTCCCAAAGAGATACGACCATATTGCAGACCTCATGTATATTCCAGTTAATCGGTTCACCATCCAGGCTTTGTTAAAATTCTGGGACCCGGCCTACAGGTGT
TTCGTATTTCAAGATTTCGACTTAGTCCCAACCATTGAGGAGTACCACACGATGCAGAACATTGAAGAAAAAGGTGGAGAAATGATCTATTGTTTCAACCCACAACTAAC
TGCAAAGAGAACTTTACCCAAATTTCTAGTTGTACACCTGAAGGAAGTGCAAAAAAATGTGAAAGTGAAAGGGGTTGAAGAGTCTATATCTGCAGATTATCTAACAGAAC
TCGCTCGCAAACACATAAATGAAGAAAAGGGTCTGGTTCTGTTAGCTTTGTGCATCTACGGAATGGTCTTGTATCCCAGAGTTAAAGGATATGTGGATGGTGATGTCTTG
AAGCTATTCTTCGGCATTGAGCAAGGGGTGGATCCAACCATACCTCTTCTAGCGGAAACGGTTCGAGCGCTCAATTATTGTAAGTCCAACAAAGGCGGGAAGTTGAGATG
TTGTGTGCCCTTACTCTATATTTGGGTGCATAGTCATTTTAAGTTCCCAGAAGATTTTGGGCTGTCCAAAAATCAACTTCAGCAGCACGTGGAATCTAATAAGGAGTACA
ATTGCAGAATTTGGCACGGCCCCATGGAACTCAAGTTTTCCGAGGAAAGAAGCATGGGTGTCTTTCTTTTCGAATTCCAAATTAGAGCATGTGATTTGGAAGGCACATTG
GATGCCCCTAAAAGCCTTAATCTACAGATGCGGGAGCTTCCACACAGTTCCTCTAGTAGGCCCTTGGGGATGCGTCCATTATGCTCCGTTGTTGGCCTTGCGACAAACTG
TGAGGATTCTAACGAAAAGAAGCGACAAGTGTTGGCTTCCTGGAGAACAGTTAGAAGGATAAATGGCAATAGTCACTCTGAGGGAGTTACTCCAGAATATCTGCAATGGC
GCATTAAGAGGAGCAAGATCCCAATTATGACTCGTGACAATGTGGGAGAATCATCTAGTAGAGCAGTAGATAAGCCTAGCCAGCTAGCGACAGAGTGGAAAGAGTTAGTG
GGAAGAAATCAAACATTGAAGCTAGAAAATGAAAGATTACAACAAGAGGTCAAACGTTTGATGGACCAAGTAACTCATGCTAGTGGAAAACTCGAGGAAGTAGAGAGAGT
TTCAAAAAACCGTGACAAGCGAGAAAGAGATTATAATATACTGGACGAGGAAAATAGAAGGTTGATTAAGGAGAATCATGCTCTAAGAAATGAAAATACTACATTACGAA
GGACAACTCGTTCACAAGAGGATAGGATCAAAGACATTTCGAGAGGCAAAGAGACTCTCTTAGAGTTAGTTGCGGAATTAAACGAAACCATCAACAAGCAGAAAACGCAA
CTTATTGAGTTTGAAGAAGCCAATACTGCTCTAAGGCGAACGTTGGACAATCTACGCGTGAACATGCAGGCTCAATCAGAAGAGTCTGAAGCTTTTAAAAGTTATTCAAG
CTCGTTGGAACGTCAGCTCCAAGCATTTCAAAGGGCAAGCGAACAGTTGTCACTAGAGAGAGGACAACTAGAGGAAAAATACTCCTCATTAAGGGAAGATTATGCCATTA
TGAGAGAAGACATGCAAATAATTCTTGGGAAAGTAAATGAAACTATGAACACTATCAAGATCATGGCTAGGAGAGCCCGAGGATTTGCAGAATGGGCAAGGAATCTGCGA
GAGAATACATCACCTATGGCCTCTAATGCGGAGGAGTTGTTTGAGTTTTTAGGGATGATTCGTAGAGACCTTGGGGAAAAGGTTGATGCAATCATTGCCGCTTTGGAAAA
GGGCAAAATGATTGCAGATACAACTGCACCAGACACTCCGATTGGAAACCCTCAAGCTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCAATGCCAAGTCCCCC
AAATCCCCATGGCTAGCCAAGCTGGTGCTTCTTATTTCAAACCAGAATTTTCAAAAATACCTTTTGCGGTCCCTAACTTGGATGACCCAGAAGTAAAGAAGGAGTTAACC
GCGCCCTCTAATGAAAAACTTGAGGTTTTAGAGGAAAGATTGAGGGCAGTCGAGGGCACTAGTGTTTTTGGAAACATGGATGCCACTCAATTATGTTTAGTTCCAGACGT
TGTGATCCCACCGAAATTCAAACTTCCTGAATTTGAGAAATACGATGGGACAACATGCCCTAAAAGTCACATTGCTATGTACTGTAGGAAGATGGCAGTGCACGTTCATA
ATGATAAGCTGCTCATACATTGCTTTCAGGATAGCTTAAAGGGTCCAACTGCCCGTTGGTATATGCAGTTGGATAGTTCACATGTTGTCACATGGAAGAACTTGGTTGAT
TCCTTCCTCAAACAGTACAAGCACAATATTGATTTGGCTCCCGATCGACTGGACTTACAGAGGATGGAAAAGAAGAGTACTGAAGACTTCAAAGATTATGCTCAAAGGTG
GCGGGATACGACAGCTCAAGTCCAACCTCCTTTAACGGATAAGGAGCTTTCTGCCATGTTCATCAACACCTTAAGGTCCCCATTCTATGAACATATGGTTGGAAGGGCGT
CAACCAATTTCTCCGATATCATGACAATTGGAGAAAGAATTGAGTACGGGATCAAACATGGTAGAATAGCCAAAACCGTTAAATCCTCAGGCACAAAGAAGGGCAACAGT
TCAAAGAAGAAGGAGGGAGAAGTACAGATGATATGGAAGGTGGATAAGTCCCATCGACAAATCCACCAACCCGTCGCACAGTATCCTTCGAATTATTTACCTACATACGG
ATACTATCCACACCAGGTGAATAATGCATCAACGCAGTACTCCCCATTTTATGGCCAAAATACCAGACCCCAAATGAATCAGAATTTTAAGTCTCGTAGACAACAACCAG
TCACTCCAAGTCAGCAAAATAATCGAGCACCTTACAAGCAAGCCCAATTTGATCCGATTCCGATGACATATACTGAGCTCTTGCCTCAGTTAATTCAGAACCATCAGTTG
ACACCCAAGCCTGTTGCTCCACTGCAACCCCCTTATCCAAGATGGTATGATCCCAATGCAAGGTGTGATTACCATGCAGGTGTTGTTGGGCATTCTACAGAAAATTGCAC
TGCTTTGAAGTATAGGGTGCAAGAGCTCATTAAGGCCGGATGGAGGGTCCAAGAACTGGTGGACTCAAAATTTCTTATGGTCACCCAAGCCCATCACCAGGAGGATGAAA
TAGACGTCGTGGAAGAATTGATGCCTAAAGAAAGTTTGAATTCATCTTTCAAACTGAAGCCACTCACGATCTATTACCGTGAGAAGACTACTACTCATGATCCAAAGTCG
ATCACCATTCAGGTGACGGCTCCTTTCAAGTACAAGAGTTCTAAGGCAATACCATGGAGTTATGAGTACAAAGTAACTATTAACTCAGAGATGCCACCACTTCCAGTTGA
CAACATTTCCAGAACGGGAGGCGTAACACGGAGTGGAAAATGCTACACTCCAGATCAGTTATTGAAGTGCTCAAAGAATACAATAGAGGAGAAAAATGAGAAAGTTGAAG
GTAATGCTGATAGCAATGTCAAGCCTAAAGAAACAACAGTGATGAAGAAATCAAGGGGCAAGCAGACTGTGTCCGACGAAGAAACTCAAGAATTTCTAAAGTTGGTTAAG
CAAAGTGAATACAAAGTAGTAGAACAGCTAGCTCAGACGCCTGCTAAGATTTCCATATTATCTTTGTTGCTATCCTCTGGTTCGCACCGAGACACGATATTGGTGAAGGG
AATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAATATTAAAATACCGTTACGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGCCAACGTACCCATAGAGTCGATACGGTCATCTTCCTCTGAGTATGATGAGTTGAGTACTGTGCTACAATGGGCAGAGCAGACACAACTAAAACATGGGGATAA
TCTGCCTTTTAAAAGCCCCGTGTCGTCATCCTTCTCAAGTCAAGTTCAAATCATTGCTAACGAGCTAGGGGAGCTGAAGGCCATTTGGGAAGGTTTAACGCCAGAAAGAA
GATTTTTGTTCCCAAAGAGATACGACCATATTGCAGACCTCATGTATATTCCAGTTAATCGGTTCACCATCCAGGCTTTGTTAAAATTCTGGGACCCGGCCTACAGGTGT
TTCGTATTTCAAGATTTCGACTTAGTCCCAACCATTGAGGAGTACCACACGATGCAGAACATTGAAGAAAAAGGTGGAGAAATGATCTATTGTTTCAACCCACAACTAAC
TGCAAAGAGAACTTTACCCAAATTTCTAGTTGTACACCTGAAGGAAGTGCAAAAAAATGTGAAAGTGAAAGGGGTTGAAGAGTCTATATCTGCAGATTATCTAACAGAAC
TCGCTCGCAAACACATAAATGAAGAAAAGGGTCTGGTTCTGTTAGCTTTGTGCATCTACGGAATGGTCTTGTATCCCAGAGTTAAAGGATATGTGGATGGTGATGTCTTG
AAGCTATTCTTCGGCATTGAGCAAGGGGTGGATCCAACCATACCTCTTCTAGCGGAAACGGTTCGAGCGCTCAATTATTGTAAGTCCAACAAAGGCGGGAAGTTGAGATG
TTGTGTGCCCTTACTCTATATTTGGGTGCATAGTCATTTTAAGTTCCCAGAAGATTTTGGGCTGTCCAAAAATCAACTTCAGCAGCACGTGGAATCTAATAAGGAGTACA
ATTGCAGAATTTGGCACGGCCCCATGGAACTCAAGTTTTCCGAGGAAAGAAGCATGGGTGTCTTTCTTTTCGAATTCCAAATTAGAGCATGTGATTTGGAAGGCACATTG
GATGCCCCTAAAAGCCTTAATCTACAGATGCGGGAGCTTCCACACAGTTCCTCTAGTAGGCCCTTGGGGATGCGTCCATTATGCTCCGTTGTTGGCCTTGCGACAAACTG
TGAGGATTCTAACGAAAAGAAGCGACAAGTGTTGGCTTCCTGGAGAACAGTTAGAAGGATAAATGGCAATAGTCACTCTGAGGGAGTTACTCCAGAATATCTGCAATGGC
GCATTAAGAGGAGCAAGATCCCAATTATGACTCGTGACAATGTGGGAGAATCATCTAGTAGAGCAGTAGATAAGCCTAGCCAGCTAGCGACAGAGTGGAAAGAGTTAGTG
GGAAGAAATCAAACATTGAAGCTAGAAAATGAAAGATTACAACAAGAGGTCAAACGTTTGATGGACCAAGTAACTCATGCTAGTGGAAAACTCGAGGAAGTAGAGAGAGT
TTCAAAAAACCGTGACAAGCGAGAAAGAGATTATAATATACTGGACGAGGAAAATAGAAGGTTGATTAAGGAGAATCATGCTCTAAGAAATGAAAATACTACATTACGAA
GGACAACTCGTTCACAAGAGGATAGGATCAAAGACATTTCGAGAGGCAAAGAGACTCTCTTAGAGTTAGTTGCGGAATTAAACGAAACCATCAACAAGCAGAAAACGCAA
CTTATTGAGTTTGAAGAAGCCAATACTGCTCTAAGGCGAACGTTGGACAATCTACGCGTGAACATGCAGGCTCAATCAGAAGAGTCTGAAGCTTTTAAAAGTTATTCAAG
CTCGTTGGAACGTCAGCTCCAAGCATTTCAAAGGGCAAGCGAACAGTTGTCACTAGAGAGAGGACAACTAGAGGAAAAATACTCCTCATTAAGGGAAGATTATGCCATTA
TGAGAGAAGACATGCAAATAATTCTTGGGAAAGTAAATGAAACTATGAACACTATCAAGATCATGGCTAGGAGAGCCCGAGGATTTGCAGAATGGGCAAGGAATCTGCGA
GAGAATACATCACCTATGGCCTCTAATGCGGAGGAGTTGTTTGAGTTTTTAGGGATGATTCGTAGAGACCTTGGGGAAAAGGTTGATGCAATCATTGCCGCTTTGGAAAA
GGGCAAAATGATTGCAGATACAACTGCACCAGACACTCCGATTGGAAACCCTCAAGCTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCAATGCCAAGTCCCCC
AAATCCCCATGGCTAGCCAAGCTGGTGCTTCTTATTTCAAACCAGAATTTTCAAAAATACCTTTTGCGGTCCCTAACTTGGATGACCCAGAAGTAAAGAAGGAGTTAACC
GCGCCCTCTAATGAAAAACTTGAGGTTTTAGAGGAAAGATTGAGGGCAGTCGAGGGCACTAGTGTTTTTGGAAACATGGATGCCACTCAATTATGTTTAGTTCCAGACGT
TGTGATCCCACCGAAATTCAAACTTCCTGAATTTGAGAAATACGATGGGACAACATGCCCTAAAAGTCACATTGCTATGTACTGTAGGAAGATGGCAGTGCACGTTCATA
ATGATAAGCTGCTCATACATTGCTTTCAGGATAGCTTAAAGGGTCCAACTGCCCGTTGGTATATGCAGTTGGATAGTTCACATGTTGTCACATGGAAGAACTTGGTTGAT
TCCTTCCTCAAACAGTACAAGCACAATATTGATTTGGCTCCCGATCGACTGGACTTACAGAGGATGGAAAAGAAGAGTACTGAAGACTTCAAAGATTATGCTCAAAGGTG
GCGGGATACGACAGCTCAAGTCCAACCTCCTTTAACGGATAAGGAGCTTTCTGCCATGTTCATCAACACCTTAAGGTCCCCATTCTATGAACATATGGTTGGAAGGGCGT
CAACCAATTTCTCCGATATCATGACAATTGGAGAAAGAATTGAGTACGGGATCAAACATGGTAGAATAGCCAAAACCGTTAAATCCTCAGGCACAAAGAAGGGCAACAGT
TCAAAGAAGAAGGAGGGAGAAGTACAGATGATATGGAAGGTGGATAAGTCCCATCGACAAATCCACCAACCCGTCGCACAGTATCCTTCGAATTATTTACCTACATACGG
ATACTATCCACACCAGGTGAATAATGCATCAACGCAGTACTCCCCATTTTATGGCCAAAATACCAGACCCCAAATGAATCAGAATTTTAAGTCTCGTAGACAACAACCAG
TCACTCCAAGTCAGCAAAATAATCGAGCACCTTACAAGCAAGCCCAATTTGATCCGATTCCGATGACATATACTGAGCTCTTGCCTCAGTTAATTCAGAACCATCAGTTG
ACACCCAAGCCTGTTGCTCCACTGCAACCCCCTTATCCAAGATGGTATGATCCCAATGCAAGGTGTGATTACCATGCAGGTGTTGTTGGGCATTCTACAGAAAATTGCAC
TGCTTTGAAGTATAGGGTGCAAGAGCTCATTAAGGCCGGATGGAGGGTCCAAGAACTGGTGGACTCAAAATTTCTTATGGTCACCCAAGCCCATCACCAGGAGGATGAAA
TAGACGTCGTGGAAGAATTGATGCCTAAAGAAAGTTTGAATTCATCTTTCAAACTGAAGCCACTCACGATCTATTACCGTGAGAAGACTACTACTCATGATCCAAAGTCG
ATCACCATTCAGGTGACGGCTCCTTTCAAGTACAAGAGTTCTAAGGCAATACCATGGAGTTATGAGTACAAAGTAACTATTAACTCAGAGATGCCACCACTTCCAGTTGA
CAACATTTCCAGAACGGGAGGCGTAACACGGAGTGGAAAATGCTACACTCCAGATCAGTTATTGAAGTGCTCAAAGAATACAATAGAGGAGAAAAATGAGAAAGTTGAAG
GTAATGCTGATAGCAATGTCAAGCCTAAAGAAACAACAGTGATGAAGAAATCAAGGGGCAAGCAGACTGTGTCCGACGAAGAAACTCAAGAATTTCTAAAGTTGGTTAAG
CAAAGTGAATACAAAGTAGTAGAACAGCTAGCTCAGACGCCTGCTAAGATTTCCATATTATCTTTGTTGCTATCCTCTGGTTCGCACCGAGACACGATATTGGTGAAGGG
AATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAATATTAAAATACCGTTACGATAA
Protein sequenceShow/hide protein sequence
MNANVPIESIRSSSSEYDELSTVLQWAEQTQLKHGDNLPFKSPVSSSFSSQVQIIANELGELKAIWEGLTPERRFLFPKRYDHIADLMYIPVNRFTIQALLKFWDPAYRC
FVFQDFDLVPTIEEYHTMQNIEEKGGEMIYCFNPQLTAKRTLPKFLVVHLKEVQKNVKVKGVEESISADYLTELARKHINEEKGLVLLALCIYGMVLYPRVKGYVDGDVL
KLFFGIEQGVDPTIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGLSKNQLQQHVESNKEYNCRIWHGPMELKFSEERSMGVFLFEFQIRACDLEGTL
DAPKSLNLQMRELPHSSSSRPLGMRPLCSVVGLATNCEDSNEKKRQVLASWRTVRRINGNSHSEGVTPEYLQWRIKRSKIPIMTRDNVGESSSRAVDKPSQLATEWKELV
GRNQTLKLENERLQQEVKRLMDQVTHASGKLEEVERVSKNRDKRERDYNILDEENRRLIKENHALRNENTTLRRTTRSQEDRIKDISRGKETLLELVAELNETINKQKTQ
LIEFEEANTALRRTLDNLRVNMQAQSEESEAFKSYSSSLERQLQAFQRASEQLSLERGQLEEKYSSLREDYAIMREDMQIILGKVNETMNTIKIMARRARGFAEWARNLR
ENTSPMASNAEELFEFLGMIRRDLGEKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSFASHVQCQVPQIPMASQAGASYFKPEFSKIPFAVPNLDDPEVKKELT
APSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIAMYCRKMAVHVHNDKLLIHCFQDSLKGPTARWYMQLDSSHVVTWKNLVD
SFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTTAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIMTIGERIEYGIKHGRIAKTVKSSGTKKGNS
SKKKEGEVQMIWKVDKSHRQIHQPVAQYPSNYLPTYGYYPHQVNNASTQYSPFYGQNTRPQMNQNFKSRRQQPVTPSQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQL
TPKPVAPLQPPYPRWYDPNARCDYHAGVVGHSTENCTALKYRVQELIKAGWRVQELVDSKFLMVTQAHHQEDEIDVVEELMPKESLNSSFKLKPLTIYYREKTTTHDPKS
ITIQVTAPFKYKSSKAIPWSYEYKVTINSEMPPLPVDNISRTGGVTRSGKCYTPDQLLKCSKNTIEEKNEKVEGNADSNVKPKETTVMKKSRGKQTVSDEETQEFLKLVK
QSEYKVVEQLAQTPAKISILSLLLSSGSHRDTILVKGIKVPTQRKRIDWTLRRILINIKIPLR