; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g25580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g25580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr11:18723238..18732244
RNA-Seq ExpressionMoc11g25580
SyntenyMoc11g25580
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR002156 - Ribonuclease H domain
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143495.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]0.0e+0055.43Show/hide
Query:  GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSD
        GLPAKTDPVGQ+APSNEKFEVL+ERLR++ G                             + +  +  P N       KMAAYVQNDKLLIH FQDSLS 
Subjt:  GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSD

Query:  PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFS
        PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRL+LQRME KST+SFKEYAQRWRDT  QVQPPLTDKELS MFINTLKH FYDRM+G  STNFS
Subjt:  PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFS

Query:  DIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQN
        DIM I ERIEYGV+HGRITSTA+EPLAAKK SHSKKKEGE+  V                                                        
Subjt:  DIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQN

Query:  FQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFE
          P D IQPPYPRW DANARCDYH GAIGHS ENCTALKYRVQALIKAGWLNFKKENG +VS NPLPNH NVQINAIECQ +ESKSKVADITTPMEELFE
Subjt:  FQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFE

Query:  ILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----------------------------------------------------------------
        ILLGSGYVSVEYLCPNLKYKGYDES+TCPFHAGAK                                                                 
Subjt:  ILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----------------------------------------------------------------

Query:  ---DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAEEDKK
           DAPN S+KPITITV  PFEYKSSKAVPW+YECKVTVGQ+                               R++E  SEKNKEKASEKKKEK EEDKK
Subjt:  ---DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAEEDKK

Query:  GKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNIT
        GK KL +D++DELVEAIVVKD +PKQ V EEE QEFLKLVKQSEYKV EQLGRTPAKISILSLLLSS+ HRNTLLE LKQAFVSQDITVDNLSNVVGNIT
Subjt:  GKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNIT

Query:  ASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVD
        ASSSITFTDEEIP E                   AKVLVD+GSSLNIMPRSTLEKLPVDMS +RPSTVIVRAFDGARSAV                 T  
Subjt:  ASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVD

Query:  TFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGV
          D +S+  F                   + KFAVDQKLVIISGQEDILVSRLASM YVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKG 
Subjt:  TFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGV

Query:  NGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV------------
        N SLDKLLRMAKNTKKFGLGYKPSRG IIRVRSLEKAK LSRFENEERDYPRRTVPPLSHSFRSA TIHQEYD +SVVAAVTEEREQV            
Subjt:  NGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV------------

Query:  ----------------------------------ENASGALLKFLE----------------------------------------------------CW
                                          +  S  LL+ LE                                                     W
Subjt:  ----------------------------------ENASGALLKFLE----------------------------------------------------CW

Query:  -------------------------------------------RCARRIAASFKIAS-------------------------------------------
                                                      ++I A F   S                                           
Subjt:  -------------------------------------------RCARRIAASFKIAS-------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------VSTAE----------------------
                                                                                  +T E                      
Subjt:  -------------------------------------------------------------------------VSTAE----------------------

Query:  ---------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKK
                                   VTENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAW ARRLRQYMLYYT WLISKMD  K
Subjt:  ---------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKK

Query:  YIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKG
        YIFEK SLS RIAR QVLLSEYDIVYVTQK IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGH IGAILISPKG
Subjt:  YIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKG

Query:  ELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV
        ELYPLTARLCFDCTHNM EYEACSMGVQAAVD+KVKKLKVFGDS+LVIHQLRGEWETRDVKLLPYKQLITELSQEFDE+SFDYLPRENNQV DALATLAV
Subjt:  ELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV

Query:  MFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        MFNL+LNEDV PIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYI SKEYPPNASENDKRT RKLAMKFFLNGEIL K
Subjt:  MFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]0.0e+0054.99Show/hide
Query:  MPQYTTYNPLCGLP----------------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG------------------
        MPQYTTYNPL  +P                                        AKTDPVGQ+APSNEKFEVL+ERLR++                    
Subjt:  MPQYTTYNPLCGLP----------------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG------------------

Query:  -----------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKST
                   + +  +  P N       KMAAYVQNDKLLIH FQDSLS PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRL+LQRME KST
Subjt:  -----------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKST

Query:  KSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERH
        KSFKEYAQRWRDT  QVQPPL DKELS MFINTLKH FYDRMIG  STNFSDIMTI ERIEYGV+HGRITST +EPLAAKKASHSKKKEGEVQMVGA+RH
Subjt:  KSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERH

Query:  SWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQNFQ-----------------------------------------------
        SWKQQPY +TP+Y+PYYYPTPYGYNQPFVNNATSHY PYASQNF+PPASQNFQ                                               
Subjt:  SWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQNFQ-----------------------------------------------

Query:  -----------PRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADI
                   P D IQPPYPRWYDANARCDYHAGAI HSTENCT LKYRVQALIKAGW NFKKENG DVSK  L NHQNVQINAIECQG+ESKSKVA+I
Subjt:  -----------PRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADI

Query:  TTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK--------------------------------------------------------
        TTPM ELFEILLGSGY+SVEYLCP  KYKGYDES+TC FH GAK                                                        
Subjt:  TTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK--------------------------------------------------------

Query:  -------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAE
               DAP+ S+KP  ITV  PFEYKSSKAVPW+YECKVTVGQ+                               R+NE  SEKNKEKASEKKKEK E
Subjt:  -------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAE

Query:  EDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVV
        EDKKGK KL +D  DELVEAIVVKD +PKQP+SEEETQEFLKLVKQSEYKVIEQLGRTPA ISILSLLLSS+ H+N LLEALKQAFVSQDITVDNLSNVV
Subjt:  EDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVV

Query:  GNITASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAG
        GNITASSSI+FTDEEIP E                   AKVLVD+GSSLNIMPRSTLEKLPVDMS +RPSTVIVRAFDGARSAV                
Subjt:  GNITASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAG

Query:  ATVDTFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETA
         T    D +S+  F                   + KFAVDQKLVIISGQEDILVSR ASMSYVE AEEAFESSFQSFEIANATTLHGKFGRPKPRLLETA
Subjt:  ATVDTFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETA

Query:  FKGVNGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV--------
        FKG NGSLDKLLRMAKNTKKFGLGYKPSRG IIRVRSLEKAK LSRFENEERDYPRR VPPL+HSFRSA TIHQEYDE+SVVAAVTEEREQV        
Subjt:  FKGVNGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV--------

Query:  --------------------------------------------------ENASGALLKFLE--------------------------------------
                                                          +  S  LL+ LE                                      
Subjt:  --------------------------------------------------ENASGALLKFLE--------------------------------------

Query:  --------------CW-------------------------------------------RCARRIAASFKIAS---------------------------
                       W                                              ++I A F   S                           
Subjt:  --------------CW-------------------------------------------RCARRIAASFKIAS---------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------VSTAE------
                                                                                                  +T E      
Subjt:  -----------------------------------------------------------------------------------------VSTAE------

Query:  -------------------------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQY
                                                   VTENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAWAARRLRQY
Subjt:  -------------------------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQY

Query:  MLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGAS
        MLYYT WLISKMD  KYIFEKPSLSG IARWQVLLSEYDIVYVTQK IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDGAS
Subjt:  MLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGAS

Query:  NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYL
        NELGH IGAILISPKGELYPL ARLCFDC HNM EYEACSMGVQAA+D+KVKKLKVFGDS+LVIHQLRGEWETRDVKLLPYKQ ITELSQEFDEISFDYL
Subjt:  NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYL

Query:  PRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        PRENNQVADALATLAVMFNL+LNEDVRPIKVGRRDVPASCMSIEEEPDG PWFHDIKQYIKSKEYPPNASENDKRTLRKLA+KFFLNGEIL K
Subjt:  PRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]0.0e+0047.3Show/hide
Query:  MMEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGY-------GPPLRPVAEGFMPQYTTYNPLCGLP---------
        M +D+K+EQEKTRKDIEEL EKLDAILLALEKGK  A+   +SNP+HEP  TP +PPG+       GP  RP+ EGFM QYTTYNPL  +P         
Subjt:  MMEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGY-------GPPLRPVAEGFMPQYTTYNPLCGLP---------

Query:  -------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN-
                                       A  DP  Q APS+EK EVLEERLR++ G                             + +  +  P N 
Subjt:  -------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN-

Query:  ---CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTD
              KMAAY+QNDKLLIH FQDSLS P S WYM LDS HV SWKNLADSFLKQYKHNIDM  DRL+LQ ME K+ +SFKEY QRWRDT  Q QPP TD
Subjt:  ---CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTD

Query:  KELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYG
        KELS MFINTLKH FYDRMIG  ST+FSDI+TI ERIEYGV HGRITST  E    K  + SKKKEGEVQMVGA+RH W+Q PYG+T  Y PYYYP+PYG
Subjt:  KELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYG

Query:  YNQPFVNNATSHYSPYASQNFQPPASQNFQPR-------------------------------------------------DLIQPPYPRWYDANARCDY
        YNQP+VN AT  Y+  ASQNF+PPASQ FQPR                                                 D IQPPYP WYDAN RCDY
Subjt:  YNQPFVNNATSHYSPYASQNFQPPASQNFQPR-------------------------------------------------DLIQPPYPRWYDANARCDY

Query:  HAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYD
        HAGAIGHSTENCTALKYRVQALIKAG L FKKEN  DV  NPLPNH+NVQINA+ECQG+ES+SKV++ITTPM+ LFEIL   GY+S+E+LCP+++ + YD
Subjt:  HAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYD

Query:  ESMTCPFHAGAK---------------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKA
        E++TCP+HAGA+                                                               + P+ S +PITI V  PFEY SSKA
Subjt:  ESMTCPFHAGAK---------------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKA

Query:  VPWRYECKVTVGQ-------------------------------ERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPV
        VPW+YECKVTVGQ                               +  N+P S   KEKASEKK EK E+DKKGK KL +DIYDE  EA+      PKQPV
Subjt:  VPWRYECKVTVGQ-------------------------------ERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPV

Query:  SEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSN--VVGNITASSSITFTDEEIPLE-----AKVLVD
        SEEETQEFLKL+K SEYK+IEQLGRTPA+ISILSLLLSS++ R    E        +   + NL +   V  +   ++++    +  +E     A V   
Subjt:  SEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSN--VVGNITASSSITFTDEEIPLE-----AKVLVD

Query:  SGSSLNIMPRS-TLEKLPVD---------MSRIRPSTVIVRAFDGARSAVFLAGATVDTFDRSSS-----IHFTSETKFAVDQKLVIISGQE--------
        S   +  +     + KLP++         + ++RP  ++++  D  R  +     TV  +    +          + +  VD + +  +  +        
Subjt:  SGSSLNIMPRS-TLEKLPVD---------MSRIRPSTVIVRAFDGARSAVFLAGATVDTFDRSSS-----IHFTSETKFAVDQKLVIISGQE--------

Query:  DILVSRLASMSYVE-----AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLL--------------RM
        D+LV   A  S        +     + + +  E     TL G F            G    R + T F     K +   +D ++              ++
Subjt:  DILVSRLASMSYVE-----AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLL--------------RM

Query:  AKNTKKFGLGYKPSR-----------GYIIRVRSL----EKAKCLSRFENEERDYP-RRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGA
         +  +KF L   P++           G+++    +    +K K +      +     R  +  L++  R    +    +    +     +    E+   A
Subjt:  AKNTKKFGLGYKPSR-----------GYIIRVRSL----EKAKCLSRFENEERDYP-RRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGA

Query:  LLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLK
          K  +  +     +  +     +    +TENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCET YS VEKTCCALAWAA RLRQYMLYYT WLISKMD  
Subjt:  LLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLK

Query:  KYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPK
        KYIFEKPSLSGRIARWQVLLSEYDIVYVTQK IKGSA ADYLAQQ IN+Y  +KFDFP EYISTIT  EE+LDPQ WTMMFDGASNELGH IGAILI P 
Subjt:  KYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPK

Query:  GELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLP
        G L+PLTARLCFDCTHNM EYEACSMG+QAAVD+KVKKLKVFGDS+LV++QLRG+WETRD KLLP
Subjt:  GELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLP

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]0.0e+0054.46Show/hide
Query:  MEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPLC--------------------
        MEDQKAEQEKTRKDIEEL EKLD I L LEKGKA ADPATSSNP+HEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPL                     
Subjt:  MEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPLC--------------------

Query:  --------------------GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKM
                            G PAKTDPV Q+A S EK EVLEERLR++ G                             + +  +  P N       KM
Subjt:  --------------------GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKM

Query:  AAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFI
         AYVQN KLLIH FQDSL   ASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRL+LQRME  ST+SFKEYAQRWRDT  QVQPPLTDKELS MFI
Subjt:  AAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFI

Query:  NTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNN
        NTLKH FYDRMIG  STNFSDIMTI ERIEYGV+H RITSTA+EPLAAKKASHSKKKEGE+  V                                    
Subjt:  NTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNN

Query:  ATSHYSPYASQNFQPPASQNFQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQ
                              P D IQP YPRWYDANARCDYHAGAIGHSTENCTALKYRVQAL+KAGWLNFKKEN  DVSKNPL NHQNVQINAIECQ
Subjt:  ATSHYSPYASQNFQPPASQNFQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQ

Query:  GVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK---------------------------------------------
        G+ESKSKVADI TP EELFEILLGSGYVSVEYLCPNLKYK YDES+TCPFHAGAK                                             
Subjt:  GVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK---------------------------------------------

Query:  ------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEA
                          DAP+ SQKPITITV  PFEYKSSKAVPW+Y+CKVTVGQ+  + P    N  +  +   +   ++ K  T +  +   +L+E 
Subjt:  ------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEA

Query:  IVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLL---EALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIP
        +           ++     +  +       V+ +L   P    +   L   K+  + L+   + +++   +  +TV N    V NI            +P
Subjt:  IVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLL---EALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIP

Query:  LEAKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVFLAGAT------VDTFDRSSSIHFTSETKFAVDQKLVIISGQEDI-------
        +  K   +    + +  R      P D   +    V+V    G  +  F+ G +      +   DR  +   T    F      V+  G +++       
Subjt:  LEAKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVFLAGAT------VDTFDRSSSIHFTSETKFAVDQKLVIISGQEDI-------

Query:  LVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGK------------------FGRPKPRLL--ETAFKGVNGSLDKLLRMAK----NTKKFGLGYKPS
        +V+    + + E      +   +S +    TT+  K                  FG    +LL    + +G+    DK+  + +     T+K   G+   
Subjt:  LVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGK------------------FGRPKPRLL--ETAFKGVNGSLDKLLRMAK----NTKKFGLGYKPS

Query:  RGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGALLKFLECWRCARRIAASFKIASVSTAEVTE
          YI R  S   A C   F+   ++           +F   + I Q   +  ++   T  R         L+ +L                      VTE
Subjt:  RGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGALLKFLECWRCARRIAASFKIASVSTAEVTE

Query:  NSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQK
        NSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAW ARRLRQYMLYYT WLISKMD  KYIFEKPSLSGRIARWQVLLSEYDIVYVTQK
Subjt:  NSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQK

Query:  TIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAA
         IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLT +LCFDCTHNM EYEACSMGVQAA
Subjt:  TIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAA

Query:  VDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEE
        +D+KVKK KVFGDS LVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNL+LNEDVRPIKVGRRDVPASCMSIEEE
Subjt:  VDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEE

Query:  PDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        PDGNPWFHDIKQYIKSKEY PNASENDKRTLRKLAMKFFLNGEIL K
Subjt:  PDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]0.0e+0061.81Show/hide
Query:  LCGLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSL
        L GLPAKTD VGQ+APSNEKFEVLEERLR++ G                             + +  +  P N       KMAAYVQNDKLLIH FQDSL
Subjt:  LCGLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSL

Query:  SDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTN
        S PASRWYMQLDSS+VGSWKNLADSFLKQYKHNIDMAPDRL+LQRME KST+SFKEYAQRWRDT  QVQPPLTDKELS MFINTLKH FYDRMIG  STN
Subjt:  SDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTN

Query:  FSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPAS
        FSDIMTI ERIEYGV+HGRITST +EPLAAKKASHSKKKEGEVQMVGA+RHSWKQQPY +TPRYTPYYYPTPYGYNQPFVNNATSHYSPY  QNF+PPAS
Subjt:  FSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPAS

Query:  QNFQPR----------------------------------------------------------DLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKY
        QNFQP                                                           D IQPPYPRWYD NARCDYHAGAIGHSTENCTALKY
Subjt:  QNFQPR----------------------------------------------------------DLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKY

Query:  RVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----
        RVQALIKAGWLNFKKENG DVSKNPLPNHQNVQINAIECQ +ESKSKVADI TPM ELFEILLGSGYVSVEYLCPNLKYKGYDES+TCPFHAGAK     
Subjt:  RVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----

Query:  ----------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE---
                                                                  +APN S+KPITITV  PFEYKSSKAVPW+Y+CKVTVGQ+   
Subjt:  ----------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE---

Query:  ----------------------------RLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEY
                                     +NE  SEKNKEKASEKKKEK EEDKKGK KL +D++DELVEAIVVKD +PKQP+SEEETQE LKLVKQSEY
Subjt:  ----------------------------RLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEY

Query:  KVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPLE-------------------AKVLVDSGSSL
        KVIEQLGRTPAKISILSLLLSS+ HRN LLEALKQAFVSQDITVDNLSNVVGNI+ +SSITFTDEEIP E                   AKVLVD+GSSL
Subjt:  KVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPLE-------------------AKVLVDSGSSL

Query:  NIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVDTFDRSSSIHFT-----------------SETKFAVDQK-----LV
        NIMPRSTLEKLPVDMS +RPSTVIVRAFDGARSAV                 T    D +S+  F                   + KFAVDQ      L 
Subjt:  NIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVDTFDRSSSIHFT-----------------SETKFAVDQK-----LV

Query:  IISGQE-------DILVSR---LASMSYVE--AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLLRMA
          S ++       D+LV      ++ S+++  +     + + +  E     TL G F            G    R + T F     K +   +D ++  +
Subjt:  IISGQE-------DILVSR---LASMSYVE--AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLLRMA

Query:  K--------------NTKKFGLGYKPSR-----------GYIIRVR----SLEKAKCL-----SRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSV
        K                +KF L   P++           G+++        L+K K +      + + E R++  R    L++  R    +    +    
Subjt:  K--------------NTKKFGLGYKPSR-----------GYIIRVR----SLEKAKCL-----SRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSV

Query:  VAAVTEEREQVENASGALLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLR
        +     +    E+   A  K  +  +     +  +     +    VTENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAWA RRLR
Subjt:  VAAVTEEREQVENASGALLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLR

Query:  QYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDG
        QYMLYYT WLISKMD  KYIFEKPSLSGRIARWQVLLSEYDIVYVT+K IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDG
Subjt:  QYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDG

Query:  ASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFD
        ASNELGH IG ILISPKGELYPLTARLCFDCTHNM EYEACSMGVQAAVD+KVKKLKVFGDS+LVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFD
Subjt:  ASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFD

Query:  YLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        YLPRENNQVADALATLAVMFNL+LNEDV PIKVGRRDVPASCMSIEEEPDGNPWFH+IK YIKSKEYPPNASENDKRTLRKLAMKFFLNGEIL K
Subjt:  YLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

TrEMBL top hitse value%identityAlignment
A0A6J1CNY7 Ribonuclease H0.0e+0055.49Show/hide
Query:  GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSD
        GLPAKTDPVGQ+APSNEKFEVL+ERLR++ G                             + +  +  P N       KMAAYVQNDKLLIH FQDSLS 
Subjt:  GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSD

Query:  PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFS
        PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRL+LQRME KST+SFKEYAQRWRDT  QVQPPLTDKELS MFINTLKH FYDRM+G  STNFS
Subjt:  PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFS

Query:  DIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQN
        DIM I ERIEYGV+HGRITSTA+EPLAAKK SHSKKKEGE+  V                                                        
Subjt:  DIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQN

Query:  FQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFE
          P D IQPPYPRW DANARCDYH GAIGHS ENCTALKYRVQALIKAGWLNFKKENG DVS NPLPNH NVQINAIECQ +ESKSKVADITTPMEELFE
Subjt:  FQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFE

Query:  ILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----------------------------------------------------------------
        ILLGSGYVSVEYLCPNLKYKGYDES+TCPFHAGAK                                                                 
Subjt:  ILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----------------------------------------------------------------

Query:  ---DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAEEDKK
           DAPN S+KPITITV  PFEYKSSKAVPW+YECKVTVGQ+                               R++E  SEKNKEKASEKKKEK EEDKK
Subjt:  ---DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAEEDKK

Query:  GKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNIT
        GK KL +D++DELVEAIVVKD +PKQ V EEE QEFLKLVKQSEYKV EQLGRTPAKISILSLLLSS+ HRNTLLE LKQAFVSQDITVDNLSNVVGNIT
Subjt:  GKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNIT

Query:  ASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVD
        ASSSITFTDEEIP E                   AKVLVD+GSSLNIMPRSTLEKLPVDMS +RPSTVIVRAFDGARSAV                 T  
Subjt:  ASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVD

Query:  TFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGV
          D +S+  F                   + KFAVDQKLVIISGQEDILVSRLASM YVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKG 
Subjt:  TFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGV

Query:  NGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV------------
        N SLDKLLRMAKNTKKFGLGYKPSRG IIRVRSLEKAK LSRFENEERDYPRRTVPPLSHSFRSA TIHQEYD +SVVAAVTEEREQV            
Subjt:  NGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV------------

Query:  ----------------------------------ENASGALLKFLE----------------------------------------------------CW
                                          +  S  LL+ LE                                                     W
Subjt:  ----------------------------------ENASGALLKFLE----------------------------------------------------CW

Query:  -------------------------------------------RCARRIAASFKIAS-------------------------------------------
                                                      ++I A F   S                                           
Subjt:  -------------------------------------------RCARRIAASFKIAS-------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------VSTAE----------------------
                                                                                  +T E                      
Subjt:  -------------------------------------------------------------------------VSTAE----------------------

Query:  ---------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKK
                                   VTENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAW ARRLRQYMLYYT WLISKMD  K
Subjt:  ---------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKK

Query:  YIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKG
        YIFEK SLS RIAR QVLLSEYDIVYVTQK IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGH IGAILISPKG
Subjt:  YIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKG

Query:  ELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV
        ELYPLTARLCFDCTHNM EYEACSMGVQAAVD+KVKKLKVFGDS+LVIHQLRGEWETRDVKLLPYKQLITELSQEFDE+SFDYLPRENNQV DALATLAV
Subjt:  ELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV

Query:  MFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        MFNL+LNEDV PIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYI SKEYPPNASENDKRT RKLAMKFFLNGEIL K
Subjt:  MFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

A0A6J1D099 Ribonuclease H0.0e+0055.04Show/hide
Query:  MPQYTTYNPLCGLP----------------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG------------------
        MPQYTTYNPL  +P                                        AKTDPVGQ+APSNEKFEVL+ERLR++                    
Subjt:  MPQYTTYNPLCGLP----------------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG------------------

Query:  -----------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKST
                   + +  +  P N       KMAAYVQNDKLLIH FQDSLS PASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRL+LQRME KST
Subjt:  -----------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKST

Query:  KSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERH
        KSFKEYAQRWRDT  QVQPPL DKELS MFINTLKH FYDRMIG  STNFSDIMTI ERIEYGV+HGRITST +EPLAAKKASHSKKKEGEVQMVGA+RH
Subjt:  KSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERH

Query:  SWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQNFQ-----------------------------------------------
        SWKQQPY +TP+Y+PYYYPTPYGYNQPFVNNATSHY PYASQNF+PPASQNFQ                                               
Subjt:  SWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPASQNFQ-----------------------------------------------

Query:  -----------PRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADI
                   P D IQPPYPRWYDANARCDYHAGAI HSTENCT LKYRVQALIKAGW NFKKENG DVSK  L NHQNVQINAIECQG+ESKSKVADI
Subjt:  -----------PRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADI

Query:  TTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK--------------------------------------------------------
        TTPM ELFEILLGSGY+SVEYLCP  KYKGYDES+TC FH GAK                                                        
Subjt:  TTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK--------------------------------------------------------

Query:  -------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAE
               DAP+ S+KP  ITV  PFEYKSSKAVPW+YECKVTVGQ+                               R+NE  SEKNKEKASEKKKEK E
Subjt:  -------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE-------------------------------RLNEPASEKNKEKASEKKKEKAE

Query:  EDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVV
        EDKKGK KL +D  DELVEAIVVKD +PKQP+SEEETQEFLKLVKQSEYKVIEQLGRTPA ISILSLLLSS+ H+N LLEALKQAFVSQDITVDNLSNVV
Subjt:  EDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVV

Query:  GNITASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAG
        GNITASSSI+FTDEEIP E                   AKVLVD+GSSLNIMPRSTLEKLPVDMS +RPSTVIVRAFDGARSAV                
Subjt:  GNITASSSITFTDEEIPLE-------------------AKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAG

Query:  ATVDTFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETA
         T    D +S+  F                   + KFAVDQKLVIISGQEDILVSR ASMSYVE AEEAFESSFQSFEIANATTLHGKFGRPKPRLLETA
Subjt:  ATVDTFDRSSSIHFT-----------------SETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETA

Query:  FKGVNGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV--------
        FKG NGSLDKLLRMAKNTKKFGLGYKPSRG IIRVRSLEKAK LSRFENEERDYPRR VPPL+HSFRSA TIHQEYDE+SVVAAVTEEREQV        
Subjt:  FKGVNGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQV--------

Query:  --------------------------------------------------ENASGALLKFLE--------------------------------------
                                                          +  S  LL+ LE                                      
Subjt:  --------------------------------------------------ENASGALLKFLE--------------------------------------

Query:  --------------CW-------------------------------------------RCARRIAASFKIAS---------------------------
                       W                                              ++I A F   S                           
Subjt:  --------------CW-------------------------------------------RCARRIAASFKIAS---------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------VSTAE------
                                                                                                  +T E      
Subjt:  -----------------------------------------------------------------------------------------VSTAE------

Query:  -------------------------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQY
                                                   VTENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAWAARRLRQY
Subjt:  -------------------------------------------VTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQY

Query:  MLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGAS
        MLYYT WLISKMD  KYIFEKPSLSG IARWQVLLSEYDIVYVTQK IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDGAS
Subjt:  MLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGAS

Query:  NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYL
        NELGH IGAILISPKGELYPL ARLCFDC HNM EYEACSMGVQAA+D+KVKKLKVFGDS+LVIHQLRGEWETRDVKLLPYKQ ITELSQEFDEISFDYL
Subjt:  NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYL

Query:  PRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        PRENNQVADALATLAVMFNL+LNEDVRPIKVGRRDVPASCMSIEEEPDG PWFHDIKQYIKSKEYPPNASENDKRTLRKLA+KFFLNGEIL K
Subjt:  PRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222310.0e+0047.37Show/hide
Query:  MMEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGY-------GPPLRPVAEGFMPQYTTYNPLCGLP---------
        M +D+K+EQEKTRKDIEEL EKLDAILLALEKGK  A+   +SNP+HEP  TP +PPG+       GP  RP+ EGFM QYTTYNPL  +P         
Subjt:  MMEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGY-------GPPLRPVAEGFMPQYTTYNPLCGLP---------

Query:  -------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN-
                                       A  DP  Q APS+EK EVLEERLR++ G                             + +  +  P N 
Subjt:  -------------------------------AKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN-

Query:  ---CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTD
              KMAAY+QNDKLLIH FQDSLS P S WYM LDS HV SWKNLADSFLKQYKHNIDM  DRL+LQ ME K+ +SFKEY QRWRDT  Q QPP TD
Subjt:  ---CAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTD

Query:  KELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYG
        KELS MFINTLKH FYDRMIG  ST+FSDI+TI ERIEYGV HGRITST  E    K  + SKKKEGEVQMVGA+RH W+Q PYG+T  Y PYYYP+PYG
Subjt:  KELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYG

Query:  YNQPFVNNATSHYSPYASQNFQPPASQNFQPR-------------------------------------------------DLIQPPYPRWYDANARCDY
        YNQP+VN AT  Y+  ASQNF+PPASQ FQPR                                                 D IQPPYP WYDAN RCDY
Subjt:  YNQPFVNNATSHYSPYASQNFQPPASQNFQPR-------------------------------------------------DLIQPPYPRWYDANARCDY

Query:  HAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYD
        HAGAIGHSTENCTALKYRVQALIKAG L FKKEN  DV  NPLPNH+NVQINA+ECQG+ES+SKV++ITTPM+ LFEIL   GY+S+E+LCP+++ + YD
Subjt:  HAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYD

Query:  ESMTCPFHAGAK---------------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKA
        E++TCP+HAGA+                                                               + P+ S +PITI V  PFEY SSKA
Subjt:  ESMTCPFHAGAK---------------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKA

Query:  VPWRYECKVTVGQ-------------------------------ERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPV
        VPW+YECKVTVGQ                               +  N+P S   KEKASEKK EK E+DKKGK KL +DIYDE  EA+      PKQPV
Subjt:  VPWRYECKVTVGQ-------------------------------ERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPV

Query:  SEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSN--VVGNITASSSITFTDEEIPLE-----AKVLVD
        SEEETQEFLKL+K SEYK+IEQLGRTPA+ISILSLLLSS++ R    E        +   + NL +   V  +   ++++    +  +E     A V   
Subjt:  SEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSN--VVGNITASSSITFTDEEIPLE-----AKVLVD

Query:  SGSSLNIMPRS-TLEKLPVD---------MSRIRPSTVIVRAFDGARSAVFLAGATVDTFDRSSS-----IHFTSETKFAVDQKLVIISGQE--------
        S   +  +     + KLP++         + ++RP  ++++  D  R  +     TV  +    +          + +  VD + +  +  +        
Subjt:  SGSSLNIMPRS-TLEKLPVD---------MSRIRPSTVIVRAFDGARSAVFLAGATVDTFDRSSS-----IHFTSETKFAVDQKLVIISGQE--------

Query:  DILVSRLASMSYVE-----AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLL--------------RM
        D+LV   A  S        +     + + +  E     TL G F            G    R + T F     K +   +D ++              ++
Subjt:  DILVSRLASMSYVE-----AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLL--------------RM

Query:  AKNTKKFGLGYKPSR-----------GYIIRVRSL----EKAKCLSRFENEERDYP-RRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGA
         +  +KF L   P++           G+++    +    +K K +      +     R  +  L++  R    +    +    +     +    E+   A
Subjt:  AKNTKKFGLGYKPSR-----------GYIIRVRSL----EKAKCLSRFENEERDYP-RRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGA

Query:  LLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLK
          K  +  +     +  +     +    +TENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCET YS VEKTCCALAWAA RLRQYMLYYT WLISKMD  
Subjt:  LLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLK

Query:  KYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPK
        KYIFEKPSLSGRIARWQVLLSEYDIVYVTQK IKGSA ADYLAQQ INDY  +KFDFP EYISTIT  EE+LDPQ WTMMFDGASNELGH IGAILI P 
Subjt:  KYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPK

Query:  GELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLP
        G L+PLTARLCFDCTHNM EYEACSMG+QAAVD+KVKKLKVFGDS+LV++QLRG+WETRD KLLP
Subjt:  GELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLP

A0A6J1DZ90 Ribonuclease H0.0e+0054.46Show/hide
Query:  MEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPLC--------------------
        MEDQKAEQEKTRKDIEEL EKLD I L LEKGKA ADPATSSNP+HEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPL                     
Subjt:  MEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPLC--------------------

Query:  --------------------GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKM
                            G PAKTDPV Q+A S EK EVLEERLR++ G                             + +  +  P N       KM
Subjt:  --------------------GLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKM

Query:  AAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFI
         AYVQN KLLIH FQDSL   ASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRL+LQRME  ST+SFKEYAQRWRDT  QVQPPLTDKELS MFI
Subjt:  AAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFI

Query:  NTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNN
        NTLKH FYDRMIG  STNFSDIMTI ERIEYGV+H RITSTA+EPLAAKKASHSKKKEGE+  V                                    
Subjt:  NTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNN

Query:  ATSHYSPYASQNFQPPASQNFQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQ
                              P D IQP YPRWYDANARCDYHAGAIGHSTENCTALKYRVQAL+KAGWLNFKKEN  DVSKNPL NHQNVQINAIECQ
Subjt:  ATSHYSPYASQNFQPPASQNFQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQ

Query:  GVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK---------------------------------------------
        G+ESKSKVADI TP EELFEILLGSGYVSVEYLCPNLKYK YDES+TCPFHAGAK                                             
Subjt:  GVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK---------------------------------------------

Query:  ------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEA
                          DAP+ SQKPITITV  PFEYKSSKAVPW+Y+CKVTVGQ+  + P    N  +  +   +   ++ K  T +  +   +L+E 
Subjt:  ------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQERLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEA

Query:  IVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLL---EALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIP
        +           ++     +  +       V+ +L   P    +   L   K+  + L+   + +++   +  +TV N    V NI            +P
Subjt:  IVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLL---EALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIP

Query:  LEAKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVFLAGAT------VDTFDRSSSIHFTSETKFAVDQKLVIISGQEDI-------
        +  K   +    + +  R      P D   +    V+V    G  +  F+ G +      +   DR  +   T    F      V+  G +++       
Subjt:  LEAKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVFLAGAT------VDTFDRSSSIHFTSETKFAVDQKLVIISGQEDI-------

Query:  LVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGK------------------FGRPKPRLL--ETAFKGVNGSLDKLLRMAK----NTKKFGLGYKPS
        +V+    + + E      +   +S +    TT+  K                  FG    +LL    + +G+    DK+  + +     T+K   G+   
Subjt:  LVSRLASMSYVEAAEEAFESSFQSFEIANATTLHGK------------------FGRPKPRLL--ETAFKGVNGSLDKLLRMAK----NTKKFGLGYKPS

Query:  RGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGALLKFLECWRCARRIAASFKIASVSTAEVTE
          YI R  S   A C   F+   ++           +F   + I Q   +  ++   T  R         L+ +L                      VTE
Subjt:  RGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVAAVTEEREQVENASGALLKFLECWRCARRIAASFKIASVSTAEVTE

Query:  NSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQK
        NSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAW ARRLRQYMLYYT WLISKMD  KYIFEKPSLSGRIARWQVLLSEYDIVYVTQK
Subjt:  NSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQK

Query:  TIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAA
         IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLT +LCFDCTHNM EYEACSMGVQAA
Subjt:  TIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAA

Query:  VDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEE
        +D+KVKK KVFGDS LVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNL+LNEDVRPIKVGRRDVPASCMSIEEE
Subjt:  VDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEE

Query:  PDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        PDGNPWFHDIKQYIKSKEY PNASENDKRTLRKLAMKFFLNGEIL K
Subjt:  PDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

A0A6J1E2J7 Ribonuclease H0.0e+0061.81Show/hide
Query:  LCGLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSL
        L GLPAKTD VGQ+APSNEKFEVLEERLR++ G                             + +  +  P N       KMAAYVQNDKLLIH FQDSL
Subjt:  LCGLPAKTDPVGQSAPSNEKFEVLEERLRSMRG-----------------------------QTFLATLMPHN----CAWKMAAYVQNDKLLIHYFQDSL

Query:  SDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTN
        S PASRWYMQLDSS+VGSWKNLADSFLKQYKHNIDMAPDRL+LQRME KST+SFKEYAQRWRDT  QVQPPLTDKELS MFINTLKH FYDRMIG  STN
Subjt:  SDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTDKELSVMFINTLKHHFYDRMIGITSTN

Query:  FSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPAS
        FSDIMTI ERIEYGV+HGRITST +EPLAAKKASHSKKKEGEVQMVGA+RHSWKQQPY +TPRYTPYYYPTPYGYNQPFVNNATSHYSPY  QNF+PPAS
Subjt:  FSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNATSHYSPYASQNFQPPAS

Query:  QNFQPR----------------------------------------------------------DLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKY
        QNFQP                                                           D IQPPYPRWYD NARCDYHAGAIGHSTENCTALKY
Subjt:  QNFQPR----------------------------------------------------------DLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKY

Query:  RVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----
        RVQALIKAGWLNFKKENG DVSKNPLPNHQNVQINAIECQ +ESKSKVADI TPM ELFEILLGSGYVSVEYLCPNLKYKGYDES+TCPFHAGAK     
Subjt:  RVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAK-----

Query:  ----------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE---
                                                                  +APN S+KPITITV  PFEYKSSKAVPW+Y+CKVTVGQ+   
Subjt:  ----------------------------------------------------------DAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQE---

Query:  ----------------------------RLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEY
                                     +NE  SEKNKEKASEKKKEK EEDKKGK KL +D++DELVEAIVVKD +PKQP+SEEETQE LKLVKQSEY
Subjt:  ----------------------------RLNEPASEKNKEKASEKKKEKAEEDKKGKTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEY

Query:  KVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPLE-------------------AKVLVDSGSSL
        KVIEQLGRTPAKISILSLLLSS+ HRN LLEALKQAFVSQDITVDNLSNVVGNI+ +SSITFTDEEIP E                   AKVLVD+GSSL
Subjt:  KVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPLE-------------------AKVLVDSGSSL

Query:  NIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVDTFDRSSSIHFT-----------------SETKFAVDQK-----LV
        NIMPRSTLEKLPVDMS +RPSTVIVRAFDGARSAV                 T    D +S+  F                   + KFAVDQ      L 
Subjt:  NIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVF------------LAGATVDTFDRSSSIHFT-----------------SETKFAVDQK-----LV

Query:  IISGQE-------DILVSR---LASMSYVE--AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLLRMA
          S ++       D+LV      ++ S+++  +     + + +  E     TL G F            G    R + T F     K +   +D ++  +
Subjt:  IISGQE-------DILVSR---LASMSYVE--AAEEAFESSFQSFEIANATTLHGKF------------GRPKPRLLETAF-----KGVNGSLDKLLRMA

Query:  K--------------NTKKFGLGYKPSR-----------GYIIRVR----SLEKAKCL-----SRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSV
        K                +KF L   P++           G+++        L+K K +      + + E R++  R    L++  R    +    +    
Subjt:  K--------------NTKKFGLGYKPSR-----------GYIIRVR----SLEKAKCL-----SRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSV

Query:  VAAVTEEREQVENASGALLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLR
        +     +    E+   A  K  +  +     +  +     +    VTENSMGCVLGQHDDSGRKEQAIYYLSK+FTDCETRYSQVEKTCCALAWA RRLR
Subjt:  VAAVTEEREQVENASGALLKFLECWRCAR-RIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLR

Query:  QYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDG
        QYMLYYT WLISKMD  KYIFEKPSLSGRIARWQVLLSEYDIVYVT+K IKGSA ADYLAQQ INDY+PVKFDFPDEYISTITASEESLDPQTWTMMFDG
Subjt:  QYMLYYTKWLISKMDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDG

Query:  ASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFD
        ASNELGH IG ILISPKGELYPLTARLCFDCTHNM EYEACSMGVQAAVD+KVKKLKVFGDS+LVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFD
Subjt:  ASNELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFD

Query:  YLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK
        YLPRENNQVADALATLAVMFNL+LNEDV PIKVGRRDVPASCMSIEEEPDGNPWFH+IK YIKSKEYPPNASENDKRTLRKLAMKFFLNGEIL K
Subjt:  YLPRENNQVADALATLAVMFNLKLNEDVRPIKVGRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGK

SwissProt top hitse value%identityAlignment
P64956 Uncharacterized protein Mb2253c4.1e-0629.73Show/hide
Query:  THNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV-MFNLKLNEDVRP
        T+N+ EY     G+  AV +   +  V  DS LV+ Q+ G W+ +   LL        L+ +F  I+++++PR  N  AD LA  A+         D  P
Subjt:  THNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV-MFNLKLNEDVRP

Query:  IKVGRRDVPAS
         K+   + P S
Subjt:  IKVGRRDVPAS

P9WLH4 Uncharacterized protein MT22874.1e-0629.73Show/hide
Query:  THNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV-MFNLKLNEDVRP
        T+N+ EY     G+  AV +   +  V  DS LV+ Q+ G W+ +   LL        L+ +F  I+++++PR  N  AD LA  A+         D  P
Subjt:  THNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV-MFNLKLNEDVRP

Query:  IKVGRRDVPAS
         K+   + P S
Subjt:  IKVGRRDVPAS

P9WLH5 Bifunctional protein Rv2228c4.1e-0629.73Show/hide
Query:  THNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV-MFNLKLNEDVRP
        T+N+ EY     G+  AV +   +  V  DS LV+ Q+ G W+ +   LL        L+ +F  I+++++PR  N  AD LA  A+         D  P
Subjt:  THNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV-MFNLKLNEDVRP

Query:  IKVGRRDVPAS
         K+   + P S
Subjt:  IKVGRRDVPAS

Q9HSF6 Ribonuclease HI2.7e-1035.77Show/hide
Query:  FDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFD
        FDGAS  N     +G +L+S  G +           T+N  EY+A    ++AA D     +++ GDS LV  QL G W+T D  L   +    EL   FD
Subjt:  FDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFD

Query:  EISFDYLPRENNQVADALATLAV
        + S  ++PR  N+ ADALA  A+
Subjt:  EISFDYLPRENNQVADALATLAV

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein1.4e-0630.14Show/hide
Query:  PDEYISTITASEESLDPQTWTMMFDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGE
        P E +S +  S    D +T  + FDGAS  N       A+L +  G L     +     T+N  EY A  +G++ A++   K +KV GDS LV  Q++G+
Subjt:  PDEYISTITASEESLDPQTWTMMFDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGE

Query:  WETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV
        W+     L    +    L  +       ++ R  N  AD  A LAV
Subjt:  WETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAV

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.8e-1031.75Show/hide
Query:  TMMFDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQ
        T+ FDGAS  N      GA+L +    +         + T+N+ EY A  +G+++A+D   K + V GDS+LV  Q++G W+T   K+    +   EL  
Subjt:  TMMFDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQ

Query:  EFDEISFDYLPRENNQVADALATLAV
         F      ++ RE N  AD  A  A+
Subjt:  EFDEISFDYLPRENNQVADALATLAV

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.8e-1031.75Show/hide
Query:  TMMFDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQ
        T+ FDGAS  N      GA+L +    +         + T+N+ EY A  +G+++A+D   K + V GDS+LV  Q++G W+T   K+    +   EL  
Subjt:  TMMFDGAS--NELGHEIGAILISPKGELYPLTARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQ

Query:  EFDEISFDYLPRENNQVADALATLAV
         F      ++ RE N  AD  A  A+
Subjt:  EFDEISFDYLPRENNQVADALATLAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAAGATCAAAAAGCCGAGCAGGAGAAAACAAGGAAAGATATTGAGGAGTTGTGCGAAAAGTTAGATGCCATCCTCCTTGCGTTGGAAAAAGGCAAAGCGGCTGC
GGATCCTGCTACATCTAGCAACCCTGTCCATGAGCCGCAAGAAACCCCGCCTTATCCACCTGGGTACGGTCCACCTCTTAGGCCAGTGGCAGAGGGTTTCATGCCACAAT
ATACAACGTACAACCCTCTCTGTGGTCTCCCAGCCAAGACAGACCCAGTTGGACAAAGTGCCCCCAGTAACGAGAAGTTCGAAGTTCTGGAGGAAAGATTAAGATCAATG
AGGGGACAGACGTTTTTGGCAACATTGATGCCTCACAATTGTGCTTGGAAGATGGCAGCGTACGTCCAAAATGACAAATTGTTGATACACTACTTTCAAGATAGTTTATC
TGATCCAGCCTCTCGTTGGTACATGCAGTTAGATAGCTCTCATGTTGGCTCGTGGAAGAATCTGGCCGACTCCTTCCTAAAACAATATAAGCATAACATAGACATGGCTC
CAGATCGCTTAAATTTACAGAGGATGGAGAACAAGAGTACAAAAAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACAACAACTCAAGTCCAACCTCCTTTAACAGAT
AAAGAGCTATCTGTCATGTTCATCAATACCCTAAAACATCATTTCTATGATCGGATGATAGGAATCACTTCCACAAATTTCTCTGATATTATGACCATAGAAGAAAGGAT
CGAATACGGGGTTAAACACGGGCGAATAACCAGTACTGCCGAGGAGCCATTAGCCGCAAAGAAGGCAAGTCATTCCAAGAAGAAGGAAGGTGAGGTGCAAATGGTGGGAG
CAGAACGACACTCTTGGAAACAACAACCGTACGGTCAGACACCGCGATACACTCCATATTATTACCCAACGCCATACGGGTATAATCAACCGTTTGTGAATAATGCAACT
TCACATTACTCCCCTTATGCCTCTCAAAATTTTCAACCCCCAGCCAGTCAAAATTTCCAACCTAGGGATCTGATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGC
TCGTTGTGACTATCATGCAGGAGCCATAGGGCATTCAACAGAGAACTGCACTGCATTGAAATACAGAGTTCAAGCTTTGATCAAGGCAGGTTGGTTGAATTTTAAAAAAG
AAAATGGGCTTGATGTAAGTAAGAATCCGCTGCCGAATCATCAGAATGTCCAAATAAATGCAATCGAATGCCAAGGGGTCGAGTCGAAGAGTAAGGTTGCTGATATTACA
ACCCCTATGGAGGAACTATTTGAAATTCTCTTGGGCAGTGGATATGTATCAGTGGAGTACCTATGCCCAAACCTCAAGTATAAAGGGTATGATGAAAGTATGACGTGCCC
GTTCCACGCTGGGGCAAAAGATGCACCCAACTACAGTCAGAAACCAATCACTATCACGGTCTCAACTCCTTTCGAGTATAAAAGTTCCAAAGCAGTGCCTTGGAGATATG
AGTGCAAGGTAACTGTAGGGCAAGAGCGCCTGAATGAGCCCGCTAGTGAAAAGAATAAAGAGAAAGCAAGTGAGAAGAAGAAGGAAAAAGCAGAGGAAGATAAGAAAGGG
AAGACCAAACTCAAAGATGATATTTATGATGAACTGGTGGAGGCAATTGTTGTAAAGGATGCAAATCCTAAACAACCCGTGTCCGAAGAAGAGACTCAAGAGTTTCTAAA
GCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGGTCGGACACCCGCAAAGATCTCTATATTATCTTTACTGTTATCCTCTAAAGTGCATCGGAATACACTGT
TGGAGGCCTTGAAGCAGGCTTTCGTTTCACAAGATATCACAGTGGATAATTTGAGCAACGTTGTGGGGAATATAACAGCATCTAGCTCAATCACTTTTACAGACGAGGAA
ATACCACTAGAGGCAAAAGTCCTTGTGGACAGTGGATCTTCTTTAAACATAATGCCAAGATCCACGCTAGAGAAGTTACCCGTTGATATGTCCCGTATAAGACCTAGTAC
TGTGATAGTAAGAGCTTTTGATGGAGCTCGCAGCGCTGTTTTTCTTGCTGGGGCGACCGTGGATACATTCGACAGGAGCAGTTCCATCCACTTTACATCAGAAACTAAGT
TTGCGGTTGACCAAAAGTTAGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAAGGCTTGCTTCGATGTCATACGTTGAAGCAGCAGAAGAAGCTTTTGAGTCTTCA
TTCCAATCATTCGAGATTGCAAATGCTACAACATTACATGGGAAGTTTGGAAGACCTAAGCCACGACTTTTAGAGACCGCCTTTAAAGGAGTCAATGGAAGCTTAGACAA
ACTGCTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGAGGCTACATCATTAGAGTGCGGAGTCTGGAAAAGGCAAAATGCCTCTCAAGAT
TTGAGAATGAGGAGCGTGATTACCCTAGGAGGACTGTTCCACCTCTCAGCCACTCTTTCAGAAGTGCCGACACAATCCATCAAGAGTACGATGAGACCTCTGTAGTGGCA
GCAGTGACAGAAGAAAGAGAGCAAGTGGAGAATGCGTCAGGCGCATTGCTGAAGTTTCTGGAGTGTTGGCGATGCGCCAGACGCATCGCTGCTAGCTTTAAGATAGCCTC
TGTGTCAACTGCAGAAGTGACTGAAAACTCAATGGGATGTGTACTGGGGCAGCATGATGATTCAGGCAGGAAAGAACAGGCTATATATTATTTAAGTAAGGAGTTCACCG
ATTGCGAGACTAGATACTCTCAAGTAGAAAAAACTTGTTGTGCTCTAGCTTGGGCTGCTCGACGTCTAAGACAATACATGTTGTATTATACCAAATGGCTCATTTCAAAG
ATGGACCTCAAAAAGTACATTTTTGAAAAGCCGTCTCTCTCGGGTCGAATTGCAAGGTGGCAGGTTCTCTTGTCCGAATATGATATTGTCTATGTGACTCAAAAGACCAT
TAAGGGGAGTGCTTGGGCCGACTACCTAGCTCAACAACTTATAAATGACTACGTACCGGTGAAGTTCGATTTTCCAGATGAGTATATCTCCACCATAACCGCAAGTGAGG
AAAGTTTAGACCCGCAAACTTGGACCATGATGTTTGATGGTGCCTCTAACGAGTTAGGTCATGAGATAGGGGCTATTTTGATATCACCCAAAGGGGAACTATACCCTCTT
ACCGCTAGATTATGTTTTGACTGCACACATAATATGACCGAGTATGAAGCATGCTCGATGGGCGTCCAAGCTGCTGTTGATATAAAGGTTAAGAAACTTAAAGTTTTTGG
GGATTCTATACTAGTAATTCATCAACTAAGAGGAGAATGGGAAACAAGAGACGTTAAGTTGTTGCCTTATAAACAACTCATAACAGAATTGTCACAAGAATTTGATGAAA
TCTCATTTGATTATTTGCCAAGAGAAAATAATCAAGTAGCAGATGCATTGGCCACATTAGCGGTGATGTTCAATTTAAAACTCAATGAGGATGTCCGTCCGATTAAAGTT
GGGAGGAGAGATGTCCCAGCTTCTTGTATGAGCATTGAGGAAGAACCCGACGGTAACCCCTGGTTTCATGACATCAAACAATATATAAAGAGTAAAGAATATCCACCAAA
TGCTTCAGAAAATGATAAGCGCACCCTTCGCAAGTTGGCAATGAAGTTTTTCTTAAATGGAGAGATACTGGGCAAAGTCGTCGCCCCATTTGTTTTGGGTACTCTCGATC
CAGTTAAAAGGATTTTCGAAAGAAAGCGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGAAGATCAAAAAGCCGAGCAGGAGAAAACAAGGAAAGATATTGAGGAGTTGTGCGAAAAGTTAGATGCCATCCTCCTTGCGTTGGAAAAAGGCAAAGCGGCTGC
GGATCCTGCTACATCTAGCAACCCTGTCCATGAGCCGCAAGAAACCCCGCCTTATCCACCTGGGTACGGTCCACCTCTTAGGCCAGTGGCAGAGGGTTTCATGCCACAAT
ATACAACGTACAACCCTCTCTGTGGTCTCCCAGCCAAGACAGACCCAGTTGGACAAAGTGCCCCCAGTAACGAGAAGTTCGAAGTTCTGGAGGAAAGATTAAGATCAATG
AGGGGACAGACGTTTTTGGCAACATTGATGCCTCACAATTGTGCTTGGAAGATGGCAGCGTACGTCCAAAATGACAAATTGTTGATACACTACTTTCAAGATAGTTTATC
TGATCCAGCCTCTCGTTGGTACATGCAGTTAGATAGCTCTCATGTTGGCTCGTGGAAGAATCTGGCCGACTCCTTCCTAAAACAATATAAGCATAACATAGACATGGCTC
CAGATCGCTTAAATTTACAGAGGATGGAGAACAAGAGTACAAAAAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACAACAACTCAAGTCCAACCTCCTTTAACAGAT
AAAGAGCTATCTGTCATGTTCATCAATACCCTAAAACATCATTTCTATGATCGGATGATAGGAATCACTTCCACAAATTTCTCTGATATTATGACCATAGAAGAAAGGAT
CGAATACGGGGTTAAACACGGGCGAATAACCAGTACTGCCGAGGAGCCATTAGCCGCAAAGAAGGCAAGTCATTCCAAGAAGAAGGAAGGTGAGGTGCAAATGGTGGGAG
CAGAACGACACTCTTGGAAACAACAACCGTACGGTCAGACACCGCGATACACTCCATATTATTACCCAACGCCATACGGGTATAATCAACCGTTTGTGAATAATGCAACT
TCACATTACTCCCCTTATGCCTCTCAAAATTTTCAACCCCCAGCCAGTCAAAATTTCCAACCTAGGGATCTGATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGC
TCGTTGTGACTATCATGCAGGAGCCATAGGGCATTCAACAGAGAACTGCACTGCATTGAAATACAGAGTTCAAGCTTTGATCAAGGCAGGTTGGTTGAATTTTAAAAAAG
AAAATGGGCTTGATGTAAGTAAGAATCCGCTGCCGAATCATCAGAATGTCCAAATAAATGCAATCGAATGCCAAGGGGTCGAGTCGAAGAGTAAGGTTGCTGATATTACA
ACCCCTATGGAGGAACTATTTGAAATTCTCTTGGGCAGTGGATATGTATCAGTGGAGTACCTATGCCCAAACCTCAAGTATAAAGGGTATGATGAAAGTATGACGTGCCC
GTTCCACGCTGGGGCAAAAGATGCACCCAACTACAGTCAGAAACCAATCACTATCACGGTCTCAACTCCTTTCGAGTATAAAAGTTCCAAAGCAGTGCCTTGGAGATATG
AGTGCAAGGTAACTGTAGGGCAAGAGCGCCTGAATGAGCCCGCTAGTGAAAAGAATAAAGAGAAAGCAAGTGAGAAGAAGAAGGAAAAAGCAGAGGAAGATAAGAAAGGG
AAGACCAAACTCAAAGATGATATTTATGATGAACTGGTGGAGGCAATTGTTGTAAAGGATGCAAATCCTAAACAACCCGTGTCCGAAGAAGAGACTCAAGAGTTTCTAAA
GCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGGTCGGACACCCGCAAAGATCTCTATATTATCTTTACTGTTATCCTCTAAAGTGCATCGGAATACACTGT
TGGAGGCCTTGAAGCAGGCTTTCGTTTCACAAGATATCACAGTGGATAATTTGAGCAACGTTGTGGGGAATATAACAGCATCTAGCTCAATCACTTTTACAGACGAGGAA
ATACCACTAGAGGCAAAAGTCCTTGTGGACAGTGGATCTTCTTTAAACATAATGCCAAGATCCACGCTAGAGAAGTTACCCGTTGATATGTCCCGTATAAGACCTAGTAC
TGTGATAGTAAGAGCTTTTGATGGAGCTCGCAGCGCTGTTTTTCTTGCTGGGGCGACCGTGGATACATTCGACAGGAGCAGTTCCATCCACTTTACATCAGAAACTAAGT
TTGCGGTTGACCAAAAGTTAGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAAGGCTTGCTTCGATGTCATACGTTGAAGCAGCAGAAGAAGCTTTTGAGTCTTCA
TTCCAATCATTCGAGATTGCAAATGCTACAACATTACATGGGAAGTTTGGAAGACCTAAGCCACGACTTTTAGAGACCGCCTTTAAAGGAGTCAATGGAAGCTTAGACAA
ACTGCTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGAGGCTACATCATTAGAGTGCGGAGTCTGGAAAAGGCAAAATGCCTCTCAAGAT
TTGAGAATGAGGAGCGTGATTACCCTAGGAGGACTGTTCCACCTCTCAGCCACTCTTTCAGAAGTGCCGACACAATCCATCAAGAGTACGATGAGACCTCTGTAGTGGCA
GCAGTGACAGAAGAAAGAGAGCAAGTGGAGAATGCGTCAGGCGCATTGCTGAAGTTTCTGGAGTGTTGGCGATGCGCCAGACGCATCGCTGCTAGCTTTAAGATAGCCTC
TGTGTCAACTGCAGAAGTGACTGAAAACTCAATGGGATGTGTACTGGGGCAGCATGATGATTCAGGCAGGAAAGAACAGGCTATATATTATTTAAGTAAGGAGTTCACCG
ATTGCGAGACTAGATACTCTCAAGTAGAAAAAACTTGTTGTGCTCTAGCTTGGGCTGCTCGACGTCTAAGACAATACATGTTGTATTATACCAAATGGCTCATTTCAAAG
ATGGACCTCAAAAAGTACATTTTTGAAAAGCCGTCTCTCTCGGGTCGAATTGCAAGGTGGCAGGTTCTCTTGTCCGAATATGATATTGTCTATGTGACTCAAAAGACCAT
TAAGGGGAGTGCTTGGGCCGACTACCTAGCTCAACAACTTATAAATGACTACGTACCGGTGAAGTTCGATTTTCCAGATGAGTATATCTCCACCATAACCGCAAGTGAGG
AAAGTTTAGACCCGCAAACTTGGACCATGATGTTTGATGGTGCCTCTAACGAGTTAGGTCATGAGATAGGGGCTATTTTGATATCACCCAAAGGGGAACTATACCCTCTT
ACCGCTAGATTATGTTTTGACTGCACACATAATATGACCGAGTATGAAGCATGCTCGATGGGCGTCCAAGCTGCTGTTGATATAAAGGTTAAGAAACTTAAAGTTTTTGG
GGATTCTATACTAGTAATTCATCAACTAAGAGGAGAATGGGAAACAAGAGACGTTAAGTTGTTGCCTTATAAACAACTCATAACAGAATTGTCACAAGAATTTGATGAAA
TCTCATTTGATTATTTGCCAAGAGAAAATAATCAAGTAGCAGATGCATTGGCCACATTAGCGGTGATGTTCAATTTAAAACTCAATGAGGATGTCCGTCCGATTAAAGTT
GGGAGGAGAGATGTCCCAGCTTCTTGTATGAGCATTGAGGAAGAACCCGACGGTAACCCCTGGTTTCATGACATCAAACAATATATAAAGAGTAAAGAATATCCACCAAA
TGCTTCAGAAAATGATAAGCGCACCCTTCGCAAGTTGGCAATGAAGTTTTTCTTAAATGGAGAGATACTGGGCAAAGTCGTCGCCCCATTTGTTTTGGGTACTCTCGATC
CAGTTAAAAGGATTTTCGAAAGAAAGCGAAGGTGA
Protein sequenceShow/hide protein sequence
MMEDQKAEQEKTRKDIEELCEKLDAILLALEKGKAAADPATSSNPVHEPQETPPYPPGYGPPLRPVAEGFMPQYTTYNPLCGLPAKTDPVGQSAPSNEKFEVLEERLRSM
RGQTFLATLMPHNCAWKMAAYVQNDKLLIHYFQDSLSDPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDMAPDRLNLQRMENKSTKSFKEYAQRWRDTTTQVQPPLTD
KELSVMFINTLKHHFYDRMIGITSTNFSDIMTIEERIEYGVKHGRITSTAEEPLAAKKASHSKKKEGEVQMVGAERHSWKQQPYGQTPRYTPYYYPTPYGYNQPFVNNAT
SHYSPYASQNFQPPASQNFQPRDLIQPPYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGLDVSKNPLPNHQNVQINAIECQGVESKSKVADIT
TPMEELFEILLGSGYVSVEYLCPNLKYKGYDESMTCPFHAGAKDAPNYSQKPITITVSTPFEYKSSKAVPWRYECKVTVGQERLNEPASEKNKEKASEKKKEKAEEDKKG
KTKLKDDIYDELVEAIVVKDANPKQPVSEEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSKVHRNTLLEALKQAFVSQDITVDNLSNVVGNITASSSITFTDEE
IPLEAKVLVDSGSSLNIMPRSTLEKLPVDMSRIRPSTVIVRAFDGARSAVFLAGATVDTFDRSSSIHFTSETKFAVDQKLVIISGQEDILVSRLASMSYVEAAEEAFESS
FQSFEIANATTLHGKFGRPKPRLLETAFKGVNGSLDKLLRMAKNTKKFGLGYKPSRGYIIRVRSLEKAKCLSRFENEERDYPRRTVPPLSHSFRSADTIHQEYDETSVVA
AVTEEREQVENASGALLKFLECWRCARRIAASFKIASVSTAEVTENSMGCVLGQHDDSGRKEQAIYYLSKEFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTKWLISK
MDLKKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKTIKGSAWADYLAQQLINDYVPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPL
TARLCFDCTHNMTEYEACSMGVQAAVDIKVKKLKVFGDSILVIHQLRGEWETRDVKLLPYKQLITELSQEFDEISFDYLPRENNQVADALATLAVMFNLKLNEDVRPIKV
GRRDVPASCMSIEEEPDGNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILGKVVAPFVLGTLDPVKRIFERKRR