; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G072120 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G072120
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Descriptionnuclear intron maturase 4, mitochondrial isoform X2
Genome locationCla97Chr04:18768980..18772683
RNA-Seq ExpressionCla97C04G072120
SyntenyCla97C04G072120
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006315 - homing of group II introns (biological process)
GO:0007005 - mitochondrion organization (biological process)
GO:0009845 - seed germination (biological process)
GO:0032885 - regulation of polysaccharide biosynthetic process (biological process)
GO:0090615 - mitochondrial mRNA processing (biological process)
GO:1900864 - mitochondrial RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR024937 - Domain X
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041778.1 hypothetical protein E6C27_scaffold67G001360 [Cucumis melo var. makuwa]0.0e+0088.48Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDI                      G+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EE
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE

Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK
        SLDVDLRRSKT+MELKRSLEIQIK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMD+LVMAKLITVMEDKI+DP+LFAVIRSI++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV
        ALNLEFG FPKGHGLPQEGVLSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLKGNS +YPGEEKDKIRVYCCRYMDEIFLAV
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE
        SGSKDVALSFRSEIFDFMQKTLHLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKE
Subjt:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE

Query:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP
        SEIKHLAKNSSLNQISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKP
Subjt:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP

Query:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE
        TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSE
Subjt:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE

Query:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
        LS IYSSPE+EQ +E KS+DTH LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
Subjt:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG

Query:  LCKKHLEDLYLGHISLQSIDFGAWK
        LCK+HL DLYLG ISLQS+DFGAWK
Subjt:  LCKKHLEDLYLGHISLQSIDFGAWK

XP_008442019.1 PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo]0.0e+0090.78Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDIG+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EESLDVDLRRSKT+MELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSI++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLK NS +YPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRK G
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELS IYSSPE+EQE+E KS+DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
         LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

XP_038882001.1 nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida]0.0e+0093.56Show/hide
Query:  GYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRS
        GYA M+FGGLQRFCRINMRN++ L  VNVCKV+SS VS IG+ VQRVQ+S NYSTL  ADDEIDKG+EK KLA NLASL+EESLDVDL+RSKTQMELKRS
Subjt:  GYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRS

Query:  LEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIR
        LEIQIK+RVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMS DCLISFESMAEELSNGNFDVNANTFSILSSRKEVL+LPKI+LKVLQEAIR
Subjt:  LEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQE
        IVLECVFRPHFSKISHGCRSGRGHST LKYIRKEIKNPDWWFT+DLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIYVAGALNLEFGGFPKGHGLPQE
Subjt:  IVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQE

Query:  GVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFM
        G+LSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGNS +YPGE+KDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIF F+
Subjt:  GVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFM

Query:  QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSF
        QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSF
Subjt:  QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSF

Query:  RKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLF
        RKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSFYEFQRCV++YISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL 
Subjt:  RKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLF

Query:  RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKS
        RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL+LICD VRKSCIRTLAAKHRIHESEIEKKFDSELS +YSSPE+EQEEE KS
Subjt:  RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKS

Query:  SDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS
         DTHGLDHDEALKYGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS
Subjt:  SDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS

Query:  IDFGAWK
        IDFGAWK
Subjt:  IDFGAWK

XP_038882003.1 nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida]0.0e+0092.75Show/hide
Query:  YSIYVIGGYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKT
        + + V+  YA M+FGGLQRFCRINMRN++ L  VNVCKV+SS VS IG+ VQRVQ+S NYSTL  ADDEIDKG+EK KLA NLASL+EESLDVDL+RSKT
Subjt:  YSIYVIGGYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKT

Query:  QMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLK
        QMELKRSLEIQIK+RVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMS DCLISFESMAEELSNGNFDVNANTFSILSSRKEVL+LPKI+LK
Subjt:  QMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLK

Query:  VLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPK
        VLQEAIRIVLECVFRPHFSKISHGCRSGRGHST LKYIRKEIKNPDWWFT+DLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIYVAGALNLEFGGFPK
Subjt:  VLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPK

Query:  GHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFR
        GHGLPQEG+LSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGNS +YPGE+KDKIRVYCCRYMDEIFLAVSGSKDVALSFR
Subjt:  GHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFR

Query:  SEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS
        SEIF F+QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS
Subjt:  SEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS

Query:  LNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVN
        LNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSFYEFQRCV++YISAETASTVALLPNYDPSVKPTFITEIIAPVN
Subjt:  LNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVN

Query:  SIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELE
        SIRKRL RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL+LICD VRKSCIRTLAAKHRIHESEIEKKFDSELS +YSSPE+E
Subjt:  SIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELE

Query:  QEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYL
        QEEE KS DTHGLDHDEALKYGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYL
Subjt:  QEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYL

Query:  GHISLQSIDFGAWK
        GHISLQSIDFGAWK
Subjt:  GHISLQSIDFGAWK

XP_038882004.1 nuclear intron maturase 4, mitochondrial isoform X3 [Benincasa hispida]0.0e+0093.65Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGGLQRFCRINMRN++ L  VNVCKV+SS VS IG+ VQRVQ+S NYSTL  ADDEIDKG+EK KLA NLASL+EESLDVDL+RSKTQMELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMS DCLISFESMAEELSNGNFDVNANTFSILSSRKEVL+LPKI+LKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYIRKEIKNPDWWFT+DLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIYVAGALNLEFGGFPKGHGLPQEG+LS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGNS +YPGE+KDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIF F+QKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSFYEFQRCV++YISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL RYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL+LICD VRKSCIRTLAAKHRIHESEIEKKFDSELS +YSSPE+EQEEE KS DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
        GLDHDEALKYGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

TrEMBL top hitse value%identityAlignment
A0A0A0KWB0 Reverse transcriptase domain-containing protein0.0e+0090.04Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        MKFGGL+RFCRINMRN S LQ+VNVC  NSSFVSDIG+CVQ VQ S NYSTLA A  EIDKG+E+ KLA NLASL+EESLDVDLRRSKTQMELKRSLEI+
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMG VIACP TLQN YDC+RINSNVDI S D LISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIY+AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGN+ +Y GEEKDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVN +EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+ISSFRK G
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVE SLP ELRDSFYEFQR VK+YIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRL RYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSELS IYSS E++QE+E KS+DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
         LDHDEALKYGISYSGLCLLS ARMV+ SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

A0A1S3B491 uncharacterized protein LOC1034860080.0e+0090.78Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDIG+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EESLDVDLRRSKT+MELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSI++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLK NS +YPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRK G
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELS IYSSPE+EQE+E KS+DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
         LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

A0A5A7TFZ7 Reverse transcriptase domain-containing protein0.0e+0088.48Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDI                      G+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EE
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE

Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK
        SLDVDLRRSKT+MELKRSLEIQIK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMD+LVMAKLITVMEDKI+DP+LFAVIRSI++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV
        ALNLEFG FPKGHGLPQEGVLSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLKGNS +YPGEEKDKIRVYCCRYMDEIFLAV
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE
        SGSKDVALSFRSEIFDFMQKTLHLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKE
Subjt:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE

Query:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP
        SEIKHLAKNSSLNQISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKP
Subjt:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP

Query:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE
        TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSE
Subjt:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE

Query:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
        LS IYSSPE+EQ +E KS+DTH LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
Subjt:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG

Query:  LCKKHLEDLYLGHISLQSIDFGAWK
        LCK+HL DLYLG ISLQS+DFGAWK
Subjt:  LCKKHLEDLYLGHISLQSIDFGAWK

A0A6J1CXL0 nuclear intron maturase 4, mitochondrial isoform X20.0e+0085.32Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGG QRFCR+NMRN ++L+   +CKVNSSFVSDIG+CVQRVQ+S NYS LA ADD+  KG+EKKKLA NLASL+EESLDVD RR K++MELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKKRVKAQY+NGKF+DLMGKVIACP TLQNAYDCVRINSNVDI S D LISFESMAEEL NG+FDVNANTFSI SS+KEVLILPK+KLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYIRKEI NPDWWFTVD+SKKMDEL MAKLI+VMEDKI+DP  FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PILMNIYLNLFDQEFFRLSMKYEAIN+ GN  QDGSQS+LRSWFRR+LKGN  EYP +EKD IRVYCCRYMDEIF+AVSGSKDVALSFRSEI DF+QK+L
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRKA
        HLDVNHQEEMVSC ET GIRFLGCLVRRS +ESPAVK+VHKLKEKVELFALQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKNS SLNQISSFRK 
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRKA

Query:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYR
        GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS Y VEPSLP+ELRDSFYEFQR V++Y+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRL RYR
Subjt:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYR

Query:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDT
        L+TNKG+PC+SPFLIL DNTQIIDWFLGV RRW +WY+NCSNFSE++LICDQVRKSCIRTLAAKHR HESEIEKKFD ELS I S+PE+EQEEE+++SDT
Subjt:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDT

Query:  HGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDF
        HGL HDEA  YGISYSGLCLLSLARMV+ SRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCK+HL+DLYLGHISLQS++F
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDF

Query:  GAWK
        GAWK
Subjt:  GAWK

A0A6J1CYJ7 nuclear intron maturase 4, mitochondrial isoform X10.0e+0085.22Show/hide
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI-GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEI
        M+FGG QRFCR+NMRN ++L+   +CKVNSSFVSDI G+CVQRVQ+S NYS LA ADD+  KG+EKKKLA NLASL+EESLDVD RR K++MELKRSLEI
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI-GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEI

Query:  QIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVL
        QIKKRVKAQY+NGKF+DLMGKVIACP TLQNAYDCVRINSNVDI S D LISFESMAEEL NG+FDVNANTFSI SS+KEVLILPK+KLKVLQEAIRIVL
Subjt:  QIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVL

Query:  ECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVL
        ECVFRPHFSKISHGCRSGRGHST LKYIRKEI NPDWWFTVD+SKKMDEL MAKLI+VMEDKI+DP  FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVL
Subjt:  ECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVL

Query:  SPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKT
        SPILMNIYLNLFDQEFFRLSMKYEAIN+ GN  QDGSQS+LRSWFRR+LKGN  EYP +EKD IRVYCCRYMDEIF+AVSGSKDVALSFRSEI DF+QK+
Subjt:  SPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKT

Query:  LHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRK
        LHLDVNHQEEMVSC ET GIRFLGCLVRRS +ESPAVK+VHKLKEKVELFALQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKNS SLNQISSFRK
Subjt:  LHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRK

Query:  AGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRY
         GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS Y VEPSLP+ELRDSFYEFQR V++Y+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRL RY
Subjt:  AGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRY

Query:  RLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSD
        RL+TNKG+PC+SPFLIL DNTQIIDWFLGV RRW +WY+NCSNFSE++LICDQVRKSCIRTLAAKHR HESEIEKKFD ELS I S+PE+EQEEE+++SD
Subjt:  RLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSD

Query:  THGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID
        THGL HDEA  YGISYSGLCLLSLARMV+ SRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCK+HL+DLYLGHISLQS++
Subjt:  THGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID

Query:  FGAWK
        FGAWK
Subjt:  FGAWK

SwissProt top hitse value%identityAlignment
B1N1A3 Putative nicotine oxidoreductase2.7e-1927.66Show/hide
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP
        KV+QE IR +LE ++ P FSK SHG R+G+   T LK +R+      W    D+    D +  +KLI  +  +I D R   +IR    AG    E G F 
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP

Query:  KGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDK-------------------------
            G PQ  ++SPIL N++L+  D++  +L +K     E G+   D +  +L+   +  L+  + +  G E+D                          
Subjt:  KGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDK-------------------------

Query:  IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKE
        IRV   RY D+  + V+G K +A   RS + +F++    L+++ ++  +   ++   +FLG  +R   + S  +K +   K+
Subjt:  IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKE

P0A3U0 Group II intron-encoded protein LtrA1.4e-2027.84Show/hide
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   T LK I++E     W+   D+    D +    LI ++  KI D ++  +I   
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++  +    T +     ++ +  S   ++L+G        EY  + K     
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----

Query:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK
              +K+  Y  RY D+  ++V GSK+     + ++  F+   L ++++ ++ +++   +   RFLG  +R  V+ S  +K   K+K++
Subjt:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK

P0A3U1 Group II intron-encoded protein LtrA1.4e-2027.84Show/hide
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   T LK I++E     W+   D+    D +    LI ++  KI D ++  +I   
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++  +    T +     ++ +  S   ++L+G        EY  + K     
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----

Query:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK
              +K+  Y  RY D+  ++V GSK+     + ++  F+   L ++++ ++ +++   +   RFLG  +R  V+ S  +K   K+K++
Subjt:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK

Q9CA78 Nuclear intron maturase 4, mitochondrial6.6e-25258.95Show/hide
Query:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD
        LA  LASL+EES     D  + +++MELKRSLE+++KKRVK Q +NGKF DL+ KVIA P TL++AYDC+R+NSNV I   +  ++F+S+AEELS+G FD
Subjt:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD

Query:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID
        V +NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ LKYI   I   DW FT+ L+KK+D  V   L++VME+K++
Subjt:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID

Query:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI
        D  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +LMNIYL+ FD EF+R+SM++EA+  +  T +D   S+LRSWFRRQ      +   E+   +
Subjt:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI

Query:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW
        RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  + +   C  T G+R LG LVR++V+ESP VK+VHKLKEKV LFALQK+E W   TV 
Subjt:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE
        +GKKWL HGLKKVKESEIK LA  NS+L+QIS  RKAGMETDHWYK+LL+IWM+D L   A  SEE +LSK+ VEP++P ELRD+FY+FQ     Y+S+E
Subjt:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE

Query:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL
        TA+  ALLP      +P F  +++AP N+I +RL+RY L+T KG+  S+  LIL D  QIIDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRTL
Subjt:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL

Query:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW
        AAK+RIHE+EIEK+ D ELS I S+ ++EQE + +  D+   D DE L YG+S SGLCLLSLAR+V+ SRPCNCFV+GC   AP+VYTLH MERQKFPGW
Subjt:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW

Query:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK
        KTGFS  I  SLN RR GLCK+HL+DLY+G ISLQ++DFGAW+
Subjt:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK

Q9LZA5 Nuclear intron maturase 3, mitochondrial1.5e-5425.49Show/hide
Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS
        SL ++  ++ T+  +K  LE  + K    QY +GKF  L+   ++ P  L  A   + +++N      D +    S E M  E+  G FD+ +     +S
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS

Query:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI
        S    L+LP +KLKVL EAIR+VLE V+   F+  S+G R G G  T ++Y++  ++NP WWF V  +++M +E  +  L   + +KI+D  L  +I+ +
Subjt:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI
        +  G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N    TG + S             GN F  P      + +Y  RY+DEI
Subjt:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI

Query:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK
         +  SGSK + +  +  I D +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K
Subjt:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK

Query:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK
           H LKK+K+S              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V 
Subjt:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK

Query:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFL-----GVSRRWFRW
        ++++   A  V  L + +  V+                    ++ AP   +RK +       + G P     L+  +++ II W+      G +++  R 
Subjt:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFL-----GVSRRWFRW

Query:  YNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCF
        Y      S+L                      +   E  F SE        E++   +K  SD   +D            G   L L R+ +     +C 
Subjt:  YNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCF

Query:  VVGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID
           C      ++ +H+++ +          W  G   +IH +LN++   LC  H+ D+YLG I+LQ +D
Subjt:  VVGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID

Arabidopsis top hitse value%identityAlignment
AT1G74350.1 Intron maturase, type II family protein4.7e-25358.95Show/hide
Query:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD
        LA  LASL+EES     D  + +++MELKRSLE+++KKRVK Q +NGKF DL+ KVIA P TL++AYDC+R+NSNV I   +  ++F+S+AEELS+G FD
Subjt:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD

Query:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID
        V +NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ LKYI   I   DW FT+ L+KK+D  V   L++VME+K++
Subjt:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID

Query:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI
        D  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +LMNIYL+ FD EF+R+SM++EA+  +  T +D   S+LRSWFRRQ      +   E+   +
Subjt:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI

Query:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW
        RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  + +   C  T G+R LG LVR++V+ESP VK+VHKLKEKV LFALQK+E W   TV 
Subjt:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE
        +GKKWL HGLKKVKESEIK LA  NS+L+QIS  RKAGMETDHWYK+LL+IWM+D L   A  SEE +LSK+ VEP++P ELRD+FY+FQ     Y+S+E
Subjt:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE

Query:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL
        TA+  ALLP      +P F  +++AP N+I +RL+RY L+T KG+  S+  LIL D  QIIDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRTL
Subjt:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL

Query:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW
        AAK+RIHE+EIEK+ D ELS I S+ ++EQE + +  D+   D DE L YG+S SGLCLLSLAR+V+ SRPCNCFV+GC   AP+VYTLH MERQKFPGW
Subjt:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW

Query:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK
        KTGFS  I  SLN RR GLCK+HL+DLY+G ISLQ++DFGAW+
Subjt:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK

AT5G04050.1 RNA-directed DNA polymerase (reverse transcriptase)7.6e-5427.24Show/hide
Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS
        SL ++  ++ T+  +K  LE  + K    QY +GKF  L+   ++ P  L  A   + +++N      D +    S E M  E+  G FD+ +     +S
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS

Query:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI
        S    L+LP +KLKVL EAIR+VLE V+   F+  S+G R G G  T ++Y++  ++NP WWF V  +++M +E  +  L   + +KI+D  L  +I+ +
Subjt:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI
        +  G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N    TG + S             GN F  P      + +Y  RY+DEI
Subjt:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI

Query:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK
         +  SGSK + +  +  I D +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K
Subjt:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK

Query:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK
           H LKK+K+S              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V 
Subjt:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK

Query:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCS
        ++++   A  V  L + +  V+                    ++ AP   +RK +       + G P     L+  +++ II W+ GV R+W  ++  C 
Subjt:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCS

Query:  NF
        N+
Subjt:  NF

AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase)3.0e-5024.53Show/hide
Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS
        SL ++  ++ T+  +K  LE  + K    QY +GKF  L+   ++ P  L  A   + +++N      D +    S E M  E+  G FD+ +     +S
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS

Query:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI
        S    L+LP +KLKVL EAIR+VLE V+   F+  S+G R G G  T ++Y++  ++NP WWF V  +++M +E  +  L   + +KI+D  L  +I+ +
Subjt:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI
        +  G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N    TG + S             GN F  P      + +Y  RY+DEI
Subjt:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI

Query:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK
         +  SGSK + +  +  I D +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K
Subjt:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK

Query:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK
           H LKK+K+S              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V 
Subjt:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK

Query:  QYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKS
        ++++   A  V                                                L+D  + ++                  ++E  +  + + K 
Subjt:  QYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKS

Query:  CIRTLAAKHRIHESEIEKKFDS-ELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMER
        C++  A +  + ++      D  E ++  S  E++   +K  SD   +D            G   L L R+ +     +C    C      ++ +H+++ 
Subjt:  CIRTLAAKHRIHESEIEKKFDS-ELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMER

Query:  QKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID
        +          W  G   +IH +LN++   LC  H+ D+YLG I+LQ +D
Subjt:  QKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID

ATMG00520.1 Intron maturase, type II family protein7.5e-1736.76Show/hide
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP
        K+++EAIR+VLE ++ P F   SH  RSG+G  +VL+ I++E     W+   D+ K    +   +LI +++++IDDP+ F  I+ ++ AG L     G  
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP

Query:  KG-HGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYE
        +G + +P   +LS +  NIYL+  DQE  R+  KYE
Subjt:  KG-HGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGATTACAAGGTCTTTGGTTATTCTATTTATGTAATTGGTGGTTATGCAAATATGAAATTTGGGGGGCTTCAAAGATTTTGCAGGATCAACATGCGGAACATTTC
AATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATG
CTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATG
GAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCT
TCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTG
ATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGT
GTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTT
CACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATAT
ATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTT
GACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATT
GAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTG
CTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTT
CTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAA
TGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCA
GTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCT
AAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCT
TTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGG
GACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAAC
TTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGA
ACTGAGTAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATA
GTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTC
ATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTA
TTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGATTACAAGGTCTTTGGTTATTCTATTTATGTAATTGGTGGTTATGCAAATATGAAATTTGGGGGGCTTCAAAGATTTTGCAGGATCAACATGCGGAACATTTC
AATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATG
CTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATG
GAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCT
TCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTG
ATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGT
GTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTT
CACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATAT
ATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTT
GACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATT
GAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTG
CTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTT
CTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAA
TGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCA
GTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCT
AAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCT
TTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGG
GACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAAC
TTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGA
ACTGAGTAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATA
GTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTC
ATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTA
TTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA
Protein sequenceShow/hide protein sequence
MQDYKVFGYSIYVIGGYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQM
ELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLEC
VFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLF
DQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRF
LGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILS
KYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSN
FSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHV
MERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK