CuGenDBv2

ClCG04G005530 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G005530
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: nuclear intron maturase 4, mitochondrial isoform X2
Genome location: CG_Chr04:19227074..19230777
RNA-Seq Expression: ClCG04G005530
Synteny: ClCG04G005530
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006315 - homing of group II introns (biological process)
GO:0007005 - mitochondrion organization (biological process)
GO:0009845 - seed germination (biological process)
GO:0032885 - regulation of polysaccharide biosynthetic process (biological process)
GO:0090615 - mitochondrial mRNA processing (biological process)
GO:1900864 - mitochondrial RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR024937 - Domain X
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041778.1 hypothetical protein E6C27_scaffold67G001360 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 88.48)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDI                      G+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EE
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE

Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK
        SLDVDLRRSKT+MELKRSLEIQIK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMD+LVMAKLITVMEDKI+DP+LFAVIRSI++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV
        ALNLEFG FPKGHGLPQEGVLSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLKGNS +YPGEEKDKIRVYCCRYMDEIFLAV
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE
        SGSKDVALSFRSEIFDFMQKTLHLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKE
Subjt:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE

Query:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP
        SEIKHLAKNSSLNQISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKP
Subjt:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP

Query:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE
        TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSE
Subjt:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE

Query:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
        LS IYSSPE+EQ +E KS+DTH LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
Subjt:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG

Query:  LCKKHLEDLYLGHISLQSIDFGAWK
        LCK+HL DLYLG ISLQS+DFGAWK
Subjt:  LCKKHLEDLYLGHISLQSIDFGAWK

XP_008442019.1 PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo] (e value: 0.0e+00; % identity: 90.78)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDIG+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EESLDVDLRRSKT+MELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSI++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLK NS +YPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRK G
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELS IYSSPE+EQE+E KS+DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
         LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

XP_038882001.1 nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida] (e value: 0.0e+00; % identity: 93.56)
Query:  GYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRS
        GYA M+FGGLQRFCRINMRN++ L  VNVCKV+SS VS IG+ VQRVQ+S NYSTL  ADDEIDKG+EK KLA NLASL+EESLDVDL+RSKTQMELKRS
Subjt:  GYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRS

Query:  LEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIR
        LEIQIK+RVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMS DCLISFESMAEELSNGNFDVNANTFSILSSRKEVL+LPKI+LKVLQEAIR
Subjt:  LEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQE
        IVLECVFRPHFSKISHGCRSGRGHST LKYIRKEIKNPDWWFT+DLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIYVAGALNLEFGGFPKGHGLPQE
Subjt:  IVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQE

Query:  GVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFM
        G+LSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGNS +YPGE+KDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIF F+
Subjt:  GVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFM

Query:  QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSF
        QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSF
Subjt:  QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSF

Query:  RKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLF
        RKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSFYEFQRCV++YISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL 
Subjt:  RKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLF

Query:  RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKS
        RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL+LICD VRKSCIRTLAAKHRIHESEIEKKFDSELS +YSSPE+EQEEE KS
Subjt:  RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKS

Query:  SDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS
         DTHGLDHDEALKYGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS
Subjt:  SDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS

Query:  IDFGAWK
        IDFGAWK
Subjt:  IDFGAWK

XP_038882003.1 nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida] (e value: 0.0e+00; % identity: 92.75)
Query:  YSIYVIGGYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKT
        + + V+  YA M+FGGLQRFCRINMRN++ L  VNVCKV+SS VS IG+ VQRVQ+S NYSTL  ADDEIDKG+EK KLA NLASL+EESLDVDL+RSKT
Subjt:  YSIYVIGGYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKT

Query:  QMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLK
        QMELKRSLEIQIK+RVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMS DCLISFESMAEELSNGNFDVNANTFSILSSRKEVL+LPKI+LK
Subjt:  QMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLK

Query:  VLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPK
        VLQEAIRIVLECVFRPHFSKISHGCRSGRGHST LKYIRKEIKNPDWWFT+DLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIYVAGALNLEFGGFPK
Subjt:  VLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPK

Query:  GHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFR
        GHGLPQEG+LSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGNS +YPGE+KDKIRVYCCRYMDEIFLAVSGSKDVALSFR
Subjt:  GHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFR

Query:  SEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS
        SEIF F+QKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS
Subjt:  SEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS

Query:  LNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVN
        LNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSFYEFQRCV++YISAETASTVALLPNYDPSVKPTFITEIIAPVN
Subjt:  LNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVN

Query:  SIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELE
        SIRKRL RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL+LICD VRKSCIRTLAAKHRIHESEIEKKFDSELS +YSSPE+E
Subjt:  SIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELE

Query:  QEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYL
        QEEE KS DTHGLDHDEALKYGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYL
Subjt:  QEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYL

Query:  GHISLQSIDFGAWK
        GHISLQSIDFGAWK
Subjt:  GHISLQSIDFGAWK

XP_038882004.1 nuclear intron maturase 4, mitochondrial isoform X3 [Benincasa hispida] (e value: 0.0e+00; % identity: 93.65)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGGLQRFCRINMRN++ L  VNVCKV+SS VS IG+ VQRVQ+S NYSTL  ADDEIDKG+EK KLA NLASL+EESLDVDL+RSKTQMELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMS DCLISFESMAEELSNGNFDVNANTFSILSSRKEVL+LPKI+LKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYIRKEIKNPDWWFT+DLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIYVAGALNLEFGGFPKGHGLPQEG+LS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGNS +YPGE+KDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIF F+QKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSFYEFQRCV++YISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL RYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL+LICD VRKSCIRTLAAKHRIHESEIEKKFDSELS +YSSPE+EQEEE KS DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
        GLDHDEALKYGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWB0 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 90.04)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        MKFGGL+RFCRINMRN S LQ+VNVC  NSSFVSDIG+CVQ VQ S NYSTLA A  EIDKG+E+ KLA NLASL+EESLDVDLRRSKTQMELKRSLEI+
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMG VIACP TLQN YDC+RINSNVDI S D LISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSIY+AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGN+ +Y GEEKDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVN +EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+ISSFRK G
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVE SLP ELRDSFYEFQR VK+YIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRL RYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSELS IYSS E++QE+E KS+DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
         LDHDEALKYGISYSGLCLLS ARMV+ SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

A0A1S3B491 uncharacterized protein LOC103486008 (e value: 0.0e+00; % identity: 90.78)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDIG+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EESLDVDLRRSKT+MELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKLITVMEDKI+DP+LFAVIRSI++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLK NS +YPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG
        HLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRK G
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAG

Query:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
        METDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL
Subjt:  METDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRL

Query:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH
        VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELS IYSSPE+EQE+E KS+DTH
Subjt:  VTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTH

Query:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG
         LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFG
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFG

Query:  AWK
        AWK
Subjt:  AWK

A0A5A7TFZ7 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 88.48)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE
        M+FGGL+RFC+IN RN S  Q+VNVC VNSSFVSDI                      G+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLASL+EE
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI----------------------GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEE

Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK
        SLDVDLRRSKT+MELKRSLEIQIK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC+RINSNVDI S DCLISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMD+LVMAKLITVMEDKI+DP+LFAVIRSI++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV
        ALNLEFG FPKGHGLPQEGVLSPIL NIYLNLFDQEFFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLKGNS +YPGEEKDKIRVYCCRYMDEIFLAV
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE
        SGSKDVALSFRSEIFDFMQKTLHLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESPAVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKE
Subjt:  SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE

Query:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP
        SEIKHLAKNSSLNQISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V++YIS+ETAST+ALLPNYDPSVKP
Subjt:  SEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKP

Query:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE
        TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSE
Subjt:  TFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSE

Query:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
        LS IYSSPE+EQ +E KS+DTH LDHDEAL YGISYSGLCLLSLARMV+ SRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG
Subjt:  LSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFG

Query:  LCKKHLEDLYLGHISLQSIDFGAWK
        LCK+HL DLYLG ISLQS+DFGAWK
Subjt:  LCKKHLEDLYLGHISLQSIDFGAWK

A0A6J1CXL0 nuclear intron maturase 4, mitochondrial isoform X2 (e value: 0.0e+00; % identity: 85.32)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ
        M+FGG QRFCR+NMRN ++L+   +CKVNSSFVSDIG+CVQRVQ+S NYS LA ADD+  KG+EKKKLA NLASL+EESLDVD RR K++MELKRSLEIQ
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQ

Query:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKKRVKAQY+NGKF+DLMGKVIACP TLQNAYDCVRINSNVDI S D LISFESMAEEL NG+FDVNANTFSI SS+KEVLILPK+KLKVLQEAIRIVLE
Subjt:  IKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGRGHST LKYIRKEI NPDWWFTVD+SKKMDEL MAKLI+VMEDKI+DP  FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL
        PILMNIYLNLFDQEFFRLSMKYEAIN+ GN  QDGSQS+LRSWFRR+LKGN  EYP +EKD IRVYCCRYMDEIF+AVSGSKDVALSFRSEI DF+QK+L
Subjt:  PILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL

Query:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRKA
        HLDVNHQEEMVSC ET GIRFLGCLVRRS +ESPAVK+VHKLKEKVELFALQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKNS SLNQISSFRK 
Subjt:  HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRKA

Query:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYR
        GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS Y VEPSLP+ELRDSFYEFQR V++Y+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRL RYR
Subjt:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYR

Query:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDT
        L+TNKG+PC+SPFLIL DNTQIIDWFLGV RRW +WY+NCSNFSE++LICDQVRKSCIRTLAAKHR HESEIEKKFD ELS I S+PE+EQEEE+++SDT
Subjt:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDT

Query:  HGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDF
        HGL HDEA  YGISYSGLCLLSLARMV+ SRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCK+HL+DLYLGHISLQS++F
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDF

Query:  GAWK
        GAWK
Subjt:  GAWK

A0A6J1CYJ7 nuclear intron maturase 4, mitochondrial isoform X1 (e value: 0.0e+00; % identity: 85.22)
Query:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI-GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEI
        M+FGG QRFCR+NMRN ++L+   +CKVNSSFVSDI G+CVQRVQ+S NYS LA ADD+  KG+EKKKLA NLASL+EESLDVD RR K++MELKRSLEI
Subjt:  MKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDI-GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEI

Query:  QIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVL
        QIKKRVKAQY+NGKF+DLMGKVIACP TLQNAYDCVRINSNVDI S D LISFESMAEEL NG+FDVNANTFSI SS+KEVLILPK+KLKVLQEAIRIVL
Subjt:  QIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVL

Query:  ECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVL
        ECVFRPHFSKISHGCRSGRGHST LKYIRKEI NPDWWFTVD+SKKMDEL MAKLI+VMEDKI+DP  FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVL
Subjt:  ECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVL

Query:  SPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKT
        SPILMNIYLNLFDQEFFRLSMKYEAIN+ GN  QDGSQS+LRSWFRR+LKGN  EYP +EKD IRVYCCRYMDEIF+AVSGSKDVALSFRSEI DF+QK+
Subjt:  SPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKT

Query:  LHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRK
        LHLDVNHQEEMVSC ET GIRFLGCLVRRS +ESPAVK+VHKLKEKVELFALQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKNS SLNQISSFRK
Subjt:  LHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNQISSFRK

Query:  AGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRY
         GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS Y VEPSLP+ELRDSFYEFQR V++Y+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRL RY
Subjt:  AGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRY

Query:  RLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSD
        RL+TNKG+PC+SPFLIL DNTQIIDWFLGV RRW +WY+NCSNFSE++LICDQVRKSCIRTLAAKHR HESEIEKKFD ELS I S+PE+EQEEE+++SD
Subjt:  RLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSD

Query:  THGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID
        THGL HDEA  YGISYSGLCLLSLARMV+ SRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCK+HL+DLYLGHISLQS++
Subjt:  THGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID

Query:  FGAWK
        FGAWK
Subjt:  FGAWK

SwissProt top hits (e value, % identity, alignment)
B1N1A3 Putative nicotine oxidoreductase (e value: 2.7e-19; % identity: 27.66)
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP
        KV+QE IR +LE ++ P FSK SHG R+G+   T LK +R+      W    D+    D +  +KLI  +  +I D R   +IR    AG    E G F 
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP

Query:  KGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDK-------------------------
            G PQ  ++SPIL N++L+  D++  +L +K     E G+   D +  +L+   +  L+  + +  G E+D                          
Subjt:  KGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDK-------------------------

Query:  IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKE
        IRV   RY D+  + V+G K +A   RS + +F++    L+++ ++  +   ++   +FLG  +R   + S  +K +   K+
Subjt:  IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKE

P0A3U0 Group II intron-encoded protein LtrA (e value: 1.4e-20; % identity: 27.84)
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   T LK I++E     W+   D+    D +    LI ++  KI D ++  +I   
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++  +    T +     ++ +  S   ++L+G        EY  + K     
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----

Query:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK
              +K+  Y  RY D+  ++V GSK+     + ++  F+   L ++++ ++ +++   +   RFLG  +R  V+ S  +K   K+K++
Subjt:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK

P0A3U1 Group II intron-encoded protein LtrA (e value: 1.4e-20; % identity: 27.84)
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   T LK I++E     W+   D+    D +    LI ++  KI D ++  +I   
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++  +    T +     ++ +  S   ++L+G        EY  + K     
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKGNS-----FEYPGEEK-----

Query:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK
              +K+  Y  RY D+  ++V GSK+     + ++  F+   L ++++ ++ +++   +   RFLG  +R  V+ S  +K   K+K++
Subjt:  ------DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK

Q9CA78 Nuclear intron maturase 4, mitochondrial | e value: 6.6e-252 | %identity: 58.95
Query:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD
        LA  LASL+EES     D  + +++MELKRSLE+++KKRVK Q +NGKF DL+ KVIA P TL++AYDC+R+NSNV I   +  ++F+S+AEELS+G FD
Subjt:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD

Query:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID
        V +NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ LKYI   I   DW FT+ L+KK+D  V   L++VME+K++
Subjt:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID

Query:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI
        D  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +LMNIYL+ FD EF+R+SM++EA+  +  T +D   S+LRSWFRRQ      +   E+   +
Subjt:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI

Query:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW
        RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  + +   C  T G+R LG LVR++V+ESP VK+VHKLKEKV LFALQK+E W   TV 
Subjt:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE
        +GKKWL HGLKKVKESEIK LA  NS+L+QIS  RKAGMETDHWYK+LL+IWM+D L   A  SEE +LSK+ VEP++P ELRD+FY+FQ     Y+S+E
Subjt:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE

Query:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL
        TA+  ALLP      +P F  +++AP N+I +RL+RY L+T KG+  S+  LIL D  QIIDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRTL
Subjt:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL

Query:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW
        AAK+RIHE+EIEK+ D ELS I S+ ++EQE + +  D+   D DE L YG+S SGLCLLSLAR+V+ SRPCNCFV+GC   AP+VYTLH MERQKFPGW
Subjt:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW

Query:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK
        KTGFS  I  SLN RR GLCK+HL+DLY+G ISLQ++DFGAW+
Subjt:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK

Q9LZA5 Nuclear intron maturase 3, mitochondrial | e value: 1.5e-54 | %identity: 25.49
Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS
        SL ++  ++ T+  +K  LE  + K    QY +GKF  L+   ++ P  L  A   + +++N      D +    S E M  E+  G FD+ +     +S
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS

Query:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI
        S    L+LP +KLKVL EAIR+VLE V+   F+  S+G R G G  T ++Y++  ++NP WWF V  +++M +E  +  L   + +KI+D  L  +I+ +
Subjt:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI
        +  G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N    TG + S             GN F  P      + +Y  RY+DEI
Subjt:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI

Query:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK
         +  SGSK + +  +  I D +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K
Subjt:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK

Query:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK
           H LKK+K+S              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V 
Subjt:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK

Query:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFL-----GVSRRWFRW
        ++++   A  V  L + +  V+                    ++ AP   +RK +       + G P     L+  +++ II W+      G +++  R 
Subjt:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFL-----GVSRRWFRW

Query:  YNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCF
        Y      S+L                      +   E  F SE        E++   +K  SD   +D            G   L L R+ +     +C 
Subjt:  YNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCF

Query:  VVGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID
           C      ++ +H+++ +          W  G   +IH +LN++   LC  H+ D+YLG I+LQ +D
Subjt:  VVGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID

Arabidopsis top hits | e value | %identity | Alignment
AT1G74350.1 Intron maturase, type II family protein | e value: 4.7e-253 | %identity: 58.95
Query:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD
        LA  LASL+EES     D  + +++MELKRSLE+++KKRVK Q +NGKF DL+ KVIA P TL++AYDC+R+NSNV I   +  ++F+S+AEELS+G FD
Subjt:  LATNLASLIEESLD--VDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFD

Query:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID
        V +NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ LKYI   I   DW FT+ L+KK+D  V   L++VME+K++
Subjt:  VNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKID

Query:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI
        D  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +LMNIYL+ FD EF+R+SM++EA+  +  T +D   S+LRSWFRRQ      +   E+   +
Subjt:  DPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKI

Query:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW
        RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  + +   C  T G+R LG LVR++V+ESP VK+VHKLKEKV LFALQK+E W   TV 
Subjt:  RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE
        +GKKWL HGLKKVKESEIK LA  NS+L+QIS  RKAGMETDHWYK+LL+IWM+D L   A  SEE +LSK+ VEP++P ELRD+FY+FQ     Y+S+E
Subjt:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAE

Query:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL
        TA+  ALLP      +P F  +++AP N+I +RL+RY L+T KG+  S+  LIL D  QIIDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRTL
Subjt:  TASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTL

Query:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW
        AAK+RIHE+EIEK+ D ELS I S+ ++EQE + +  D+   D DE L YG+S SGLCLLSLAR+V+ SRPCNCFV+GC   AP+VYTLH MERQKFPGW
Subjt:  AAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGW

Query:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK
        KTGFS  I  SLN RR GLCK+HL+DLY+G ISLQ++DFGAW+
Subjt:  KTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK

AT5G04050.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 7.6e-54 | %identity: 27.24
Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS
        SL ++  ++ T+  +K  LE  + K    QY +GKF  L+   ++ P  L  A   + +++N      D +    S E M  E+  G FD+ +     +S
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS

Query:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI
        S    L+LP +KLKVL EAIR+VLE V+   F+  S+G R G G  T ++Y++  ++NP WWF V  +++M +E  +  L   + +KI+D  L  +I+ +
Subjt:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI
        +  G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N    TG + S             GN F  P      + +Y  RY+DEI
Subjt:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI

Query:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK
         +  SGSK + +  +  I D +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K
Subjt:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK

Query:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK
           H LKK+K+S              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V 
Subjt:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK

Query:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCS
        ++++   A  V  L + +  V+                    ++ AP   +RK +       + G P     L+  +++ II W+ GV R+W  ++  C 
Subjt:  QYISAETASTVALLPNYDPSVK---------------PTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCS

Query:  NF
        N+
Subjt:  NF

AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.0e-50 | %identity: 24.53
Query:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS
        SL ++  ++ T+  +K  LE  + K    QY +GKF  L+   ++ P  L  A   + +++N      D +    S E M  E+  G FD+ +     +S
Subjt:  SLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCL---ISFESMAEELSNGNFDVNANTFSILS

Query:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI
        S    L+LP +KLKVL EAIR+VLE V+   F+  S+G R G G  T ++Y++  ++NP WWF V  +++M +E  +  L   + +KI+D  L  +I+ +
Subjt:  SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKM-DELVMAKLITVMEDKIDDPRLFAVIRSI

Query:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI
        +  G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N    TG + S             GN F  P      + +Y  RY+DEI
Subjt:  YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI

Query:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK
         +  SGSK + +  +  I D +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K
Subjt:  FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCL-------VRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKK

Query:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK
           H LKK+K+S              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V 
Subjt:  WLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFYEFQRCVK

Query:  QYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKS
        ++++   A  V                                                L+D  + ++                  ++E  +  + + K 
Subjt:  QYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKS

Query:  CIRTLAAKHRIHESEIEKKFDS-ELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMER
        C++  A +  + ++      D  E ++  S  E++   +K  SD   +D            G   L L R+ +     +C    C      ++ +H+++ 
Subjt:  CIRTLAAKHRIHESEIEKKFDS-ELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMER

Query:  QKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID
        +          W  G   +IH +LN++   LC  H+ D+YLG I+LQ +D
Subjt:  QKF------PGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSID

ATMG00520.1 Intron maturase, type II family protein | e value: 7.5e-17 | %identity: 36.76
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP
        K+++EAIR+VLE ++ P F   SH  RSG+G  +VL+ I++E     W+   D+ K    +   +LI +++++IDDP+ F  I+ ++ AG L     G  
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFP

Query:  KG-HGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYE
        +G + +P   +LS +  NIYL+  DQE  R+  KYE
Subjt:  KG-HGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYE


Sequences
CDS sequence
ATGCAGGATTACAAGGTCTTTGGTTATTCTATTTATGTAATTGGTGGTTATGCAAATATGAAATTTGGGGGGCTTCAAAGATTTTGCAGGATCAACATGCGGAACATTTC
AATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATG
CTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATG
GAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCT
TCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTG
ATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGT
GTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTT
CACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATAT
ATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTT
GACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATT
GAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTG
CTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTT
CTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAA
TGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCA
GTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCT
AAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCT
TTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGG
GACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAAC
TTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGA
ACTGAGTAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATA
GTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTC
ATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTA
TTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA
mRNA sequence
ATGCAGGATTACAAGGTCTTTGGTTATTCTATTTATGTAATTGGTGGTTATGCAAATATGAAATTTGGGGGGCTTCAAAGATTTTGCAGGATCAACATGCGGAACATTTC
AATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATG
CTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATG
GAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCT
TCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTG
ATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGT
GTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTT
CACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATAT
ATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTT
GACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATT
GAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTG
CTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTT
CTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAA
TGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCA
GTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCT
AAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCT
TTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGG
GACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAAC
TTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGA
ACTGAGTAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATA
GTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTC
ATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTA
TTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA
Protein sequence
MQDYKVFGYSIYVIGGYANMKFGGLQRFCRINMRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQM
ELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLEC
VFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLF
DQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRF
LGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILS
KYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSN
FSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHV
MERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK