CuGenDBv2

CSPI04G10980 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G10980
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: nuclear intron maturase 4, mitochondrial isoform X2
Genome location: Chr4:9294372..9297350
RNA-Seq Expression: CSPI04G10980
Synteny: CSPI04G10980
Gene Ontology terms:
    GO:0000373 - Group II intron splicing (biological process)
    GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
    GO:0006315 - homing of group II introns (biological process)
    GO:0007005 - mitochondrion organization (biological process)
    GO:0009845 - seed germination (biological process)
    GO:0032885 - regulation of polysaccharide biosynthetic process (biological process)
    GO:0090615 - mitochondrial mRNA processing (biological process)
    GO:1900864 - mitochondrial RNA modification (biological process)
    GO:0005739 - mitochondrion (cellular component)
    GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR024937 - Domain X
    IPR043502 - DNA/RNA polymerase superfamily
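
The genome location in this record is written as chromosome, colon, start, two dots, end. As a rough illustration of how such a string could be split and the span of the gene estimated, here is a minimal Python sketch. The function names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a string such as 'Chr4:9294372..9297350' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr4:9294372..9297350")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the locus spans 2979 bases; with 0-based, half-open coordinates the figure would be one less.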


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041778.1 hypothetical protein E6C27_scaffold67G001360 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 92.34)
Query:  RNFSNLQSVNVCIFNSSFVSDI----------------------GKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQME
        RNFSN QSVNVCI NSSFVSDI                      GKC QIVQ SENYSTLARA  EIDKGME+MKLA+NLASLVEESLDVDLRRSKT+ME
Subjt:  RNFSNLQSVNVCIFNSSFVSDI----------------------GKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQME

Query:  LKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQ
        LKRS+EI+IKERVKAQYLNGKFLDLMGNVIACPNTLQN YDCIRINSNVDIKSND LISFESMAEELS+GNFDVNTNTFSILSSRKEVLILPKIKLKVLQ
Subjt:  LKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQ

Query:  EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHG
        EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMD+LVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHG
Subjt:  EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHG

Query:  LPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEI
        LPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLKGN+SDY GEEKDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEI
Subjt:  LPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEI

Query:  FDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKI
        FDF+QKTLHLDVN EEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+I
Subjt:  FDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKI

Query:  SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRK
        SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFELRDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRK
Subjt:  SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRK

Query:  RLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKE
        RL RYRLVTNKGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSS EI+Q KE
Subjt:  RLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKE

Query:  KSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL
        KSTDTHVLDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL
Subjt:  KSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL

Query:  QSVDFGAWK
        QSVDFGAWK
Subjt:  QSVDFGAWK

KAE8649366.1 hypothetical protein Csa_019152 [Cucumis sativus] (e value: 0.0e+00; % identity: 99.59)
Query:  MERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNG
        MERMKLAINLASLVEESLDVDLRRSKTQMELKRS+EIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNG
Subjt:  MERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNG

Query:  NFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKI
        NFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKI
Subjt:  NFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKI

Query:  EDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK
        EDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK
Subjt:  EDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK

Query:  IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW
        IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIF FVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW
Subjt:  IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETA
        LGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETA
Subjt:  LGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETA

Query:  STLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK
        STLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK
Subjt:  STLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK

Query:  HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGF
        HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLS ARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGF
Subjt:  HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGF

Query:  SSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        SSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
Subjt:  SSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

XP_008442019.1 PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo] (e value: 0.0e+00; % identity: 94.79)
Query:  RNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKF
        RNFSN QSVNVCI NSSFVSDIGKC QIVQ SENYSTLARA  EIDKGME+MKLA+NLASLVEESLDVDLRRSKT+MELKRS+EI+IKERVKAQYLNGKF
Subjt:  RNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKF

Query:  LDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC
        LDLMGNVIACPNTLQN YDCIRINSNVDIKSND LISFESMA+ELS+GNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC
Subjt:  LDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC

Query:  RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQE
        RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPILTNIYLNLFDQE
Subjt:  RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQE

Query:  FFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE
        FFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK N+SDY GEEKDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEIFDF+QKTLHLDVN EEEMVSCE
Subjt:  FFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE

Query:  THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWM
        THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+ISSFRKPGMETDHWYKVLLKIWM
Subjt:  THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWM

Query:  QDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLIL
        QDLNARAAESEEKILSKHAVE SLPFELRDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFLIL
Subjt:  QDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLIL

Query:  QDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSG
        QDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAKH+IHESEIEKKFDSELSKIYSS EI+QEKEKSTDTHVLDHDEAL YGISYSG
Subjt:  QDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSG

Query:  LCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        LCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
Subjt:  LCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

XP_011653460.1 nuclear intron maturase 4, mitochondrial [Cucumis sativus] (e value: 0.0e+00; % identity: 99.62)
Query:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFL
        MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRS+EIRIKERVKAQYLNGKFL
Subjt:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFL

Query:  DLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR
        DLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR
Subjt:  DLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR

Query:  SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF
        SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF
Subjt:  SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF

Query:  FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCET
        FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIF FVQKTLHLDVNREEEMVSCET
Subjt:  FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCET

Query:  HGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQ
        HGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQ
Subjt:  HGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQ

Query:  DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ
        DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ
Subjt:  DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ

Query:  DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGL
        DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGL
Subjt:  DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGL

Query:  CLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        CLLS ARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
Subjt:  CLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

XP_038882003.1 nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida] (e value: 0.0e+00; % identity: 90.75)
Query:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGK
        MRN +NL  VNVC  +SS VS IGK VQ VQ SENYSTL  A  EIDKGME+MKLA+NLASLVEESLDVDL+RSKTQMELKRS+EI+IKERVKAQYLNGK
Subjt:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGK

Query:  FLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHG
        FLDLMG VIACP TLQN YDC+RINSNVDI SND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+LPKI+LKVLQEAIRIVLECVFRPHFSKISHG
Subjt:  FLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHG

Query:  CRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQ
        CRSGRGHSTALKYI+KEIK+PDWWFT+DLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIY+AGALNLEFGGFPKGHGLPQEG+LSPILTNIYLNLFDQ
Subjt:  CRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQ

Query:  EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSC
        EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGN+SDY GE+KDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTLHLDVN +EEMVSC
Subjt:  EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSC

Query:  -ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKI
         ETHGIRFLGCLVRRSVQESPAVKS+HKLK+KVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+ISSFRK GMETDHWYKVLLKI
Subjt:  -ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKI

Query:  WMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFL
        WMQDLNARAAESEEKILSKHAVE SLP ELRDSFYEFQR V+EYIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFL
Subjt:  WMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFL

Query:  ILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISY
        ILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI D VRKSCIRTLAAKHRIHESEIEKKFDSELSK+YSS EI+QE+EKS DTH LDHDEALKYGISY
Subjt:  ILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISY

Query:  SGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        SGLCLLSLARMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISLQS+DFGAWK
Subjt:  SGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWB0 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 99.62)
Query:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFL
        MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRS+EIRIKERVKAQYLNGKFL
Subjt:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFL

Query:  DLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR
        DLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR
Subjt:  DLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR

Query:  SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF
        SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF
Subjt:  SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF

Query:  FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCET
        FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIF FVQKTLHLDVNREEEMVSCET
Subjt:  FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCET

Query:  HGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQ
        HGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQ
Subjt:  HGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQ

Query:  DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ
        DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ
Subjt:  DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ

Query:  DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGL
        DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGL
Subjt:  DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGL

Query:  CLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        CLLS ARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
Subjt:  CLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

A0A1S3B491 uncharacterized protein LOC103486008 (e value: 0.0e+00; % identity: 94.79)
Query:  RNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKF
        RNFSN QSVNVCI NSSFVSDIGKC QIVQ SENYSTLARA  EIDKGME+MKLA+NLASLVEESLDVDLRRSKT+MELKRS+EI+IKERVKAQYLNGKF
Subjt:  RNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKF

Query:  LDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC
        LDLMGNVIACPNTLQN YDCIRINSNVDIKSND LISFESMA+ELS+GNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC
Subjt:  LDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC

Query:  RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQE
        RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPILTNIYLNLFDQE
Subjt:  RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQE

Query:  FFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE
        FFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK N+SDY GEEKDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEIFDF+QKTLHLDVN EEEMVSCE
Subjt:  FFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE

Query:  THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWM
        THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+ISSFRKPGMETDHWYKVLLKIWM
Subjt:  THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWM

Query:  QDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLIL
        QDLNARAAESEEKILSKHAVE SLPFELRDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFLIL
Subjt:  QDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLIL

Query:  QDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSG
        QDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAKH+IHESEIEKKFDSELSKIYSS EI+QEKEKSTDTHVLDHDEAL YGISYSG
Subjt:  QDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSG

Query:  LCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        LCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
Subjt:  LCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

A0A5A7TFZ7 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 92.34)
Query:  RNFSNLQSVNVCIFNSSFVSDI----------------------GKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQME
        RNFSN QSVNVCI NSSFVSDI                      GKC QIVQ SENYSTLARA  EIDKGME+MKLA+NLASLVEESLDVDLRRSKT+ME
Subjt:  RNFSNLQSVNVCIFNSSFVSDI----------------------GKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQME

Query:  LKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQ
        LKRS+EI+IKERVKAQYLNGKFLDLMGNVIACPNTLQN YDCIRINSNVDIKSND LISFESMAEELS+GNFDVNTNTFSILSSRKEVLILPKIKLKVLQ
Subjt:  LKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQ

Query:  EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHG
        EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMD+LVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHG
Subjt:  EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHG

Query:  LPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEI
        LPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLKGN+SDY GEEKDKIRVYCCRYMDEIFLAVSGSKDVA SFRSEI
Subjt:  LPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEI

Query:  FDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKI
        FDF+QKTLHLDVN EEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+I
Subjt:  FDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKI

Query:  SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRK
        SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFELRDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRK
Subjt:  SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRK

Query:  RLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKE
        RL RYRLVTNKGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSS EI+Q KE
Subjt:  RLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKE

Query:  KSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL
        KSTDTHVLDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL
Subjt:  KSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL

Query:  QSVDFGAWK
        QSVDFGAWK
Subjt:  QSVDFGAWK

A0A6J1CXL0 nuclear intron maturase 4, mitochondrial isoform X2 (e value: 0.0e+00; % identity: 82.93)
Query:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEID--KGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGK
        MRNF+ L+   +C  NSSFVSDIGKCVQ VQ SENYS LA A+ D  KGME+ KLA NLASLVEESLDVD RR K++MELKRS+EI+IK+RVKAQY+NGK
Subjt:  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEID--KGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGK

Query:  FLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHG
        F+DLMG VIACP TLQN YDC+RINSNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLILPK+KLKVLQEAIRIVLECVFRPHFSKISHG
Subjt:  FLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHG

Query:  CRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQ
        CRSGRGHSTALKYI+KEI +PDWWFTVD+SKKMDEL MAKLI+VMEDKIEDP+ FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVLSPIL NIYLNLFDQ
Subjt:  CRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQ

Query:  EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSC
        EFFRLSMKYEAIN+YGN  QDGSQS+LRSWFRR+LKGN+S+Y  +EKD IRVYCCRYMDEIF+AVSGSKDVA SFRSEI DF+QK+LHLDVN +EEMVSC
Subjt:  EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSC

Query:  -ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNKISSFRKPGMETDHWYKVLLK
         ET GIRFLGCLVRRS +ESPAVK++HKLKEKVELF LQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKNS SLN+ISSFRK GMETDHWYKVLLK
Subjt:  -ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNKISSFRKPGMETDHWYKVLLK

Query:  IWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPF
        IWMQD+NA+AAE+EE ILS + VE SLP ELRDSFYEFQR V+EY+SSETAST+ALLPNYDPS K TFITEIIAPVNSIRKRLLRYRL+TNKG+PC+SPF
Subjt:  IWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPF

Query:  LILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQ-EKEKSTDTHVLDHDEALKYGI
        LIL DNTQIIDWF+GV RR  +WY+N SNFSE+ LI DQVRKSCIRTLAAKHR HESEIEKKFD ELS+I S+ EI+Q E+E+++DTH L HDEA  YGI
Subjt:  LILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQ-EKEKSTDTHVLDHDEALKYGI

Query:  SYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        SYSGLCLLSLARMVSQSRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCKQHL DLYLG ISLQSV+FGAWK
Subjt:  SYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

A0A6J1CYJ7 nuclear intron maturase 4, mitochondrial isoform X1 (e value: 0.0e+00; % identity: 82.83)
Query:  MRNFSNLQSVNVCIFNSSFVSDI-GKCVQIVQGSENYSTLARAEID--KGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNG
        MRNF+ L+   +C  NSSFVSDI GKCVQ VQ SENYS LA A+ D  KGME+ KLA NLASLVEESLDVD RR K++MELKRS+EI+IK+RVKAQY+NG
Subjt:  MRNFSNLQSVNVCIFNSSFVSDI-GKCVQIVQGSENYSTLARAEID--KGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNG

Query:  KFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISH
        KF+DLMG VIACP TLQN YDC+RINSNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLILPK+KLKVLQEAIRIVLECVFRPHFSKISH
Subjt:  KFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISH

Query:  GCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD
        GCRSGRGHSTALKYI+KEI +PDWWFTVD+SKKMDEL MAKLI+VMEDKIEDP+ FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVLSPIL NIYLNLFD
Subjt:  GCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD

Query:  QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVS
        QEFFRLSMKYEAIN+YGN  QDGSQS+LRSWFRR+LKGN+S+Y  +EKD IRVYCCRYMDEIF+AVSGSKDVA SFRSEI DF+QK+LHLDVN +EEMVS
Subjt:  QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVS

Query:  C-ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNKISSFRKPGMETDHWYKVLL
        C ET GIRFLGCLVRRS +ESPAVK++HKLKEKVELF LQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKNS SLN+ISSFRK GMETDHWYKVLL
Subjt:  C-ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNS-SLNKISSFRKPGMETDHWYKVLL

Query:  KIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSP
        KIWMQD+NA+AAE+EE ILS + VE SLP ELRDSFYEFQR V+EY+SSETAST+ALLPNYDPS K TFITEIIAPVNSIRKRLLRYRL+TNKG+PC+SP
Subjt:  KIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSP

Query:  FLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQ-EKEKSTDTHVLDHDEALKYG
        FLIL DNTQIIDWF+GV RR  +WY+N SNFSE+ LI DQVRKSCIRTLAAKHR HESEIEKKFD ELS+I S+ EI+Q E+E+++DTH L HDEA  YG
Subjt:  FLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQ-EKEKSTDTHVLDHDEALKYG

Query:  ISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        ISYSGLCLLSLARMVSQSRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCKQHL DLYLG ISLQSV+FGAWK
Subjt:  ISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

SwissProt top hits (e value, % identity, alignment)
P03876 Putative COX1/OXI3 intron 2 protein (e value: 4.2e-22; % identity: 25.21)
Query:  KAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRK----------EVLILPKIKLKVLQEAI
        K + +N + L LM ++      L   Y+ I+       K ++ +         L+  + D+NTN F     R+            L +   + K++QE++
Subjt:  KAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRK----------EVLILPKIKLKVLQEAI

Query:  RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQ
        R++LE ++   FS  SHG R      TA+   K  ++  +W+  VDL+K  D +    LI V+ ++I+D     ++  +  AG ++          G+PQ
Subjt:  RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEY-----GNTGQDGSQSRLRSWFRR-----------QLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVS
          V+SPIL NI+L+  D+    L  K+E  NE+      N G++   + L S   R           +L+ +     G +K   R Y  RY D+I + V 
Subjt:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEY-----GNTGQDGSQSRLRSWFRR-----------QLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVS

Query:  GSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHK
        GS +   +  ++I +F+++ L + +N ++ ++     G+ FLG  V+ +  E    + I K
Subjt:  GSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHK

P0A3U0 Group II intron-encoded protein LtrA (e value: 2.9e-23; % identity: 28.09)
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK IK+E     W+   D+    D +    LI ++  KI+D K+  +I   
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI

Query:  YLAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNNSDYSGEEKDKIR---
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      +++        GEEK K+    
Subjt:  YLAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNNSDYSGEEKDKIR---

Query:  -------------------VYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK
                           +   RY D+  ++V GSK+     + ++  F+   L ++++ E+ +++  +   RFLG  +R  V+ S  +K   K+K++
Subjt:  -------------------VYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK

P0A3U1 Group II intron-encoded protein LtrA (e value: 2.9e-23; % identity: 28.09)
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK IK+E     W+   D+    D +    LI ++  KI+D K+  +I   
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI

Query:  YLAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNNSDYSGEEKDKIR---
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      +++        GEEK K+    
Subjt:  YLAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNNSDYSGEEKDKIR---

Query:  -------------------VYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK
                           +   RY D+  ++V GSK+     + ++  F+   L ++++ E+ +++  +   RFLG  +R  V+ S  +K   K+K++
Subjt:  -------------------VYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK

Q9CA78 Nuclear intron maturase 4, mitochondrial (e value: 6.3e-244; % identity: 59.08)
Query:  LAINLASLVEESLD--VDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFD
        LA  LASLVEES     D  + +++MELKRS+E+R+K+RVK Q +NGKF DL+  VIA P TL++ YDCIR+NSNV I   +  ++F+S+AEELS+G FD
Subjt:  LAINLASLVEESLD--VDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFD

Query:  VNTNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIE
        V +NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FT+ L+KK+D  V   L++VME+K+E
Subjt:  VNTNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIE

Query:  DPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI
        D  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +L NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ        + E+   +
Subjt:  DPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI

Query:  RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW
        RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  E +   CE T G+R LG LVR++V+ESP VK++HKLKEKV LF LQK+E W   TV 
Subjt:  RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSE
        +GKKWL HGLKKVKESEIK LA  NS+L++IS  RK GMETDHWYK+LL+IWM+D L   A  SEE +LSKH VE ++P ELRD+FY+FQ     Y+SSE
Subjt:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSE

Query:  TASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL
        TA+  ALLP      +P F  +++AP N+I +RL RY L+T KG+  S+  LIL D  QIIDW+ G+ RR   WY   SNF E+  LI +Q+R SCIRTL
Subjt:  TASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL

Query:  AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGW
        AAK+RIHE+EIEK+ D ELS I S+ +I+QE + +  D+   D DE L YG+S SGLCLLSLAR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPGW
Subjt:  AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGW

Query:  KTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        KTGFS  I  SLN RR GLCKQHL DLY+G+ISLQ+VDFGAW+
Subjt:  KTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

Q9LZA5 Nuclear intron maturase 3, mitochondrial    e value: 1.6e-53    % identity: 25.66
Query:  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDCIRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVL
        ++  ++  V  QY +GKF  L+ N ++ P  L    QN+   +  NS+ D+     R  S E M  E+  G FD+ +     +SS    L+LP +KLKVL
Subjt:  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDCIRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVL

Query:  QEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG
         EAIR+VLE V+   F+  S+G R G G  TA++Y+K  +++P WWF V  +++M +E  +  L   + +KI D  L  +I+ ++  G L +E GG   G
Subjt:  QEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG

Query:  HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRS
         G PQE  L  IL N+Y +  D+E   L +K +  N    TG + S   +  +F+                 + +Y  RY+DEI +  SGSK +    + 
Subjt:  HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRS

Query:  EIFDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH
         I D +++ L L V+R    + S  +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S    
Subjt:  EIFDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH

Query:  LAKNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPN
                  + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V ++++   A    +L +
Subjt:  LAKNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPN

Query:  YDPSAK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFV-----GVSRRLFRWYNNSSNFSELFLIFDQ
         +   +                    ++ AP   +RK +       + G P     L+  +++ II W+      G +++L R Y      S+L      
Subjt:  YDPSAK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFV-----GVSRRLFRWYNNSSNFSELFLIFDQ

Query:  VRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVM
                        +   E  F SE           +E +   D ++ D            G   L L R+ S     +C    C      ++ +H++
Subjt:  VRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVM

Query:  ERQKF------PGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVD
        + +          W  G   +IH +LN++   LC  H++D+YLG+I+LQ VD
Subjt:  ERQKF------PGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVD

Arabidopsis top hits    e value    % identity    Alignment
AT1G74350.1 Intron maturase, type II family protein    e value: 4.5e-245    % identity: 59.08
Query:  LAINLASLVEESLD--VDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFD
        LA  LASLVEES     D  + +++MELKRS+E+R+K+RVK Q +NGKF DL+  VIA P TL++ YDCIR+NSNV I   +  ++F+S+AEELS+G FD
Subjt:  LAINLASLVEESLD--VDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFD

Query:  VNTNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIE
        V +NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FT+ L+KK+D  V   L++VME+K+E
Subjt:  VNTNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIE

Query:  DPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI
        D  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +L NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ        + E+   +
Subjt:  DPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI

Query:  RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW
        RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  E +   CE T G+R LG LVR++V+ESP VK++HKLKEKV LF LQK+E W   TV 
Subjt:  RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVW

Query:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSE
        +GKKWL HGLKKVKESEIK LA  NS+L++IS  RK GMETDHWYK+LL+IWM+D L   A  SEE +LSKH VE ++P ELRD+FY+FQ     Y+SSE
Subjt:  LGKKWLAHGLKKVKESEIKHLA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSE

Query:  TASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL
        TA+  ALLP      +P F  +++AP N+I +RL RY L+T KG+  S+  LIL D  QIIDW+ G+ RR   WY   SNF E+  LI +Q+R SCIRTL
Subjt:  TASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL

Query:  AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGW
        AAK+RIHE+EIEK+ D ELS I S+ +I+QE + +  D+   D DE L YG+S SGLCLLSLAR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPGW
Subjt:  AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGW

Query:  KTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK
        KTGFS  I  SLN RR GLCKQHL DLY+G+ISLQ+VDFGAW+
Subjt:  KTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK

AT5G04050.1 RNA-directed DNA polymerase (reverse transcriptase)    e value: 7.1e-49    % identity: 26.96
Query:  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDCIRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVL
        ++  ++  V  QY +GKF  L+ N ++ P  L    QN+   +  NS+ D+     R  S E M  E+  G FD+ +     +SS    L+LP +KLKVL
Subjt:  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDCIRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVL

Query:  QEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG
         EAIR+VLE V+   F+  S+G R G G  TA++Y+K  +++P WWF V  +++M +E  +  L   + +KI D  L  +I+ ++  G L +E GG   G
Subjt:  QEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG

Query:  HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRS
         G PQE  L  IL N+Y +  D+E   L +K +  N    TG + S   +  +F+                 + +Y  RY+DEI +  SGSK +    + 
Subjt:  HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRS

Query:  EIFDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH
         I D +++ L L V+R    + S  +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S    
Subjt:  EIFDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH

Query:  LAKNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPN
                  + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V ++++   A    +L +
Subjt:  LAKNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPN

Query:  YDPSAK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNF
         +   +                    ++ AP   +RK +       + G P     L+  +++ II W+ GV R+   ++    N+
Subjt:  YDPSAK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNF

AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase)    e value: 1.6e-48    % identity: 24.59
Query:  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDCIRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVL
        ++  ++  V  QY +GKF  L+ N ++ P  L    QN+   +  NS+ D+     R  S E M  E+  G FD+ +     +SS    L+LP +KLKVL
Subjt:  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDCIRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVL

Query:  QEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG
         EAIR+VLE V+   F+  S+G R G G  TA++Y+K  +++P WWF V  +++M +E  +  L   + +KI D  L  +I+ ++  G L +E GG   G
Subjt:  QEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG

Query:  HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRS
         G PQE  L  IL N+Y +  D+E   L +K +  N    TG + S   +  +F+                 + +Y  RY+DEI +  SGSK +    + 
Subjt:  HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRS

Query:  EIFDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH
         I D +++ L L V+R    + S  +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S    
Subjt:  EIFDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH

Query:  LAKNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPN
                  + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V ++++   A        
Subjt:  LAKNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPN

Query:  YDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEI
                                                  +L+D  + ++                  ++E  +  + + K C++  A +  + ++  
Subjt:  YDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEI

Query:  EKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKF------PGWKTGFSS
            D      + S   ++E +   D ++ D            G   L L R+ S     +C    C      ++ +H+++ +          W  G   
Subjt:  EKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKF------PGWKTGFSS

Query:  SIHPSLNKRRFGLCKQHLADLYLGRISLQSVD
        +IH +LN++   LC  H++D+YLG+I+LQ VD
Subjt:  SIHPSLNKRRFGLCKQHLADLYLGRISLQSVD

ATCG00040.1 maturase K    e value: 1.8e-04    % identity: 27.72
Query:  PVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSS
        P++SI   L + +     GHP S        ++ I++ FV + R +  +Y+ SS    L+ I   +R  C++TLA KH+       K+  S L + + + 
Subjt:  PVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSS

Query:  E
        E
Subjt:  E

ATMG00520.1 Intron maturase, type II family protein    e value: 1.2e-16    % identity: 36.76
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFP
        K+++EAIR+VLE ++ P F   SH  RSG+G  + L+ IK+E     W+   D+ K    +   +LI +++++I+DPK F  I+ ++ AG L     G  
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFP

Query:  KG-HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYE
        +G + +P   +LS +  NIYL+  DQE  R+  KYE
Subjt:  KG-HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYE


Sequences
CDS sequence
ATGCGGAACTTTTCAAATTTGCAAAGTGTAAATGTTTGCATATTCAATTCATCCTTTGTTTCTGACATTGGAAAGTGTGTTCAAATAGTTCAGGGTTCTGAAAATTATTC
AACTCTCGCCCGTGCTGAAATTGACAAGGGAATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGA
CTCAAATGGAACTTAAGAGATCAATTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCC
AATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGG
TAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTT
TGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGAT
TGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAG
AAGTATATATTTGGCCGGAGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAA
ACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGG
AGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAA
AGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTGATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTC
GTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACT
TGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAAT
TTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCT
TATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTT
GCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAA
TAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTT
CTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGAC
TCAGAATTGAGTAAGATTTACTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATA
TAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATG
TCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTG
TATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGA
mRNA sequence
ATGCGGAACTTTTCAAATTTGCAAAGTGTAAATGTTTGCATATTCAATTCATCCTTTGTTTCTGACATTGGAAAGTGTGTTCAAATAGTTCAGGGTTCTGAAAATTATTC
AACTCTCGCCCGTGCTGAAATTGACAAGGGAATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGA
CTCAAATGGAACTTAAGAGATCAATTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCC
AATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGG
TAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTT
TGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGAT
TGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAG
AAGTATATATTTGGCCGGAGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAA
ACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGG
AGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAA
AGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTGATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTC
GTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACT
TGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAAT
TTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCT
TATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTT
GCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAA
TAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTT
CTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGAC
TCAGAATTGAGTAAGATTTACTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATA
TAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATG
TCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTG
TATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGAATTGTTTTTGTTCTGCTTATGATTTCATTTAACTTCTAAATTACTTGTTGAGATAAGAT
TTGCCAAGAGAAATGCCATGACTGAGCCTTCGTGATGCATAATTTGTGCTAGCATAATGATTTTCTGGTCTCAATGGTGTCTGAAATCCTAATTGGAATGTTGCGCCGAT
TGAACATTGTTAAAACTAATGTGAGGACCAGGCTGGAACAACCCAATGTGAGAATCAGGTTTGAACAACGTTCAGTTGAATGGGTGATGTTTATATCGATATCTTGGAAA
CCTTTTCAATTGCTCGGATGAAGAATATTAGTATGGTGAGGAGTACATGCCAATGGT
Protein sequence
MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACP
NTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPD
WWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR
RQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQET
WNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTL
ALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD
SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADL
YLGRISLQSVDFGAWK