; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G016890 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G016890
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionnuclear intron maturase 4, mitochondrial isoform X2
Genome locationchr01:15478013..15494006
RNA-Seq ExpressionLsi01G016890
SyntenyLsi01G016890
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006315 - homing of group II introns (biological process)
GO:0007005 - mitochondrion organization (biological process)
GO:0009845 - seed germination (biological process)
GO:0032885 - regulation of polysaccharide biosynthetic process (biological process)
GO:0090615 - mitochondrial mRNA processing (biological process)
GO:1900864 - mitochondrial RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR024937 - Domain X
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041778.1 hypothetical protein E6C27_scaffold67G001360 [Cucumis melo var. makuwa]0.0e+0087.77Show/hide
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI----------------------GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEE
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDI                      GKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEE
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI----------------------GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEE

Query:  SLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRK
        SLDVD+RR KT+MELKRSLEIQIKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMD+LVMAKLITVMEDKI+DP+LFAVIR I++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV
        ALNLEFG FPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE
        SGSKDVAL+FRSEIFDFMQKTLHLDVNH++E+VSC ETHGI FLGCLVRRSVQES AVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKE
Subjt:  SGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE

Query:  SEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVK
        SEIKHLAKN SSLNQISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSF EFQR V+EYIS+ETAST+ALLPNYDPSVK
Subjt:  SEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVK

Query:  PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDS
        PTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDS
Subjt:  PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDS

Query:  ELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRF
        ELSKIYSSPE E+ +E KS++TH LDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRF
Subjt:  ELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRF

Query:  GLCKQHLEDLYLGHISLQSVDFGAWK
        GLCKQHL DLYLG ISLQSVDFGAWK
Subjt:  GLCKQHLEDLYLGHISLQSVDFGAWK

XP_008442019.1 PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo]0.0e+0090.05Show/hide
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDIGKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEESLDVD+RR KT+MELKRSLEIQ
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR I++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL
        PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK NSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVAL+FRSEIFDFMQKTL
Subjt:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL

Query:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA
        HLDVNH++E+VSC ETHGI FLGCLVRRSVQES AVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKN SSLNQISSFRK 
Subjt:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA

Query:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
        GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSF EFQR V+EYIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRL RYR
Subjt:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR

Query:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET
        LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELSKIYSSPE E+E+E KS++T
Subjt:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET

Query:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF
        H LDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL DLYLG ISLQSVDF
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF

Query:  GAWK
        GAWK
Subjt:  GAWK

XP_038882001.1 nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida]0.0e+0084.44Show/hide
Query:  PFARDDAGVVSSPPSWVLFSVSLSPPSWMLELCRLWVLFTLVDNGIREGLDELLKVASTEPETVLHYEFMQDYKCCYERFHHSSWLTRTEAMEVNDKFRW
        PFARDDA VVSSPPSWVLF+VSLSPPSWMLELCRLWVLFT+                                                         RW
Subjt:  PFARDDAGVVSSPPSWVLFSVSLSPPSWMLELCRLWVLFTLVDNGIREGLDELLKVASTEPETVLHYEFMQDYKCCYERFHHSSWLTRTEAMEVNDKFRW

Query:  FNVLVFGKVLIYSSDAFFFSNTLYSYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNL
        +               + F + +  YAKM FGGL+R C +NMRN + L  VNVCKV SS VS IGK VQRVQ+SENYSTL CADDEIDKGMEK  LA NL
Subjt:  FNVLVFGKVLIYSSDAFFFSNTLYSYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNL

Query:  AFLVEESLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFS
        A LVEESLDVD++R KTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAY+C+RI+SNVDIMSND LISFESMAEELSNGNFDVN NTFS
Subjt:  AFLVEESLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFS

Query:  ILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIR
        ILSSRKEVL+LPKI+LKVLQEAIRIVLECVFRPHFSKISHGCRSGR HSTALKYIRKEIKNPDWWFTI+L KKMDELVMAKLITVMEDKI+DP+LFAVIR
Subjt:  ILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIR

Query:  RIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMD
         IYVAGALNLEFGGFPKGHGLPQEG+LSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGE+KDKIRVYCCRYMD
Subjt:  RIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMD

Query:  EIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHG
        EIFLAVSGSKDVAL+FRSEIF F+QKTLHLDVNHQ+E+VSCGETHGI FLGCLVRRSVQES AVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHG
Subjt:  EIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHG

Query:  LKKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPN
        LKKVKESEIKHLAKN SSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSF EFQRCV+EYISAETASTVALLPN
Subjt:  LKKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPN

Query:  YDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEI
        YDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICD VRKSCIRTLAAKHRIHESEI
Subjt:  YDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEI

Query:  EKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPS
        EKKFDSELSK+YSSPE E+EEE KS +THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPS
Subjt:  EKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPS

Query:  LNKRRFGLCKQHLEDLYLGHISLQSVDFGAWK
        LNKRRFGLCK+HLEDLYLGHISLQS+DFGAWK
Subjt:  LNKRRFGLCKQHLEDLYLGHISLQSVDFGAWK

XP_038882003.1 nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida]0.0e+0092.27Show/hide
Query:  FFSNTLYSYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKT
        FF N LYSYAKM FGGL+R C +NMRN + L  VNVCKV SS VS IGK VQRVQ+SENYSTL CADDEIDKGMEK  LA NLA LVEESLDVD++R KT
Subjt:  FFSNTLYSYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKT

Query:  QMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLK
        QMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAY+C+RI+SNVDIMSND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+LPKI+LK
Subjt:  QMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLK

Query:  VLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPK
        VLQEAIRIVLECVFRPHFSKISHGCRSGR HSTALKYIRKEIKNPDWWFTI+L KKMDELVMAKLITVMEDKI+DP+LFAVIR IYVAGALNLEFGGFPK
Subjt:  VLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPK

Query:  GHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFR
        GHGLPQEG+LSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGE+KDKIRVYCCRYMDEIFLAVSGSKDVAL+FR
Subjt:  GHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFR

Query:  SEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRS
        SEIF F+QKTLHLDVNHQ+E+VSCGETHGI FLGCLVRRSVQES AVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKN S
Subjt:  SEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRS

Query:  SLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPV
        SLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSF EFQRCV+EYISAETASTVALLPNYDPSVKPTFITEIIAPV
Subjt:  SLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPV

Query:  NSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPET
        NSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICD VRKSCIRTLAAKHRIHESEIEKKFDSELSK+YSSPE 
Subjt:  NSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPET

Query:  EKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLY
        E+EEE KS +THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HLEDLY
Subjt:  EKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLY

Query:  LGHISLQSVDFGAWK
        LGHISLQS+DFGAWK
Subjt:  LGHISLQSVDFGAWK

XP_038882004.1 nuclear intron maturase 4, mitochondrial isoform X3 [Benincasa hispida]0.0e+0092.41Show/hide
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGL+R C +NMRN + L  VNVCKV SS VS IGK VQRVQ+SENYSTL CADDEIDKGMEK  LA NLA LVEESLDVD++R KTQMELKRSLEIQ
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMGKVIACPTTLQNAY+C+RI+SNVDIMSND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+LPKI+LKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYIRKEIKNPDWWFTI+L KKMDELVMAKLITVMEDKI+DP+LFAVIR IYVAGALNLEFGGFPKGHGLPQEG+LS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL
        PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGE+KDKIRVYCCRYMDEIFLAVSGSKDVAL+FRSEIF F+QKTL
Subjt:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL

Query:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA
        HLDVNHQ+E+VSCGETHGI FLGCLVRRSVQES AVKSVHKLK+KVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKN SSLNQISSFRKA
Subjt:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA

Query:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
        GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP+ELRDSF EFQRCV+EYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
Subjt:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR

Query:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET
        LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICD VRKSCIRTLAAKHRIHESEIEKKFDSELSK+YSSPE E+EEE KS +T
Subjt:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET

Query:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF
        HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HLEDLYLGHISLQS+DF
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF

Query:  GAWK
        GAWK
Subjt:  GAWK

TrEMBL top hitse value%identityAlignment
A0A0A0KWB0 Reverse transcriptase domain-containing protein0.0e+0089.93Show/hide
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +NMRNFS LQSVNVC   SSFVSDIGKCVQ VQ SENYSTLA A  EIDKGME+  LA NLA LVEESLDVD+RR KTQMELKRSLEI+
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQN Y+CIRI+SNVDI SNDRLISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR IY+AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL
        PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGN+SDY GEEKDKIRVYCCRYMDEIFLAVSGSKDVA +FRSEIF F+QKTL
Subjt:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL

Query:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA
        HLDVN ++E+VSC ETHGI FLGCLVRRSVQES AVKS+HKLKEKVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKN SSLN+ISSFRK 
Subjt:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA

Query:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
        GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVE SLP ELRDSF EFQR VKEYIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRLLRYR
Subjt:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR

Query:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET
        LVTNKGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSS E ++E+E KS++T
Subjt:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET

Query:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF
        H LDHDEALKYGISYSGLCLLS ARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL DLYLG ISLQSVDF
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF

Query:  GAWK
        GAWK
Subjt:  GAWK

A0A1S3B491 uncharacterized protein LOC1034860080.0e+0090.05Show/hide
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDIGKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEESLDVD+RR KT+MELKRSLEIQ
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR I++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL
        PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK NSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVAL+FRSEIFDFMQKTL
Subjt:  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTL

Query:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA
        HLDVNH++E+VSC ETHGI FLGCLVRRSVQES AVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKN SSLNQISSFRK 
Subjt:  HLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKA

Query:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
        GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSF EFQR V+EYIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRL RYR
Subjt:  GMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR

Query:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET
        LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELSKIYSSPE E+E+E KS++T
Subjt:  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET

Query:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF
        H LDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL DLYLG ISLQSVDF
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDF

Query:  GAWK
        GAWK
Subjt:  GAWK

A0A5A7TFZ7 Reverse transcriptase domain-containing protein0.0e+0087.77Show/hide
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI----------------------GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEE
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDI                      GKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEE
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI----------------------GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEE

Query:  SLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRK
        SLDVD+RR KT+MELKRSLEIQIKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMD+LVMAKLITVMEDKI+DP+LFAVIR I++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV
        ALNLEFG FPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE
        SGSKDVAL+FRSEIFDFMQKTLHLDVNH++E+VSC ETHGI FLGCLVRRSVQES AVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKE
Subjt:  SGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKE

Query:  SEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVK
        SEIKHLAKN SSLNQISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSF EFQR V+EYIS+ETAST+ALLPNYDPSVK
Subjt:  SEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVK

Query:  PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDS
        PTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDS
Subjt:  PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDS

Query:  ELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRF
        ELSKIYSSPE E+ +E KS++TH LDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRF
Subjt:  ELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRF

Query:  GLCKQHLEDLYLGHISLQSVDFGAWK
        GLCKQHL DLYLG ISLQSVDFGAWK
Subjt:  GLCKQHLEDLYLGHISLQSVDFGAWK

A0A6J1CXL0 nuclear intron maturase 4, mitochondrial isoform X20.0e+0084.22Show/hide
Query:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEI
        +M FGG +R C MNMRNF++L+   +CKV SSFVSDIGKCVQRVQ+SENYS LACADD+  KGMEKK LA NLA LVEESLDVD RRPK++MELKRSLEI
Subjt:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEI

Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVL
        QIK+RVKAQY+NGKF+DLMGKVIACP TLQNAY+C+RI+SNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLILPK+KLKVLQEAIRIVL
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVL

Query:  ECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVL
        ECVFRPHFSKISHGCRSGR HSTALKYIRKEI NPDWWFT+++ KKMDEL MAKLI+VMEDKI+DP  FA+IR I+ AGALNLEFGGFPKGHGLPQEGVL
Subjt:  ECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVL

Query:  SPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKT
        SPIL NIYLNLFDQEFFRLSMKYEAIN+YGN  QDGSQS+LRSWFRR+LKGN S+YP +EKD IRVYCCRYMDEIF+AVSGSKDVAL+FRSEI DF+QK+
Subjt:  SPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKT

Query:  LHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRK
        LHLDVNHQ+E+VSC ET GI FLGCLVRRS +ES AVK+VHKLKEKVELFALQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKN  SLNQISSFRK
Subjt:  LHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRK

Query:  AGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRY
         GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS Y VEPSLP+ELRDSF EFQR V+EY+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRLLRY
Subjt:  AGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRY

Query:  RLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSE
        RL+TNKG+PC+SPFLIL DNTQIIDWFLGV RRW +WY+NCSNFSE+ILICDQVRKSCIRTLAAKHR HESEIEKKFD ELS+I S+PE E+EEE+++S+
Subjt:  RLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSE

Query:  THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVD
        THGL HDEA  YGISYSGLCLLSLARMVSQSRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCKQHL+DLYLGHISLQSV+
Subjt:  THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVD

Query:  FGAWK
        FGAWK
Subjt:  FGAWK

A0A6J1CYJ7 nuclear intron maturase 4, mitochondrial isoform X10.0e+0084.12Show/hide
Query:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI-GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLE
        +M FGG +R C MNMRNF++L+   +CKV SSFVSDI GKCVQRVQ+SENYS LACADD+  KGMEKK LA NLA LVEESLDVD RRPK++MELKRSLE
Subjt:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI-GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLE

Query:  IQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIV
        IQIK+RVKAQY+NGKF+DLMGKVIACP TLQNAY+C+RI+SNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLILPK+KLKVLQEAIRIV
Subjt:  IQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIV

Query:  LECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGV
        LECVFRPHFSKISHGCRSGR HSTALKYIRKEI NPDWWFT+++ KKMDEL MAKLI+VMEDKI+DP  FA+IR I+ AGALNLEFGGFPKGHGLPQEGV
Subjt:  LECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGV

Query:  LSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQK
        LSPIL NIYLNLFDQEFFRLSMKYEAIN+YGN  QDGSQS+LRSWFRR+LKGN S+YP +EKD IRVYCCRYMDEIF+AVSGSKDVAL+FRSEI DF+QK
Subjt:  LSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQK

Query:  TLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFR
        +LHLDVNHQ+E+VSC ET GI FLGCLVRRS +ES AVK+VHKLKEKVELFALQKQE WN WTVWLGKKWLAHGLKKVKESEIKHLAKN  SLNQISSFR
Subjt:  TLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFR

Query:  KAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLR
        K GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS Y VEPSLP+ELRDSF EFQR V+EY+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRLLR
Subjt:  KAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLR

Query:  YRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSS
        YRL+TNKG+PC+SPFLIL DNTQIIDWFLGV RRW +WY+NCSNFSE+ILICDQVRKSCIRTLAAKHR HESEIEKKFD ELS+I S+PE E+EEE+++S
Subjt:  YRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSS

Query:  ETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSV
        +THGL HDEA  YGISYSGLCLLSLARMVSQSRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCKQHL+DLYLGHISLQSV
Subjt:  ETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSV

Query:  DFGAWK
        +FGAWK
Subjt:  DFGAWK

SwissProt top hitse value%identityAlignment
P03876 Putative COX1/OXI3 intron 2 protein5.0e-2024.79Show/hide
Query:  KAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESM-AEELSNGNFDVNGNTFSILSSRK----------EVLILPKIKLKVLQEA
        K + +N + L LM  +      L  AYN I+ S   ++      I+ + +    L+  + D+N N F     R+            L +   + K++QE+
Subjt:  KAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESM-AEELSNGNFDVNGNTFSILSSRK----------EVLILPKIKLKVLQEA

Query:  IRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLP
        +R++LE ++   FS  SHG R   +  TA+   +  ++  +W+  ++L K  D +    LI V+ ++I D     ++ ++  AG ++          G+P
Subjt:  IRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLP

Query:  QEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEY-----GNTGQDGSQSRLRSWFRR-----------QLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV
        Q  V+SPIL NI+L+  D+    L  K+E  NE+      N G++   + L S   R           +L+ +     G +K   R Y  RY D+I + V
Subjt:  QEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEY-----GNTGQDGSQSRLRSWFRR-----------QLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAV

Query:  SGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHK
         GS +   N  ++I +F+++ L + +N  + ++   +  G+ FLG  V+ +  E    + + K
Subjt:  SGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHK

P0A3U0 Group II intron-encoded protein LtrA3.2e-1926.33Show/hide
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R+  TALK I++E     W+   +++   D +    LI ++  KI D ++  +I + 
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNSSDYPGEEKDKIR---
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      +++        GEEK K+    
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNSSDYPGEEKDKIR---

Query:  -------------------VYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEK
                           +   RY D+  ++V GSK+     + ++  F+   L ++++ ++ +++   +    FLG  +R  V+ S  +K   K+K++
Subjt:  -------------------VYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEK

P0A3U1 Group II intron-encoded protein LtrA3.2e-1926.33Show/hide
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R+  TALK I++E     W+   +++   D +    LI ++  KI D ++  +I + 
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNSSDYPGEEKDKIR---
          AG   LE   + K + G PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      +++        GEEK K+    
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQLKGNSSDYPGEEKDKIR---

Query:  -------------------VYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEK
                           +   RY D+  ++V GSK+     + ++  F+   L ++++ ++ +++   +    FLG  +R  V+ S  +K   K+K++
Subjt:  -------------------VYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEK

Q9CA78 Nuclear intron maturase 4, mitochondrial9.4e-24558.2Show/hide
Query:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF
        +LA  LA LVEES     D  +P+++MELKRSLE+++K+RVK Q +NGKF DL+ KVIA P TL++AY+CIR++SNV I   +  ++F+S+AEELS+G F
Subjt:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF

Query:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI
        DV  NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGR  ++ALKYI   I   DW FT+ L KK+D  V   L++VME+K+
Subjt:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI

Query:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDK
        +D  L  ++R ++ A  LNLEFGGFPKGHGLPQEGVLS +L NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ          E+   
Subjt:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDK

Query:  IRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTV
        +RVYCCR+MDEI+ +VSG K VA + RSE   F++ +LHLD+  + +   C  T G+  LG LVR++V+ES  VK+VHKLKEKV LFALQK+E W   TV
Subjt:  IRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTV

Query:  WLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISA
         +GKKWL HGLKKVKESEIK LA + S+L+QIS  RKAGMETDHWYK+LL+IWM+D L   A  SEE +LSK+ VEP++P ELRD+F +FQ     Y+S+
Subjt:  WLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISA

Query:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT
        ETA+  ALLP      +P F  +++AP N+I +RL RY L+T KG+  S+  LIL D  QIIDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRT
Subjt:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT

Query:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPG
        LAAK+RIHE+EIEK+ D ELS I S+ + E+E + +  ++   D DE L YG+S SGLCLLSLAR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPG
Subjt:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPG

Query:  WKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDFGAWK
        WKTGFS  I  SLN RR GLCKQHL+DLY+G ISLQ+VDFGAW+
Subjt:  WKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDFGAWK

Q9LZA5 Nuclear intron maturase 3, mitochondrial5.8e-5325.5Show/hide
Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        +++  V  QY +GKF  L+   ++ P  L  A   + +S+N      DR+    S E M  E+  G FD+       +SS    L+LP +KLKVL EAIR
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ
        +VLE V+   F+  S+G R G    TA++Y++  ++NP WWF +   ++M +E  +  L   + +KI+D  L  +I++++  G L +E GG   G G PQ
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDF
        E  L  IL N+Y +  D+E   L +K +  N    TG + S             GN    P      + +Y  RY+DEI +  SGSK + ++ +  I D 
Subjt:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDF

Query:  MQKTLHLDVNHQQEIVSCGETHGICFLGCL-------VRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNR
        +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S         
Subjt:  MQKTLHLDVNHQQEIVSCGETHGICFLGCL-------VRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNR

Query:  SSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPS
              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V ++++   A  V  L + +  
Subjt:  SSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPS

Query:  VK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFL-----GVSRRWFRWYNNCSNFSELILICDQVRKS
        V+                    ++ AP   +RK +       + G P     L+  +++ II W+      G +++  R Y      S+L          
Subjt:  VK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFL-----GVSRRWFRWYNNCSNFSELILICDQVRKS

Query:  CIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQ
                    +   E  F SE        E +   +K  S+   +D            G   L L R+ S     +C    C      ++ +H+++ +
Subjt:  CIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQ

Query:  KF------PGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVD
                  W  G   +IH +LN++   LC  H+ D+YLG I+LQ VD
Subjt:  KF------PGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVD

Arabidopsis top hitse value%identityAlignment
AT1G04620.1 coenzyme F420 hydrogenase family / dehydrogenase, beta subunit family6.3e-1071.74Show/hide
Query:  LELCRLWVLFT-LVDNGIREGLDELLKVASTEPETVLHYEFMQDYK
        L L +L+VL T  VDNG R+GLD+ LK AS EPETVLHYEFMQDYK
Subjt:  LELCRLWVLFT-LVDNGIREGLDELLKVASTEPETVLHYEFMQDYK

AT1G74350.1 Intron maturase, type II family protein6.7e-24658.2Show/hide
Query:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF
        +LA  LA LVEES     D  +P+++MELKRSLE+++K+RVK Q +NGKF DL+ KVIA P TL++AY+CIR++SNV I   +  ++F+S+AEELS+G F
Subjt:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF

Query:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI
        DV  NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGR  ++ALKYI   I   DW FT+ L KK+D  V   L++VME+K+
Subjt:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI

Query:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDK
        +D  L  ++R ++ A  LNLEFGGFPKGHGLPQEGVLS +L NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ          E+   
Subjt:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDK

Query:  IRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTV
        +RVYCCR+MDEI+ +VSG K VA + RSE   F++ +LHLD+  + +   C  T G+  LG LVR++V+ES  VK+VHKLKEKV LFALQK+E W   TV
Subjt:  IRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTV

Query:  WLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISA
         +GKKWL HGLKKVKESEIK LA + S+L+QIS  RKAGMETDHWYK+LL+IWM+D L   A  SEE +LSK+ VEP++P ELRD+F +FQ     Y+S+
Subjt:  WLGKKWLAHGLKKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISA

Query:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT
        ETA+  ALLP      +P F  +++AP N+I +RL RY L+T KG+  S+  LIL D  QIIDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRT
Subjt:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT

Query:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPG
        LAAK+RIHE+EIEK+ D ELS I S+ + E+E + +  ++   D DE L YG+S SGLCLLSLAR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPG
Subjt:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPG

Query:  WKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDFGAWK
        WKTGFS  I  SLN RR GLCKQHL+DLY+G ISLQ+VDFGAW+
Subjt:  WKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDFGAWK

AT5G04050.1 RNA-directed DNA polymerase (reverse transcriptase)1.0e-5227.15Show/hide
Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        +++  V  QY +GKF  L+   ++ P  L  A   + +S+N      DR+    S E M  E+  G FD+       +SS    L+LP +KLKVL EAIR
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ
        +VLE V+   F+  S+G R G    TA++Y++  ++NP WWF +   ++M +E  +  L   + +KI+D  L  +I++++  G L +E GG   G G PQ
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDF
        E  L  IL N+Y +  D+E   L +K +  N    TG + S             GN    P      + +Y  RY+DEI +  SGSK + ++ +  I D 
Subjt:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDF

Query:  MQKTLHLDVNHQQEIVSCGETHGICFLGCL-------VRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNR
        +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S         
Subjt:  MQKTLHLDVNHQQEIVSCGETHGICFLGCL-------VRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNR

Query:  SSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPS
              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V ++++   A  V  L + +  
Subjt:  SSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPS

Query:  VK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNF
        V+                    ++ AP   +RK +       + G P     L+  +++ II W+ GV R+W  ++  C N+
Subjt:  VK---------------PTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNF

AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase)1.9e-4628.51Show/hide
Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        +++  V  QY +GKF  L+   ++ P  L  A   + +S+N      DR+    S E M  E+  G FD+       +SS    L+LP +KLKVL EAIR
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ
        +VLE V+   F+  S+G R G    TA++Y++  ++NP WWF +   ++M +E  +  L   + +KI+D  L  +I++++  G L +E GG   G G PQ
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDF
        E  L  IL N+Y +  D+E   L +K +  N    TG + S             GN    P      + +Y  RY+DEI +  SGSK + ++ +  I D 
Subjt:  EGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALNFRSEIFDF

Query:  MQKTLHLDVNHQQEIVSCGETHGICFLGCL-------VRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNR
        +++ L L V+     +    +  I FLG         V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S         
Subjt:  MQKTLHLDVNHQQEIVSCGETHGICFLGCL-------VRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNR

Query:  SSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTV
              + F+  G E ++  + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  V ++++   A  V
Subjt:  SSLNQISSFRKAGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTV

AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase)3.0e-0429.21Show/hide
Query:  GLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVD
        G   L L R+ S     +C    C      ++ +H+++ +          W  G   +IH +LN++   LC  H+ D+YLG I+LQ VD
Subjt:  GLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVD

ATMG00520.1 Intron maturase, type II family protein4.5e-1635.29Show/hide
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFP
        K+++EAIR+VLE ++ P F   SH  RSG+   + L+ I++E     W+   ++RK    +   +LI +++++IDDP+ F  I++++ AG L     G  
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFP

Query:  KG-HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYE
        +G + +P   +LS +  NIYL+  DQE  R+  KYE
Subjt:  KG-HGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCCGAAGTCCTAAGGAGCTTGTCACATCCACATTTCTTAGAAGAGAAGGGTTTGTCTCGTATGAGTCACAAGTCGAGCTGACGCAACCCTCATCTACTACACAAAA
GAGCTCCTCGTCCACCATAGTTTTCTTGTCTTCTCCATGCCGCCACTCTGCGTCTCCAACCACCGCGCCCTTCGCCAGGGACGACGCCGGCGTGGTTTCTTCGCCGCCGT
CGTGGGTTCTTTTCTCCGTGAGTCTTTCACCACCGTCGTGGATGCTTGAGCTGTGCCGTTTGTGGGTTCTCTTCACCTTAGTGGATAATGGAATTCGAGAAGGACTAGAT
GAACTTCTGAAGGTTGCAAGTACAGAACCAGAGACAGTTCTTCATTACGAGTTCATGCAAGATTACAAGTGTTGTTATGAGAGATTTCATCATTCCTCTTGGTTGACAAG
AACTGAAGCGATGGAGGTTAACGACAAATTCCGATGGTTTAATGTGCTCGTATTTGGCAAAGTTCTTATTTACAGTTCTGATGCTTTTTTTTTTTCCAACACTCTTTACA
GTTATGCAAAAATGACATTTGGGGGGCTTCGAAGATTATGTGGGATGAACATGCGGAACTTTTCAATTTTGCAAAGTGTAAATGTTTGCAAAGTCAAATCATCCTTTGTT
TCTGACATTGGAAAGTGTGTTCAGAGAGTTCAGAGTTCTGAAAATTATTCAACTCTCGCTTGTGCTGATGATGAAATTGACAAGGGCATGGAGAAAAAGACACTGGCCAC
GAACTTGGCCTTTCTTGTTGAAGAATCTCTTGATGTTGATATGAGAAGACCAAAGACTCAAATGGAACTTAAGAGATCCCTTGAAATTCAGATTAAGGAGAGGGTGAAGG
CACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGTAAAGTGATTGCCTGCCCCACAACTCTTCAAAATGCTTACAACTGTATTAGAATTAGCTCAAATGTTGATATA
ATGTCGAATGACCGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATGGCAATACTTTCTCCATATTAAGTTCGAGAAAAGAAGT
ACTCATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGGAGTG
GAAGAGCACACTCAACGGCATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTTCACAATTGAATTAAGAAAAAAGATGGATGAGCTTGTGATGGCTAAA
CTCATTACAGTAATGGAGGACAAGATAGACGACCCCAGACTATTTGCTGTTATCAGAAGAATATATGTGGCTGGGGCACTAAATTTGGAGTTTGGGGGTTTCCCAAAAGG
TCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACAAACATCTATCTAAACCTCTTTGATCAAGAATTTTTCAGATTATCAATGAAATATGAGGCTATTAATG
AGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATTGAAAGGAAATAGTTCTGATTATCCAGGTGAGGAGAAAGACAAAATA
AGAGTCTATTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCAGGTTCTAAAGATGTTGCTCTTAATTTTAGGTCAGAGATTTTTGATTTCATGCAGAAGACTTT
GCATTTAGACGTTAATCATCAACAGGAAATAGTATCATGTGGGGAGACTCATGGAATTTGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCTTGCTGTAA
AATCCGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTG
AAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAGAATAGGTCTTCTTTGAATCAAATTTCTAGTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGT
TCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATT
CCTTTTCTGAGTTTCAAAGGTGCGTGAAAGAATATATTTCTGCTGAGACAGCTTCTACTGTTGCTCTCTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACT
GAGATTATAGCACCCGTCAATTCTATAAGAAAACGACTATTGCGATATAGGTTAGTTACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACAC
TCAAATTATTGACTGGTTTTTAGGAGTATCTCGTCGTTGGTTTAGATGGTACAATAATTGTTCTAACTTCAGCGAGTTGATCTTAATTTGCGATCAAGTTAGGAAATCTT
GTATCAGAACGCTAGCGGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAGTAAGATTTACTCCTCTCCTGAAACAGAGAAAGAAGAA
GAGAAGAAGTCATCAGAAACCCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATC
TCGTCCTTGCAATTGTTTTGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGATTCTCGA
GTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAACAACATTTGGAGGATTTGTATTTGGGTCACATTTCATTGCAATCTGTTGACTTTGGTGCATGG
AAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCCGAAGTCCTAAGGAGCTTGTCACATCCACATTTCTTAGAAGAGAAGGGTTTGTCTCGTATGAGTCACAAGTCGAGCTGACGCAACCCTCATCTACTACACAAAA
GAGCTCCTCGTCCACCATAGTTTTCTTGTCTTCTCCATGCCGCCACTCTGCGTCTCCAACCACCGCGCCCTTCGCCAGGGACGACGCCGGCGTGGTTTCTTCGCCGCCGT
CGTGGGTTCTTTTCTCCGTGAGTCTTTCACCACCGTCGTGGATGCTTGAGCTGTGCCGTTTGTGGGTTCTCTTCACCTTAGTGGATAATGGAATTCGAGAAGGACTAGAT
GAACTTCTGAAGGTTGCAAGTACAGAACCAGAGACAGTTCTTCATTACGAGTTCATGCAAGATTACAAGTGTTGTTATGAGAGATTTCATCATTCCTCTTGGTTGACAAG
AACTGAAGCGATGGAGGTTAACGACAAATTCCGATGGTTTAATGTGCTCGTATTTGGCAAAGTTCTTATTTACAGTTCTGATGCTTTTTTTTTTTCCAACACTCTTTACA
GTTATGCAAAAATGACATTTGGGGGGCTTCGAAGATTATGTGGGATGAACATGCGGAACTTTTCAATTTTGCAAAGTGTAAATGTTTGCAAAGTCAAATCATCCTTTGTT
TCTGACATTGGAAAGTGTGTTCAGAGAGTTCAGAGTTCTGAAAATTATTCAACTCTCGCTTGTGCTGATGATGAAATTGACAAGGGCATGGAGAAAAAGACACTGGCCAC
GAACTTGGCCTTTCTTGTTGAAGAATCTCTTGATGTTGATATGAGAAGACCAAAGACTCAAATGGAACTTAAGAGATCCCTTGAAATTCAGATTAAGGAGAGGGTGAAGG
CACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGTAAAGTGATTGCCTGCCCCACAACTCTTCAAAATGCTTACAACTGTATTAGAATTAGCTCAAATGTTGATATA
ATGTCGAATGACCGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATGGCAATACTTTCTCCATATTAAGTTCGAGAAAAGAAGT
ACTCATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGGAGTG
GAAGAGCACACTCAACGGCATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTTCACAATTGAATTAAGAAAAAAGATGGATGAGCTTGTGATGGCTAAA
CTCATTACAGTAATGGAGGACAAGATAGACGACCCCAGACTATTTGCTGTTATCAGAAGAATATATGTGGCTGGGGCACTAAATTTGGAGTTTGGGGGTTTCCCAAAAGG
TCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACAAACATCTATCTAAACCTCTTTGATCAAGAATTTTTCAGATTATCAATGAAATATGAGGCTATTAATG
AGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATTGAAAGGAAATAGTTCTGATTATCCAGGTGAGGAGAAAGACAAAATA
AGAGTCTATTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCAGGTTCTAAAGATGTTGCTCTTAATTTTAGGTCAGAGATTTTTGATTTCATGCAGAAGACTTT
GCATTTAGACGTTAATCATCAACAGGAAATAGTATCATGTGGGGAGACTCATGGAATTTGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCTTGCTGTAA
AATCCGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTG
AAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAGAATAGGTCTTCTTTGAATCAAATTTCTAGTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGT
TCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATT
CCTTTTCTGAGTTTCAAAGGTGCGTGAAAGAATATATTTCTGCTGAGACAGCTTCTACTGTTGCTCTCTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACT
GAGATTATAGCACCCGTCAATTCTATAAGAAAACGACTATTGCGATATAGGTTAGTTACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACAC
TCAAATTATTGACTGGTTTTTAGGAGTATCTCGTCGTTGGTTTAGATGGTACAATAATTGTTCTAACTTCAGCGAGTTGATCTTAATTTGCGATCAAGTTAGGAAATCTT
GTATCAGAACGCTAGCGGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAGTAAGATTTACTCCTCTCCTGAAACAGAGAAAGAAGAA
GAGAAGAAGTCATCAGAAACCCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATC
TCGTCCTTGCAATTGTTTTGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGATTCTCGA
GTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAACAACATTTGGAGGATTTGTATTTGGGTCACATTTCATTGCAATCTGTTGACTTTGGTGCATGG
AAGTGAATGGTTTTTGTTCTGTTTATGATCTCATTTAACTTCTTGATTACTTACTGGGATAGATTTGCTGAGAGAAACGCCATGACTGAGCCTTCATGGTGCCTGATTTG
GCTAGCATAGTGATTTTCTGATCTCAATGGTGTCTGAAATCCTCATTGGATTGTTGCACTGATGGAACATTGTTAAAACCAATGTGAGGACCAGGTTAGAGCAACCAAAT
GTGAGAACCAGGTTATAACAACATTCAGTTGAATGGGTGATGTTTATATCGATATCTTGGAAACCTTCTCAAATGCTCGGAGGAAGAATATTAGTATGGAGATTTGAACT
TCCGACCTCAAAGGAGTGCATGCCAATCATCTATGCTCACTTTGACTACTATTGTAGAGATTATCAAAGACGAAGCGATGGTGGTTAAACAATGTGCTTCAAGGTAAGAA
AATTGGAATGAGGTAGAGGCATGTTGTGTTATCTCAGAGCTGAGGAGATACTCTCATTAAATTTGACTCAGTTAGATTCATGCCCCTTTTACAAAGGTGCTTGACGAATT
CAGAAGAGTCTTCTACCAGTGGGAATAAATTTACTTAACCGATACTCATCCTAATCTGCAAAGCTCAATGCATCTTGAAGTCTTTGGGAGGTTTTGGGAAGGCACATATG
CAAGCATAGCTCTGGATATGCCAAACGTGAGTATTATTTTCCTCCCTTGTTGATTGAGTGAAGGATATTTCTCAGCTTTAAAAACCATTGTATTTGTAAATTTTATGTGC
TTAACAAGGAAAATGAAATATATGTCATGTTGATGTTAATTTGTTGTTAAGTTATGCATTATTGTGAGAGGACGTTTGATTTATTGAAAGAATTTAGTTGAATGAGCATT
TTTACTATAAA
Protein sequenceShow/hide protein sequence
MCRSPKELVTSTFLRREGFVSYESQVELTQPSSTTQKSSSSTIVFLSSPCRHSASPTTAPFARDDAGVVSSPPSWVLFSVSLSPPSWMLELCRLWVLFTLVDNGIREGLD
ELLKVASTEPETVLHYEFMQDYKCCYERFHHSSWLTRTEAMEVNDKFRWFNVLVFGKVLIYSSDAFFFSNTLYSYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFV
SDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDI
MSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAK
LITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEEKDKI
RVYCCRYMDEIFLAVSGSKDVALNFRSEIFDFMQKTLHLDVNHQQEIVSCGETHGICFLGCLVRRSVQESLAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGL
KKVKESEIKHLAKNRSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFSEFQRCVKEYISAETASTVALLPNYDPSVKPTFIT
EIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEE
EKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLEDLYLGHISLQSVDFGAW
K