CuGenDBv2

HG10017477 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10017477
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: nuclear intron maturase 4, mitochondrial isoform X2
Genome location: Chr03:14665262..14674620
RNA-Seq Expression: HG10017477
Synteny: HG10017477
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006315 - homing of group II introns (biological process)
GO:0007005 - mitochondrion organization (biological process)
GO:0009845 - seed germination (biological process)
GO:0032885 - regulation of polysaccharide biosynthetic process (biological process)
GO:0090615 - mitochondrial mRNA processing (biological process)
GO:1900864 - mitochondrial RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR024937 - Domain X
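
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The function names `parse_location` and `locus_length` are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'Chr03:14665262..14674620' style string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def locus_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr03:14665262..14674620")
length = locus_length(start, end)
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption the locus spans 9359 bp on Chr03; with half-open coordinates the span would be one base shorter.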


Homology
GenBank top hits | e value | % identity | Alignment
XP_008442019.1 PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo] | e value: 7.8e-302 | % identity: 76.15
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDIGKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEESLDVD+RR KT+MELKRSLEIQ
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR I++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------
        PILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLK NSSDYPGEEKDKIRVYCCRYMDEIF  V                      
Subjt:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------

Query:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG
                               L RRSVQES AVKS+HKLKEKVELF LQKQE W +WTV LGKKWLAH LKKVK SEIKHLAKN SSLNQISSFRK G
Subjt:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG

Query:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL
        METDHW                                               V+EYIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRL RYRL
Subjt:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL

Query:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH
        VTNKGHPCSSPFLILQDNTQ IDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELSKIYSSPE E+E+E KS++TH
Subjt:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH

Query:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
         LDHDEAL YGISYSGLCLLSLARMVS+SRPCNC+  G
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

XP_011653460.1 nuclear intron maturase 4, mitochondrial [Cucumis sativus] | e value: 1.3e-301 | % identity: 76.42
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +NMRNFS LQSVNVC   SSFVSDIGKCVQ VQ SENYSTLA A  EIDKGME+  LA NLA LVEESLDVD+RR KTQMELKRSLEI+
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQN Y+CIRI+SNVDI SNDRLISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR IY+AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------
        PILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLKGN+SDY GEEKDKIRVYCCRYMDEIF  V                      
Subjt:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------

Query:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG
                               L RRSVQES AVKS+HKLKEKVELF LQKQE WNAWTV LGKKWLAH LKKVK SEIKHLAKN SSLN+ISSFRK G
Subjt:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG

Query:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL
        METDHW                                               VKEYIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRLLRYRL
Subjt:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL

Query:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH
        VTNKGHPCSSPFLILQDNTQ IDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSS E ++E+E KS++TH
Subjt:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH

Query:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
         LDHDEALKYGISYSGLCLLS ARMVSQSRPCNC+  G
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

XP_038882001.1 nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 78.57
Query:  YAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSL
        YAKM FGGL+R C +NMRN + L  VNVCKV SS VS IGK VQRVQ+SENYSTL CADDEIDKGMEK  LA NLA LVEESLDVD++R KTQMELKRSL
Subjt:  YAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSL

Query:  EIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRI
        EIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAY+C+RI+SNVDIMSND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+LPKI+LKVLQEAIRI
Subjt:  EIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRI

Query:  VLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEG
        VLECVFRPHFSKISHGCRSGR HSTALKYIRKEIKNPDWWFTI+L KKMDELVMAKLITVMEDKI+DP+LFAVIR IYVAGALNLEFGGFPKGHGLPQEG
Subjt:  VLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEG

Query:  VLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ------------------
        +LSPILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLKGNSSDYPGE+KDKIRVYCCRYMDEIF  V                   
Subjt:  VLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ------------------

Query:  --------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSF
                                   L RRSVQES AVKSVHKLK+KVELFALQKQE WNAWTV LGKKWLAH LKKVK SEIKHLAKN SSLNQISSF
Subjt:  --------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSF

Query:  RKAGMETDHW----------------------------------------------CVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLL
        RKAGMETDHW                                              CV+EYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLL
Subjt:  RKAGMETDHW----------------------------------------------CVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLL

Query:  RYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKS
        RYRLVTNKGHPCSSPFLILQDNTQ IDWFLGVSRRWFRWYNNCSNFSELILICD VRKSCIRTLAAKHRIHESEIEKKFDSELSK+YSSPE E+EEE KS
Subjt:  RYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKS

Query:  SETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
         +THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNC+  G
Subjt:  SETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

XP_038882003.1 nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 78.6
Query:  SYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRS
        SYAKM FGGL+R C +NMRN + L  VNVCKV SS VS IGK VQRVQ+SENYSTL CADDEIDKGMEK  LA NLA LVEESLDVD++R KTQMELKRS
Subjt:  SYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRS

Query:  LEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        LEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAY+C+RI+SNVDIMSND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+LPKI+LKVLQEAIR
Subjt:  LEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQE
        IVLECVFRPHFSKISHGCRSGR HSTALKYIRKEIKNPDWWFTI+L KKMDELVMAKLITVMEDKI+DP+LFAVIR IYVAGALNLEFGGFPKGHGLPQE
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQE

Query:  GVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ-----------------
        G+LSPILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLKGNSSDYPGE+KDKIRVYCCRYMDEIF  V                  
Subjt:  GVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ-----------------

Query:  ---------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISS
                                    L RRSVQES AVKSVHKLK+KVELFALQKQE WNAWTV LGKKWLAH LKKVK SEIKHLAKN SSLNQISS
Subjt:  ---------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISS

Query:  FRKAGMETDHW----------------------------------------------CVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL
        FRKAGMETDHW                                              CV+EYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL
Subjt:  FRKAGMETDHW----------------------------------------------CVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL

Query:  LRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKK
        LRYRLVTNKGHPCSSPFLILQDNTQ IDWFLGVSRRWFRWYNNCSNFSELILICD VRKSCIRTLAAKHRIHESEIEKKFDSELSK+YSSPE E+EEE K
Subjt:  LRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKK

Query:  SSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        S +THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNC+  G
Subjt:  SSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

XP_038882004.1 nuclear intron maturase 4, mitochondrial isoform X3 [Benincasa hispida] | e value: 0.0e+00 | % identity: 78.48
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGL+R C +NMRN + L  VNVCKV SS VS IGK VQRVQ+SENYSTL CADDEIDKGMEK  LA NLA LVEESLDVD++R KTQMELKRSLEIQ
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMGKVIACPTTLQNAY+C+RI+SNVDIMSND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+LPKI+LKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYIRKEIKNPDWWFTI+L KKMDELVMAKLITVMEDKI+DP+LFAVIR IYVAGALNLEFGGFPKGHGLPQEG+LS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------
        PILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLKGNSSDYPGE+KDKIRVYCCRYMDEIF  V                      
Subjt:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------

Query:  -----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKA
                                L RRSVQES AVKSVHKLK+KVELFALQKQE WNAWTV LGKKWLAH LKKVK SEIKHLAKN SSLNQISSFRKA
Subjt:  -----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKA

Query:  GMETDHW----------------------------------------------CVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
        GMETDHW                                              CV+EYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR
Subjt:  GMETDHW----------------------------------------------CVKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYR

Query:  LVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET
        LVTNKGHPCSSPFLILQDNTQ IDWFLGVSRRWFRWYNNCSNFSELILICD VRKSCIRTLAAKHRIHESEIEKKFDSELSK+YSSPE E+EEE KS +T
Subjt:  LVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSET

Query:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNC+  G
Subjt:  HGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
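
Each hit above reports a % identity alongside a pairwise alignment whose middle line repeats the residue wherever query and subject agree. As a rough sketch of how such a figure relates to the alignment text, the snippet below counts identical columns in one query/midline pair. The function name `midline_identity` is hypothetical, and the exact denominator used by the database (for example, whether gap columns count) is an assumption, so results for a single row will not necessarily match the reported per-hit values.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the midline repeats the query residue.

    Assumes BLAST-style text output: the midline shows the residue letter at
    identical positions and a space (or '+') elsewhere.
    """
    if not query:
        raise ValueError("empty query row")
    # Trailing spaces in the midline are often stripped; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if q != "-" and m == q
    )
    return 100.0 * identical / len(query)


# First eight columns of the first GenBank alignment on this page.
print(midline_identity("MTFGGLRR", "M FGGLRR"))  # 7 of 8 columns match -> 87.5
```

To approximate a whole-hit figure, the identical-column counts and column totals would need to be summed across all rows of the alignment before dividing.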

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KWB0 Reverse transcriptase domain-containing protein | e value: 6.5e-302 | % identity: 76.42
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +NMRNFS LQSVNVC   SSFVSDIGKCVQ VQ SENYSTLA A  EIDKGME+  LA NLA LVEESLDVD+RR KTQMELKRSLEI+
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQN Y+CIRI+SNVDI SNDRLISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR IY+AGALNLEFGGFPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------
        PILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLKGN+SDY GEEKDKIRVYCCRYMDEIF  V                      
Subjt:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------

Query:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG
                               L RRSVQES AVKS+HKLKEKVELF LQKQE WNAWTV LGKKWLAH LKKVK SEIKHLAKN SSLN+ISSFRK G
Subjt:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG

Query:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL
        METDHW                                               VKEYIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRLLRYRL
Subjt:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL

Query:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH
        VTNKGHPCSSPFLILQDNTQ IDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSS E ++E+E KS++TH
Subjt:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH

Query:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
         LDHDEALKYGISYSGLCLLS ARMVSQSRPCNC+  G
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

A0A1S3B491 uncharacterized protein LOC103486008 | e value: 3.8e-302 | % identity: 76.15
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDIGKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEESLDVD+RR KT+MELKRSLEIQ
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQ

Query:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
        IKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEAIRIVLE
Subjt:  IKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLE

Query:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS
        CVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMDELVMAKLITVMEDKI+DP+LFAVIR I++AGALNLEFG FPKGHGLPQEGVLS
Subjt:  CVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLS

Query:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------
        PILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLK NSSDYPGEEKDKIRVYCCRYMDEIF  V                      
Subjt:  PILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ---------------------

Query:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG
                               L RRSVQES AVKS+HKLKEKVELF LQKQE W +WTV LGKKWLAH LKKVK SEIKHLAKN SSLNQISSFRK G
Subjt:  ----------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAG

Query:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL
        METDHW                                               V+EYIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRL RYRL
Subjt:  METDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRL

Query:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH
        VTNKGHPCSSPFLILQDNTQ IDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKKFDSELSKIYSSPE E+E+E KS++TH
Subjt:  VTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETH

Query:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
         LDHDEAL YGISYSGLCLLSLARMVS+SRPCNC+  G
Subjt:  GLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

A0A5A7TFZ7 Reverse transcriptase domain-containing protein | e value: 5.1e-299 | % identity: 74.08
Query:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI----------------------GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEE
        M FGGLRR C +N RNFS  QSVNVC V SSFVSDI                      GKC Q VQSSENYSTLA ADDEIDKGMEK  LA NLA LVEE
Subjt:  MTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI----------------------GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEE

Query:  SLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRK
        SLDVD+RR KT+MELKRSLEIQIKERVKAQYLNGKFLDLMG VIACP TLQNAY+CIRI+SNVDI SND LISFESMAEELS+GNFDVN NTFSILSSRK
Subjt:  SLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRK

Query:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAG
        EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGR HSTALKYI+KEIK+PDWWFT++L KKMD+LVMAKLITVMEDKI+DP+LFAVIR I++AG
Subjt:  EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAG

Query:  ALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRV
        ALNLEFG FPKGHGLPQEGVLSPILTNIYL   DQEFFRLSMKYEAINEYGNTGQDGSQS    WFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIF  V
Subjt:  ALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRV

Query:  Q-------------------------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVS
                                                     L RRSVQES AVKS+HKLKEKVELF LQKQE W +WTV LGKKWLAH LKKVK S
Subjt:  Q-------------------------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVS

Query:  EIKHLAKNRSSLNQISSFRKAGMETDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKP
        EIKHLAKN SSLNQISSFRK GMETDHW                                               V+EYIS+ETAST+ALLPNYDPSVKP
Subjt:  EIKHLAKNRSSLNQISSFRKAGMETDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKP

Query:  TFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSE
        TFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFLILQDNTQ IDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEKKFDSE
Subjt:  TFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSE

Query:  LSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        LSKIYSSPE E+ +E KS++TH LDHDEAL YGISYSGLCLLSLARMVS+SRPCNC+  G
Subjt:  LSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

A0A6J1CXL0 nuclear intron maturase 4, mitochondrial isoform X2 | e value: 7.9e-292 | % identity: 71.49
Query:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEI
        +M FGG +R C MNMRNF++L+   +CKV SSFVSDIGKCVQRVQ+SENYS LACADD+  KGMEKK LA NLA LVEESLDVD RRPK++MELKRSLEI
Subjt:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEI

Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVL
        QIK+RVKAQY+NGKF+DLMGKVIACP TLQNAY+C+RI+SNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLILPK+KLKVLQEAIRIVL
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVL

Query:  ECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVL
        ECVFRPHFSKISHGCRSGR HSTALKYIRKEI NPDWWFT+++ KKMDEL MAKLI+VMEDKI+DP  FA+IR I+ AGALNLEFGGFPKGHGLPQEGVL
Subjt:  ECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVL

Query:  SPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ--------------------
        SPIL NIYL   DQEFFRLSMKYEAIN+YGN  QDGSQS    WFRR+LKGN S+YP +EKD IRVYCCRYMDEIF  V                     
Subjt:  SPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ--------------------

Query:  ------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRK
                                 L RRS +ES AVK+VHKLKEKVELFALQKQE WN WTV LGKKWLAH LKKVK SEIKHLAKN  SLNQISSFRK
Subjt:  ------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRK

Query:  AGMETDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRY
         GMETDHW                                               V+EY+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRLLRY
Subjt:  AGMETDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRY

Query:  RLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSE
        RL+TNKG+PC+SPFLIL DNTQ IDWFLGV RRW +WY+NCSNFSE+ILICDQVRKSCIRTLAAKHR HESEIEKKFD ELS+I S+PE E+EEE+++S+
Subjt:  RLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSE

Query:  THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        THGL HDEA  YGISYSGLCLLSLARMVSQSRPCNC+  G
Subjt:  THGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

A0A6J1CYJ7 nuclear intron maturase 4, mitochondrial isoform X1 | e value: 2.0e-290 | % identity: 71.39
Query:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI-GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLE
        +M FGG +R C MNMRNF++L+   +CKV SSFVSDI GKCVQRVQ+SENYS LACADD+  KGMEKK LA NLA LVEESLDVD RRPK++MELKRSLE
Subjt:  KMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDI-GKCVQRVQSSENYSTLACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLE

Query:  IQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIV
        IQIK+RVKAQY+NGKF+DLMGKVIACP TLQNAY+C+RI+SNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLILPK+KLKVLQEAIRIV
Subjt:  IQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIV

Query:  LECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGV
        LECVFRPHFSKISHGCRSGR HSTALKYIRKEI NPDWWFT+++ KKMDEL MAKLI+VMEDKI+DP  FA+IR I+ AGALNLEFGGFPKGHGLPQEGV
Subjt:  LECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGV

Query:  LSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ-------------------
        LSPIL NIYL   DQEFFRLSMKYEAIN+YGN  QDGSQS    WFRR+LKGN S+YP +EKD IRVYCCRYMDEIF  V                    
Subjt:  LSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQ-------------------

Query:  -------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFR
                                  L RRS +ES AVK+VHKLKEKVELFALQKQE WN WTV LGKKWLAH LKKVK SEIKHLAKN  SLNQISSFR
Subjt:  -------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFR

Query:  KAGMETDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLR
        K GMETDHW                                               V+EY+S+ETASTVALLPNYDPSVK TFITEIIAPVNSIRKRLLR
Subjt:  KAGMETDHWC----------------------------------------------VKEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLR

Query:  YRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSS
        YRL+TNKG+PC+SPFLIL DNTQ IDWFLGV RRW +WY+NCSNFSE+ILICDQVRKSCIRTLAAKHR HESEIEKKFD ELS+I S+PE E+EEE+++S
Subjt:  YRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSS

Query:  ETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        +THGL HDEA  YGISYSGLCLLSLARMVSQSRPCNC+  G
Subjt:  ETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

SwissProt top hits | e value | % identity | Alignment
B1N1A3 Putative nicotine oxidoreductase | e value: 1.7e-20 | % identity: 27.09
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFP
        KV+QE IR +LE ++ P FSK SHG R+G++  TALK +R+      W    +++   D +  +KLI  +  +I D R   +IR+   AG    E G F 
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFP

Query:  KGH-GLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQE
            G PQ  ++SPIL N++L  LD++  +L +K     E G+   D +    +  +  L+  +    G E+D           ++      L R +   
Subjt:  KGH-GLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQE

Query:  SLAVKSVHKLKEKVELFALQKQENWNAWTVGL-GKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHWC-VKEYISAETASTVALLPNYD--
         + VK V    +               W +G+ G K LA  L+ V V E    A    S+ + +  R A  ET  +      I +E +  + +L N    
Subjt:  SLAVKSVHKLKEKVELFALQKQENWNAWTVGL-GKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHWC-VKEYISAETASTVALLPNYD--

Query:  PSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEK
        P     +   + AP+N +  +L        KG+P +    I  D+ Q ++    V R    +Y+    FS L  I   ++ +  +TLA KHR   S+I  
Subjt:  PSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEK

Query:  KFDSEL
        K  + L
Subjt:  KFDSEL

P0A3U0 Group II intron-encoded protein LtrA | e value: 5.0e-17 | % identity: 36
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R+  TALK I++E     W+   +++   D +    LI ++  KI D ++  +I + 
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLKSLDQEFFRLSMKYE
          AG   LE   + K + G PQ G+LSP+L NIYL  LD+   +L MK++
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLKSLDQEFFRLSMKYE

P0A3U1 Group II intron-encoded protein LtrA | e value: 5.0e-17 | % identity: 36
Query:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI
        S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R+  TALK I++E     W+   +++   D +    LI ++  KI D ++  +I + 
Subjt:  SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRI

Query:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLKSLDQEFFRLSMKYE
          AG   LE   + K + G PQ G+LSP+L NIYL  LD+   +L MK++
Subjt:  YVAGALNLEFGGFPKGH-GLPQEGVLSPILTNIYLKSLDQEFFRLSMKYE

Q9CA78 Nuclear intron maturase 4, mitochondrial | e value: 8.8e-163 | % identity: 48.31
Query:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF
        +LA  LA LVEES     D  +P+++MELKRSLE+++K+RVK Q +NGKF DL+ KVIA P TL++AY+CIR++SNV I   +  ++F+S+AEELS+G F
Subjt:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF

Query:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI
        DV  NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGR  ++ALKYI   I   DW FT+ L KK+D  V   L++VME+K+
Subjt:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI

Query:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDK
        +D  L  ++R ++ A  LNLEFGGFPKGHGLPQEGVLS +L NIYL   D EF+R+SM++EA+     T +D   S    WFRRQ          E+   
Subjt:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDK

Query:  IRVYCCRYMDEIFWRVQ--------------------------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTV
        +RVYCCR+MDEI++ V                                              L R++V+ES  VK+VHKLKEKV LFALQK+E W   TV
Subjt:  IRVYCCRYMDEIFWRVQ--------------------------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTV

Query:  GLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHW-----------------------------------------------CVKEYISA
         +GKKWL H LKKVK SEIK LA + S+L+QIS  RKAGMETDHW                                                   Y+S+
Subjt:  GLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHW-----------------------------------------------CVKEYISA

Query:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT
        ETA+  ALLP      +P F  +++AP N+I +RL RY L+T KG+  S+  LIL D  Q IDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRT
Subjt:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT

Query:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        LAAK+RIHE+EIEK+ D ELS I S+ + E+E + +  ++   D DE L YG+S SGLCLLSLAR+VS+SRPCNC+  G
Subjt:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

Q9LZA5 Nuclear intron maturase 3, mitochondrial | e value: 1.7e-33 | % identity: 31.82
Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        +++  V  QY +GKF  L+   ++ P  L  A   + +S+N      DR+    S E M  E+  G FD+       +SS    L+LP +KLKVL EAIR
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ
        +VLE V+   F+  S+G R G    TA++Y++  ++NP WWF +   ++M +E  +  L   + +KI+D  L  +I++++  G L +E GG   G G PQ
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQESLAVKSVH
        E  L  IL N+Y   LD+E   L +K +  N    TG D   +G  +F+                 + +Y  RY+DEI   V     + +   L  + V 
Subjt:  EGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQESLAVKSVH

Query:  KLKEKVEL
         L++++EL
Subjt:  KLKEKVEL

Arabidopsis top hits (e value, %identity, alignment)
AT1G74350.1 Intron maturase, type II family protein; e value 6.2e-164; %identity 48.31
Query:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF
        +LA  LA LVEES     D  +P+++MELKRSLE+++K+RVK Q +NGKF DL+ KVIA P TL++AY+CIR++SNV I   +  ++F+S+AEELS+G F
Subjt:  TLATNLAFLVEESLD--VDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNGNF

Query:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI
        DV  NTFSI++    KEVL+LP + LKV+QEAIRIVLE VF PHFSKISH CRSGR  ++ALKYI   I   DW FT+ L KK+D  V   L++VME+K+
Subjt:  DVNGNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKI

Query:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDK
        +D  L  ++R ++ A  LNLEFGGFPKGHGLPQEGVLS +L NIYL   D EF+R+SM++EA+     T +D   S    WFRRQ          E+   
Subjt:  DDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSG-AEWFRRQLKGNSSDYPGEEKDK

Query:  IRVYCCRYMDEIFWRVQ--------------------------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTV
        +RVYCCR+MDEI++ V                                              L R++V+ES  VK+VHKLKEKV LFALQK+E W   TV
Subjt:  IRVYCCRYMDEIFWRVQ--------------------------------------------VLKRRSVQESLAVKSVHKLKEKVELFALQKQENWNAWTV

Query:  GLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHW-----------------------------------------------CVKEYISA
         +GKKWL H LKKVK SEIK LA + S+L+QIS  RKAGMETDHW                                                   Y+S+
Subjt:  GLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHW-----------------------------------------------CVKEYISA

Query:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT
        ETA+  ALLP      +P F  +++AP N+I +RL RY L+T KG+  S+  LIL D  Q IDW+ G+ RRW  WY  CSNF E+  LI +Q+R SCIRT
Subjt:  ETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSEL-ILICDQVRKSCIRT

Query:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG
        LAAK+RIHE+EIEK+ D ELS I S+ + E+E + +  ++   D DE L YG+S SGLCLLSLAR+VS+SRPCNC+  G
Subjt:  LAAKHRIHESEIEKKFDSELSKIYSSPETEKEEEKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWG

AT5G04050.1 RNA-directed DNA polymerase (reverse transcriptase); e value 1.2e-34; %identity 31.82
Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        +++  V  QY +GKF  L+   ++ P  L  A   + +S+N      DR+    S E M  E+  G FD+       +SS    L+LP +KLKVL EAIR
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ
        +VLE V+   F+  S+G R G    TA++Y++  ++NP WWF +   ++M +E  +  L   + +KI+D  L  +I++++  G L +E GG   G G PQ
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQESLAVKSVH
        E  L  IL N+Y   LD+E   L +K +  N    TG D   +G  +F+                 + +Y  RY+DEI   V     + +   L  + V 
Subjt:  EGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQESLAVKSVH

Query:  KLKEKVEL
         L++++EL
Subjt:  KLKEKVEL

AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase); e value 1.2e-34; %identity 31.82
Query:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR
        +++  V  QY +GKF  L+   ++ P  L  A   + +S+N      DR+    S E M  E+  G FD+       +SS    L+LP +KLKVL EAIR
Subjt:  QIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRL---ISFESMAEELSNGNFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIR

Query:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ
        +VLE V+   F+  S+G R G    TA++Y++  ++NP WWF +   ++M +E  +  L   + +KI+D  L  +I++++  G L +E GG   G G PQ
Subjt:  IVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKM-DELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFPKGHGLPQ

Query:  EGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQESLAVKSVH
        E  L  IL N+Y   LD+E   L +K +  N    TG D   +G  +F+                 + +Y  RY+DEI   V     + +   L  + V 
Subjt:  EGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKRRSVQESLAVKSVH

Query:  KLKEKVEL
         L++++EL
Subjt:  KLKEKVEL

ATMG00520.1 Intron maturase, type II family protein; e value 8.8e-17; %identity 36.03
Query:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFP
        K+++EAIR+VLE ++ P F   SH  RSG+   + L+ I++E     W+   ++RK    +   +LI +++++IDDP+ F  I++++ AG L     G  
Subjt:  KVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIRRIYVAGALNLEFGGFP

Query:  KG-HGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYE
        +G + +P   +LS +  NIYL  LDQE  R+  KYE
Subjt:  KG-HGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYE


Sequences
CDS sequence
ATGACACCATCTGGGAGGGTTCACAACCTCCCGATGAGGCCTGACCACATCCTCCTTTGCTCACACATGTCCCTCGAATTTATTAACAAAGCCTTGAATAAAATCTACAA
GCAAATGGGATCGATGTACCACCTCCATGGAAGAACACTACCTGGTCATGCAAGTTATGCAAAAATGACATTTGGGGGGCTTCGAAGATTATGTGGGATGAACATGCGGA
ACTTTTCAATTTTGCAAAGTGTAAATGTTTGCAAAGTCAAATCATCCTTTGTTTCTGACATTGGAAAGTGTGTTCAGAGAGTTCAGAGTTCTGAAAATTATTCAACTCTC
GCTTGTGCTGATGATGAAATTGACAAGGGCATGGAGAAAAAGACACTGGCCACGAACTTGGCCTTTCTTGTTGAAGAATCTCTTGATGTTGATATGAGAAGACCAAAGAC
TCAAATGGAACTTAAGAGATCCCTTGAAATTCAGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGTAAAGTGATTGCCTGCCCCA
CAACTCTTCAAAATGCTTACAACTGTATTAGAATTAGCTCAAATGTTGATATAATGTCGAATGACCGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGT
AATTTTGATGTCAATGGCAATACTTTCTCCATATTAAGTTCGAGAAAAGAAGTACTCATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTT
GGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGGAGTGGAAGAGCACACTCAACGGCATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATT
GGTGGTTCACAATTGAATTAAGAAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGACGACCCCAGACTATTTGCTGTTATCAGA
AGAATATATGTGGCTGGGGCACTAAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACAAACATCTATCTAAA
ATCTCTTGATCAAGAATTTTTCAGATTATCAATGAAATATGAGGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAGGTGCGGAGTGGTTTAGGAGAC
AATTGAAAGGAAATAGTTCTGATTATCCAGGTGAGGAGAAAGACAAAATAAGAGTCTATTGTTGTCGCTATATGGATGAAATTTTTTGGCGGGTACAGGTTCTAAAGAGA
CGAAGTGTGCAGGAAAGTCTTGCTGTAAAATCCGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGAATTGGAATGCTTGGACAGTGGGGTT
GGGAAAGAAATGGCTCGCTCATGTTTTGAAGAAGGTTAAAGTGTCGGAGATCAAGCATTTAGCTAAGAATAGGTCTTCTTTGAATCAAATTTCTAGTTTTCGTAAAGCTG
GAATGGAAACTGATCACTGGTGCGTGAAAGAATATATTTCTGCTGAGACAGCTTCTACTGTTGCTCTCTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACT
GAGATTATAGCACCCGTCAATTCTATAAGAAAACGACTATTGCGATATAGGTTAGTTACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACAC
TCAAAATATTGACTGGTTTTTAGGAGTATCTCGTCGTTGGTTTAGATGGTACAATAATTGTTCTAACTTCAGCGAGTTGATCTTAATTTGCGATCAAGTTAGGAAATCTT
GTATCAGAACGCTAGCGGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAGTAAGATTTACTCCTCTCCTGAAACAGAGAAAGAAGAA
GAGAAGAAGTCATCAGAAACCCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATC
TCGTCCTTGCAATTGTTGGTCATGGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGATTCTCGGT
TCACCATCCTAGCTTGA
mRNA sequence
ATGACACCATCTGGGAGGGTTCACAACCTCCCGATGAGGCCTGACCACATCCTCCTTTGCTCACACATGTCCCTCGAATTTATTAACAAAGCCTTGAATAAAATCTACAA
GCAAATGGGATCGATGTACCACCTCCATGGAAGAACACTACCTGGTCATGCAAGTTATGCAAAAATGACATTTGGGGGGCTTCGAAGATTATGTGGGATGAACATGCGGA
ACTTTTCAATTTTGCAAAGTGTAAATGTTTGCAAAGTCAAATCATCCTTTGTTTCTGACATTGGAAAGTGTGTTCAGAGAGTTCAGAGTTCTGAAAATTATTCAACTCTC
GCTTGTGCTGATGATGAAATTGACAAGGGCATGGAGAAAAAGACACTGGCCACGAACTTGGCCTTTCTTGTTGAAGAATCTCTTGATGTTGATATGAGAAGACCAAAGAC
TCAAATGGAACTTAAGAGATCCCTTGAAATTCAGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGTAAAGTGATTGCCTGCCCCA
CAACTCTTCAAAATGCTTACAACTGTATTAGAATTAGCTCAAATGTTGATATAATGTCGAATGACCGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGT
AATTTTGATGTCAATGGCAATACTTTCTCCATATTAAGTTCGAGAAAAGAAGTACTCATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTT
GGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGGAGTGGAAGAGCACACTCAACGGCATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATT
GGTGGTTCACAATTGAATTAAGAAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGACGACCCCAGACTATTTGCTGTTATCAGA
AGAATATATGTGGCTGGGGCACTAAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACAAACATCTATCTAAA
ATCTCTTGATCAAGAATTTTTCAGATTATCAATGAAATATGAGGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAGGTGCGGAGTGGTTTAGGAGAC
AATTGAAAGGAAATAGTTCTGATTATCCAGGTGAGGAGAAAGACAAAATAAGAGTCTATTGTTGTCGCTATATGGATGAAATTTTTTGGCGGGTACAGGTTCTAAAGAGA
CGAAGTGTGCAGGAAAGTCTTGCTGTAAAATCCGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGAATTGGAATGCTTGGACAGTGGGGTT
GGGAAAGAAATGGCTCGCTCATGTTTTGAAGAAGGTTAAAGTGTCGGAGATCAAGCATTTAGCTAAGAATAGGTCTTCTTTGAATCAAATTTCTAGTTTTCGTAAAGCTG
GAATGGAAACTGATCACTGGTGCGTGAAAGAATATATTTCTGCTGAGACAGCTTCTACTGTTGCTCTCTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACT
GAGATTATAGCACCCGTCAATTCTATAAGAAAACGACTATTGCGATATAGGTTAGTTACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACAC
TCAAAATATTGACTGGTTTTTAGGAGTATCTCGTCGTTGGTTTAGATGGTACAATAATTGTTCTAACTTCAGCGAGTTGATCTTAATTTGCGATCAAGTTAGGAAATCTT
GTATCAGAACGCTAGCGGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAGTAAGATTTACTCCTCTCCTGAAACAGAGAAAGAAGAA
GAGAAGAAGTCATCAGAAACCCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATC
TCGTCCTTGCAATTGTTGGTCATGGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGATTCTCGGT
TCACCATCCTAGCTTGA
Protein sequence
MTPSGRVHNLPMRPDHILLCSHMSLEFINKALNKIYKQMGSMYHLHGRTLPGHASYAKMTFGGLRRLCGMNMRNFSILQSVNVCKVKSSFVSDIGKCVQRVQSSENYSTL
ACADDEIDKGMEKKTLATNLAFLVEESLDVDMRRPKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVIACPTTLQNAYNCIRISSNVDIMSNDRLISFESMAEELSNG
NFDVNGNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRAHSTALKYIRKEIKNPDWWFTIELRKKMDELVMAKLITVMEDKIDDPRLFAVIR
RIYVAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLKSLDQEFFRLSMKYEAINEYGNTGQDGSQSGAEWFRRQLKGNSSDYPGEEKDKIRVYCCRYMDEIFWRVQVLKR
RSVQESLAVKSVHKLKEKVELFALQKQENWNAWTVGLGKKWLAHVLKKVKVSEIKHLAKNRSSLNQISSFRKAGMETDHWCVKEYISAETASTVALLPNYDPSVKPTFIT
EIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQNIDWFLGVSRRWFRWYNNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSPETEKEE
EKKSSETHGLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCWSWGVWLLHQVFILFMSWRDKSFRDGRLDSRFTILA