; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012055 (gene) of Snake gourd v1 genome

Gene IDTan0012055
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptiontRNA_int_end_N2 domain-containing protein
Genome locationLG10:66686247..66688284
RNA-Seq ExpressionTan0012055
SyntenyTan0012055
Gene Ontology termsGO:0000379 - tRNA-type intron splice site recognition and cleavage (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0000214 - tRNA-intron endonuclease complex (cellular component)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR024336 - tRNA-splicing endonuclease, subunit Sen54, N-terminal
IPR024337 - tRNA-splicing endonuclease, subunit Sen54


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596967.1 Chromatin assembly factor 1 subunit FAS1, partial [Cucurbita argyrosperma subsp. sororia]2.8e-11080.08Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECLCS G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG RNG DISS+SSI ENKG+T+F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKVELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

KAG7028441.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita argyrosperma subsp. argyrosperma]2.8e-11080.08Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECLCS G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG RNG DISS+SSI ENKG+T+F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKVELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

XP_022156818.1 uncharacterized protein LOC111023660 [Momordica charantia]1.1e-10979.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQD+ DEEECLC+ G M ++  +KHAS ARWND+MGMA     +G LWTTTGIVRCGKIYCS EETLFL+EVGALHLLD+DN +LSLKDVYKKVAEGKNG
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNG+DIS +SSI EN+GA + ES+DERSISELL S QL++V PIFDVFLPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+K+IE ++RTSRGI +KYCHVEHGRVCFFS DK+ELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

XP_022940787.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita moschata]6.2e-11079.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECLCS G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG +NG DISS+SSI ENKG+T+F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKVELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

XP_023538818.1 tRNA-splicing endonuclease subunit Sen54-like [Cucurbita pepo subsp. pepo]9.6e-11180.49Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECLCS G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI ENKG+T+F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKVELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

TrEMBL top hitse value%identityAlignment
A0A6J1DRN3 uncharacterized protein LOC1110236605.1e-11079.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQD+ DEEECLC+ G M ++  +KHAS ARWND+MGMA     +G LWTTTGIVRCGKIYCS EETLFL+EVGALHLLD+DN +LSLKDVYKKVAEGKNG
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNG+DIS +SSI EN+GA + ES+DERSISELL S QL++V PIFDVFLPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+K+IE ++RTSRGI +KYCHVEHGRVCFFS DK+ELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1FKK4 tRNA-splicing endonuclease subunit Sen543.0e-11079.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECLCS G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG +NG DISS+SSI ENKG+T+F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKVELPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1I794 tRNA-splicing endonuclease subunit Sen54 isoform X12.0e-10979.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECL S G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI ENKG+T+FESEDE+SI ELL++ QLNE+TPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1I8D9 tRNA-splicing endonuclease subunit Sen54 isoform X32.0e-10979.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECL S G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI ENKG+T+FESEDE+SI ELL++ QLNE+TPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1ICR8 tRNA-splicing endonuclease subunit Sen54 isoform X22.0e-10979.67Show/hide
Query:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG
        EQDI++EEECL S G M ++  +KHAS ARWND MGMA     +G LWTT+GIVRCGKIYCSIEETLFLIEVGALHLLD+DN +LSLKDVYKKVAEGK+ 
Subjt:  EQDIRDEEECLCSFGYM-QVAIQKHASIARWNDRMGMA-CCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNG

Query:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT
        C+WEQFEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI ENKG+T+FESEDE+SI ELL++ QLNE+TPIFDV+LPHSKF+KSSPGDPNFMV LT
Subjt:  CLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLT

Query:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP
        RGYPPP+ +IEVI+R S GIPMKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  RGYPPPRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVLP

SwissProt top hitse value%identityAlignment
O74908 Probable tRNA-splicing endonuclease subunit sen542.3e-0637.11Show/hide
Query:  KHASIARWNDRMGMACC-REQGDLWTTTGIVRC-GKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNGCLWEQFEVYRHLKSLGFIV
        KHA IA WN + GM+C  +  G L+ T G      +++   EETL+L+E G++     + L +SL+ VY   +    G L E + VY HL+  GF V
Subjt:  KHASIARWNDRMGMACC-REQGDLWTTTGIVRC-GKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNGCLWEQFEVYRHLKSLGFIV

Arabidopsis top hitse value%identityAlignment
AT3G02370.1 unknown protein6.1e-3950.32Show/hide
Query:  EQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRS
        ++G LWTTTGI+R GK YC IEE L+L E+G L LL D D++ +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+H VPW+ K   N    ++ S
Subjt:  EQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRS

Query:  SIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFM
           E+  A  F  +D  S+++LL    + +  P+FDV+LP+S+FKKSSPG+P+F+
Subjt:  SIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFM

AT3G02370.2 unknown protein6.5e-4147.65Show/hide
Query:  EQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRS
        ++G LWTTTGI+R GK YC IEE L+L E+G L LL D D++ +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+H VPW+ K   N    ++ S
Subjt:  EQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRS

Query:  SIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLTRGYPPPRKNIE
           E+  A  F  +D  S+++LL    + +  P+FDV+LP+S+FKKSSPG+P+F+   +   PP ++ I+
Subjt:  SIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLTRGYPPPRKNIE

AT3G57360.1 unknown protein1.7e-4940Show/hide
Query:  DEEECLCSFGYMQVAIQKHASIARWNDRMGMACCR-EQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWEQ
        D+EE        ++  +  +S ARW   +GMA    ++G LWTTTGI+R GK YC IEE L+L E+G L +L + D++ + LKD+Y+K+AE K+GC WE 
Subjt:  DEEECLCSFGYMQVAIQKHASIARWNDRMGMACCR-EQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWEQ

Query:  FEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLTRGYPP
        +EVYR+LK LG+I+G+H V W++K        ++R +  E          D  ++++LL   Q+ +   +FDV+LP+S+FKKSSPG+P+F+   +   PP
Subjt:  FEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLTRGYPP

Query:  PRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVL
         +++I+V+ +     P+ +CH+  GR  FFS   ++LPVL
Subjt:  PRKNIEVIDRTSRGIPMKYCHVEHGRVCFFSLDKVELPVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGACATTTACTGAGCAAGACATAAGAGATGAAGAAGAATGTCTTTGTTCATTTGGGTACATGCAAGTTGCAATTCAGAAGCATGCTTCAATTGCTCGATGGAATGA
TCGGATGGGAATGGCGTGTTGTAGAGAACAAGGCGACCTTTGGACGACGACGGGCATTGTGCGTTGTGGCAAGATTTATTGTTCCATTGAGGAAACTTTATTTCTTATTG
AAGTTGGGGCCTTGCATCTTCTGGATTATGATAATTTAAATCTTTCTTTGAAAGATGTATACAAGAAGGTAGCTGAAGGAAAAAATGGATGTCTTTGGGAGCAGTTTGAG
GTTTATAGGCACCTCAAATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTCTGTGAAGGGTGTTAGGAATGGAAATGACATTTCTTCTCGAAGTTCTATATT
TGAGAACAAAGGAGCGACAAATTTTGAATCAGAAGATGAGAGGTCGATCTCTGAGCTATTAGATTCCACTCAACTCAATGAAGTGACACCCATTTTTGATGTTTTTCTTC
CACATAGCAAGTTTAAAAAATCTTCTCCTGGGGACCCAAATTTTATGGTCCTCTTGACTAGGGGATACCCACCTCCAAGAAAAAATATTGAAGTTATTGATAGAACATCG
AGAGGCATTCCAATGAAATATTGTCATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGACATTTACTGAGCAAGACATAAGAGATGAAGAAGAATGTCTTTGTTCATTTGGGTACATGCAAGTTGCAATTCAGAAGCATGCTTCAATTGCTCGATGGAATGA
TCGGATGGGAATGGCGTGTTGTAGAGAACAAGGCGACCTTTGGACGACGACGGGCATTGTGCGTTGTGGCAAGATTTATTGTTCCATTGAGGAAACTTTATTTCTTATTG
AAGTTGGGGCCTTGCATCTTCTGGATTATGATAATTTAAATCTTTCTTTGAAAGATGTATACAAGAAGGTAGCTGAAGGAAAAAATGGATGTCTTTGGGAGCAGTTTGAG
GTTTATAGGCACCTCAAATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTCTGTGAAGGGTGTTAGGAATGGAAATGACATTTCTTCTCGAAGTTCTATATT
TGAGAACAAAGGAGCGACAAATTTTGAATCAGAAGATGAGAGGTCGATCTCTGAGCTATTAGATTCCACTCAACTCAATGAAGTGACACCCATTTTTGATGTTTTTCTTC
CACATAGCAAGTTTAAAAAATCTTCTCCTGGGGACCCAAATTTTATGGTCCTCTTGACTAGGGGATACCCACCTCCAAGAAAAAATATTGAAGTTATTGATAGAACATCG
AGAGGCATTCCAATGAAATATTGTCATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA
Protein sequenceShow/hide protein sequence
MTTFTEQDIRDEEECLCSFGYMQVAIQKHASIARWNDRMGMACCREQGDLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNGCLWEQFE
VYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFENKGATNFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFKKSSPGDPNFMVLLTRGYPPPRKNIEVIDRTS
RGIPMKYCHVEHGRVCFFSLDKVELPVLP