; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022602 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022602
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptiontRNA_int_end_N2 domain-containing protein
Genome locationchr7:34229680..34235034
RNA-Seq ExpressionLag0022602
SyntenyLag0022602
Gene Ontology termsGO:0000379 - tRNA-type intron splice site recognition and cleavage (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0000214 - tRNA-intron endonuclease complex (cellular component)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR024336 - tRNA-splicing endonuclease, subunit Sen54, N-terminal
IPR024337 - tRNA-splicing endonuclease, subunit Sen54


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596967.1 Chromatin assembly factor 1 subunit FAS1, partial [Cucurbita argyrosperma subsp. sororia]1.7e-10687.27Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECLC+SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  RN  +ISS+SSIENKG++D  SEDE+SI EL+DAIQL+EVTPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

KAG7028441.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita argyrosperma subsp. argyrosperma]1.7e-10687.27Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECLC+SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  RN  +ISS+SSIENKG++D  SEDE+SI EL+DAIQL+EVTPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

XP_022156818.1 uncharacterized protein LOC111023660 [Momordica charantia]1.5e-11090.91Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWESSSGGASGDD+IYEQD++DEEECLCASG MRKLQFRKHASTARWNDQMGMAEVLEN+GSLWTTTGIVRCGKIYCS EETLFL+EVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGK+GCLWEQFEVYRHLKSLGFIVGKHKVPWSVK VRN S+IS +SSIEN+GA DLES+DERSISELL +IQLD+V PIFDVFLP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

XP_023538818.1 tRNA-splicing endonuclease subunit Sen54-like [Cucurbita pepo subsp. pepo]5.8e-10787.73Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECLC+SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  RN  +ISSRSSIENKG++D  SEDE+SI EL+DAIQL+EVTPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

XP_038883355.1 uncharacterized protein LOC120074337 [Benincasa hispida]3.1e-10889.09Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEA DWESSSGGASGD+D YE+D+ +EEECLCASGY+RKLQFRKHASTARWND+MGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSL+DVYKK+AEGK+GCLWEQFEVYRHLKSLGFIVGKHKVPWS+KSVR+ S ISS SSIENKGASD++SEDERSISELLD IQL+EV PIFDVFLP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        H+KFRKSSPGDPNFMV LTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

TrEMBL top hitse value%identityAlignment
A0A6J1DRN3 uncharacterized protein LOC1110236607.2e-11190.91Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWESSSGGASGDD+IYEQD++DEEECLCASG MRKLQFRKHASTARWNDQMGMAEVLEN+GSLWTTTGIVRCGKIYCS EETLFL+EVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGK+GCLWEQFEVYRHLKSLGFIVGKHKVPWSVK VRN S+IS +SSIEN+GA DLES+DERSISELL +IQLD+V PIFDVFLP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

A0A6J1FKK4 tRNA-splicing endonuclease subunit Sen541.8e-10686.82Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECLC+SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  +N  +ISS+SSIENKG++D  SEDE+SI EL+DAIQL+EVTPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

A0A6J1I794 tRNA-splicing endonuclease subunit Sen54 isoform X17.0e-10686.82Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECL +SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  RN  +ISSRSSIENKG++D ESEDE+SI ELL+A+QL+E+TPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

A0A6J1I8D9 tRNA-splicing endonuclease subunit Sen54 isoform X37.0e-10686.82Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECL +SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  RN  +ISSRSSIENKG++D ESEDE+SI ELL+A+QL+E+TPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

A0A6J1ICR8 tRNA-splicing endonuclease subunit Sen54 isoform X27.0e-10686.82Show/hide
Query:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH
        MEATDWE SSGGAS DDD +EQD+K+EEECL +SG MRKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVRCGKIYCSIEETLFLIEVGALHLLDH
Subjt:  MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDH

Query:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP
        DNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWSVK  RN  +ISSRSSIENKG++D ESEDE+SI ELL+A+QL+E+TPIFDV+LP
Subjt:  DNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLP

Query:  HSKFRKSSPGDPNFMVCLTR
        HSKFRKSSPGDPNFMVCLTR
Subjt:  HSKFRKSSPGDPNFMVCLTR

SwissProt top hitse value%identityAlignment
O74908 Probable tRNA-splicing endonuclease subunit sen541.0e-0532.52Show/hide
Query:  MRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRC-GKIYCSIEETLFLIEVGALHLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSL
        + +L   KHA  A WN Q GM+ V +  G L+ T G      +++   EETL+L+E G++     +   +SL+ VY   +    G L E + VY HL+  
Subjt:  MRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRC-GKIYCSIEETLFLIEVGALHLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSL

Query:  GF-IVGKHKVPWSVKSVRNSSEI
        GF ++  + VP      R  S+I
Subjt:  GF-IVGKHKVPWSVKSVRNSSEI

Q74ZJ5 tRNA-splicing endonuclease subunit SEN543.3e-0430.39Show/hide
Query:  ARWNDQMGMAEVLENKGSLWTTTGIV-RCGKIYCSIEETLFLIEVGAL----------HLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGF
        A W ++  MA +   +GS   T G V + G+ +  + E ++L+E G +             +H++  LS++DVY   +  +     ++F VY HLK LGF
Subjt:  ARWNDQMGMAEVLENKGSLWTTTGIV-RCGKIYCSIEETLFLIEVGAL----------HLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGF

Query:  IV
        IV
Subjt:  IV

Arabidopsis top hitse value%identityAlignment
AT3G02370.1 unknown protein1.3e-4048.17Show/hide
Query:  MAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSS
        MAEV   +G LWTTTGI+R GK YC IEE L+L E+G L LL D D+  +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+H VPW+ K   N++
Subjt:  MAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSS

Query:  EISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLPHSKFRKSSPGDPNFMVCLT
              S+     +    +D  S+++LL  + + +  P+FDV+LP+S+F+KSSPG+P+F+ C +
Subjt:  EISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLPHSKFRKSSPGDPNFMVCLT

AT3G02370.2 unknown protein1.3e-4048.17Show/hide
Query:  MAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSS
        MAEV   +G LWTTTGI+R GK YC IEE L+L E+G L LL D D+  +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+H VPW+ K   N++
Subjt:  MAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSS

Query:  EISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLPHSKFRKSSPGDPNFMVCLT
              S+     +    +D  S+++LL  + + +  P+FDV+LP+S+F+KSSPG+P+F+ C +
Subjt:  EISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLPHSKFRKSSPGDPNFMVCLT

AT3G57360.1 unknown protein1.6e-4644.69Show/hide
Query:  MEATDWESSS-----GGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGAL
        ME  DWE+SS     GG   DDD          E   + G + KLQFR  +S ARW  ++GMAEV   +G LWTTTGI+R GK YC IEE L+L E+G L
Subjt:  MEATDWESSS-----GGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGAL

Query:  HLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLE-SEDERSISELLDAIQLDEVTP
         +L + D+  + LKD+Y+K+AE KSGC WE +EVYR+LK LG+I+G+H V W++K        ++R + E + A   E   D  ++++LL  +Q+ +   
Subjt:  HLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLE-SEDERSISELLDAIQLDEVTP

Query:  IFDVFLPHSKFRKSSPGDPNFMVCLT
        +FDV+LP+S+F+KSSPG+P+F+ C +
Subjt:  IFDVFLPHSKFRKSSPGDPNFMVCLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCTACAGATTGGGAAAGCTCTTCAGGAGGAGCTAGTGGTGATGATGACATTTACGAGCAAGACATGAAAGATGAAGAAGAATGTCTCTGCGCATCTGGGTACAT
GCGCAAGTTGCAATTTAGGAAGCATGCTTCAACTGCTCGATGGAATGATCAAATGGGAATGGCAGAAGTTTTAGAGAACAAGGGTAGCCTTTGGACGACAACTGGCATTG
TACGTTGTGGCAAGATTTATTGTTCCATTGAGGAAACTTTGTTTCTTATTGAAGTTGGGGCCTTACATCTTCTGGATCATGATAATTCAAGTCTTTCTTTGAAAGATGTA
TATAAGAAGGTAGCTGAAGGAAAGAGCGGATGTCTTTGGGAGCAGTTTGAGGTTTATAGGCACCTCAAATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTC
TGTGAAGAGTGTTAGGAATAGCAGTGAAATTTCTTCTCGAAGTTCTATTGAAAACAAAGGAGCGTCAGATCTTGAATCAGAAGATGAGAGGTCGATCTCCGAACTATTGG
ATGCCATTCAGCTTGATGAAGTGACACCCATTTTTGATGTTTTTCTTCCACATAGCAAGTTTAGAAAGTCTTCTCCTGGTGACCCAAATTTTATGGTCTGCTTGACTAGA
TTGATGGCCCATTATTCTTGGAAGAATGTTAAGATTGCCCTTGAGGATTTCTTTAAATCTTCAGTCTTGATCAACCCCTTTATGGATGATAAAGCCTTGATTCAGGTGGC
AGATGGTTGTTTGGATCTTTCTGTGAATGGCAAGTGGAAGAAATTTGGGAACCCTCACTTGAAATTGGAATTGTGGGTTAAGCAAGTGATATTGGATGAAGAAGTGGACC
TTGTTAATGAAGGGGTAATGTTGAATGAATCGCCATTTATTTCTTGTTATCAGGAGGAATTTAATGTGGCAATAGGTTCTCCAAAAGTTGCTTCAATGCATGATGAGCAT
ATTAATTACATAGGCTGTATTGAACGTCCCTCCAAGATGATTAATGACAGCCATTGCAATTTGAAAGATGATATACAACCGCTACTGGGATCGTCAAATATCAATTCTGA
AAATGGGTTAATTCAGTTGAAGGATTTTAAGGAATTGTCTGACCAAATTCCAAGGGAGAGAGACCATCTTAATGAGGTGTTGGGTTCTCCAAAAGAAAGTTCGGCCGCCC
AGATTCATAAAACTCCTTCCAAGAATTTTAATGCCGTTACTTGCAATTTAAATGATGATGTCCAGCAGGTAACATTAAAGACCTATTCTCGGAAAAAGGCTTCTCATTCA
TTGGCAGTTAAGTCCAACATTAATGCTGATCATTTGGAAGCTGAATGCACTCATTTAATTGCTGCAAATAAGGTTGTGGGATCTTCAAATGTCAGTGCTAGAAATGGTTT
GTTTCAGGCCAAGGAATTTAAAGAATCTTCTGTTCGAATTCCAGGAGGAGTAATATTTTCATTAGAGGAGACTAAACCAGCTGCAATTGATTTGAAATTCATTAAATCCT
TATGGAGTCTCAAGGAAATCGGCTGGTCTTTTGTGGAAGCCAATGGAAAATCAGGGAATTATGCAACCGCCATTGTCTGTTGTTTTATGTATTGCCAAGTCCTTAGTTTC
TGCTTGCTCCCTGAGGGACTACATAAATATCAGCTCCAGTGTTCCACGCCATTCAGATTTTGCTTTATGGTGTGGCCTTGTGGGGACAATGCTGCAGGATCTTTCTTGCC
TGTAATTCCAAGAACTTTTAGTGTGTGTGGTACTGCATATGATCTGCGTCATACAATATTTGGGTTTCCTGGAATGTATCATGAAATGTTCAAAAGTTCAGATAATCATC
CTTCACCAAGTCAAACTATGACCTTGCATTTGGAGAGGTCACCTGTTGTTTCTTCTAGTTTGTGTTTTGAGGTTGTTGCTGATTTTCAGTTGATATGGTGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCTACAGATTGGGAAAGCTCTTCAGGAGGAGCTAGTGGTGATGATGACATTTACGAGCAAGACATGAAAGATGAAGAAGAATGTCTCTGCGCATCTGGGTACAT
GCGCAAGTTGCAATTTAGGAAGCATGCTTCAACTGCTCGATGGAATGATCAAATGGGAATGGCAGAAGTTTTAGAGAACAAGGGTAGCCTTTGGACGACAACTGGCATTG
TACGTTGTGGCAAGATTTATTGTTCCATTGAGGAAACTTTGTTTCTTATTGAAGTTGGGGCCTTACATCTTCTGGATCATGATAATTCAAGTCTTTCTTTGAAAGATGTA
TATAAGAAGGTAGCTGAAGGAAAGAGCGGATGTCTTTGGGAGCAGTTTGAGGTTTATAGGCACCTCAAATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTC
TGTGAAGAGTGTTAGGAATAGCAGTGAAATTTCTTCTCGAAGTTCTATTGAAAACAAAGGAGCGTCAGATCTTGAATCAGAAGATGAGAGGTCGATCTCCGAACTATTGG
ATGCCATTCAGCTTGATGAAGTGACACCCATTTTTGATGTTTTTCTTCCACATAGCAAGTTTAGAAAGTCTTCTCCTGGTGACCCAAATTTTATGGTCTGCTTGACTAGA
TTGATGGCCCATTATTCTTGGAAGAATGTTAAGATTGCCCTTGAGGATTTCTTTAAATCTTCAGTCTTGATCAACCCCTTTATGGATGATAAAGCCTTGATTCAGGTGGC
AGATGGTTGTTTGGATCTTTCTGTGAATGGCAAGTGGAAGAAATTTGGGAACCCTCACTTGAAATTGGAATTGTGGGTTAAGCAAGTGATATTGGATGAAGAAGTGGACC
TTGTTAATGAAGGGGTAATGTTGAATGAATCGCCATTTATTTCTTGTTATCAGGAGGAATTTAATGTGGCAATAGGTTCTCCAAAAGTTGCTTCAATGCATGATGAGCAT
ATTAATTACATAGGCTGTATTGAACGTCCCTCCAAGATGATTAATGACAGCCATTGCAATTTGAAAGATGATATACAACCGCTACTGGGATCGTCAAATATCAATTCTGA
AAATGGGTTAATTCAGTTGAAGGATTTTAAGGAATTGTCTGACCAAATTCCAAGGGAGAGAGACCATCTTAATGAGGTGTTGGGTTCTCCAAAAGAAAGTTCGGCCGCCC
AGATTCATAAAACTCCTTCCAAGAATTTTAATGCCGTTACTTGCAATTTAAATGATGATGTCCAGCAGGTAACATTAAAGACCTATTCTCGGAAAAAGGCTTCTCATTCA
TTGGCAGTTAAGTCCAACATTAATGCTGATCATTTGGAAGCTGAATGCACTCATTTAATTGCTGCAAATAAGGTTGTGGGATCTTCAAATGTCAGTGCTAGAAATGGTTT
GTTTCAGGCCAAGGAATTTAAAGAATCTTCTGTTCGAATTCCAGGAGGAGTAATATTTTCATTAGAGGAGACTAAACCAGCTGCAATTGATTTGAAATTCATTAAATCCT
TATGGAGTCTCAAGGAAATCGGCTGGTCTTTTGTGGAAGCCAATGGAAAATCAGGGAATTATGCAACCGCCATTGTCTGTTGTTTTATGTATTGCCAAGTCCTTAGTTTC
TGCTTGCTCCCTGAGGGACTACATAAATATCAGCTCCAGTGTTCCACGCCATTCAGATTTTGCTTTATGGTGTGGCCTTGTGGGGACAATGCTGCAGGATCTTTCTTGCC
TGTAATTCCAAGAACTTTTAGTGTGTGTGGTACTGCATATGATCTGCGTCATACAATATTTGGGTTTCCTGGAATGTATCATGAAATGTTCAAAAGTTCAGATAATCATC
CTTCACCAAGTCAAACTATGACCTTGCATTTGGAGAGGTCACCTGTTGTTTCTTCTAGTTTGTGTTTTGAGGTTGTTGCTGATTTTCAGTTGATATGGTGGAAGTGA
Protein sequenceShow/hide protein sequence
MEATDWESSSGGASGDDDIYEQDMKDEEECLCASGYMRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRCGKIYCSIEETLFLIEVGALHLLDHDNSSLSLKDV
YKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSVKSVRNSSEISSRSSIENKGASDLESEDERSISELLDAIQLDEVTPIFDVFLPHSKFRKSSPGDPNFMVCLTR
LMAHYSWKNVKIALEDFFKSSVLINPFMDDKALIQVADGCLDLSVNGKWKKFGNPHLKLELWVKQVILDEEVDLVNEGVMLNESPFISCYQEEFNVAIGSPKVASMHDEH
INYIGCIERPSKMINDSHCNLKDDIQPLLGSSNINSENGLIQLKDFKELSDQIPRERDHLNEVLGSPKESSAAQIHKTPSKNFNAVTCNLNDDVQQVTLKTYSRKKASHS
LAVKSNINADHLEAECTHLIAANKVVGSSNVSARNGLFQAKEFKESSVRIPGGVIFSLEETKPAAIDLKFIKSLWSLKEIGWSFVEANGKSGNYATAIVCCFMYCQVLSF
CLLPEGLHKYQLQCSTPFRFCFMVWPCGDNAAGSFLPVIPRTFSVCGTAYDLRHTIFGFPGMYHEMFKSSDNHPSPSQTMTLHLERSPVVSSSLCFEVVADFQLIWWK