CuGenDBv2

CmoCh18G004040 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G004040
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Rho_N domain-containing protein
Genome location: Cmo_Chr18:2620130..2623023
RNA-Seq Expression: CmoCh18G004040
Synteny: CmoCh18G004040
Gene Ontology terms: GO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domains: IPR011112 - Rho termination factor, N-terminal
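The genome location above follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into parts is given here; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)",
                         location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return match["seqid"], start, end, end - start + 1


print(parse_location("Cmo_Chr18:2620130..2623023"))
```

Under that assumption the gene spans 2894 bp on Cmo_Chr18, which covers the whole gene model and not just the coding sequence.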


Homology
GenBank top hits: e value, % identity, alignment
KAG6573318.1 hypothetical protein SDJN03_27205, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.8e-98 | % identity: 97.04
Query:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
        +  L EIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDE++NKDED
Subjt:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED

Query:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
        GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
Subjt:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL

Query:  LTS
        LTS
Subjt:  LTS

KAG7012488.1 hypothetical protein SDJN02_25240, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.5e-97 | % identity: 96.55
Query:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
        +  L EIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDE++NKDED
Subjt:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED

Query:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
        GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAA EFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
Subjt:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL

Query:  LTS
        LTS
Subjt:  LTS

XP_022954445.1 uncharacterized protein LOC111456710, partial [Cucurbita moschata] | e value: 4.5e-99 | % identity: 98.03
Query:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
        +  L EIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
Subjt:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED

Query:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
        GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
Subjt:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL

Query:  LTS
        LTS
Subjt:  LTS

XP_022994883.1 uncharacterized protein LOC111490473 [Cucurbita maxima] | e value: 6.3e-109 | % identity: 95.18
Query:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIAL
        MEAVV QSRTLIRFPNLVSFTRRRPIFTLKEIADGYRS SIQLAVSSNG DG  GHQPVRRSSAPGRTRKNV SLRKTDTHKNEDMKKPKSNNQEEIIAL
Subjt:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIAL

Query:  FRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLK
        FRKIQTSIAEEAASSIDE+SNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRR  TSEAAAEFKLVRPPSNFVKRSPIP+PAGGNG+HLTVENMKLK
Subjt:  FRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLK

Query:  ELKAVAKSRGIKGYSKLKKNELLELLTS
        ELKAVAKSRGIKGYSKLKKNELLELLTS
Subjt:  ELKAVAKSRGIKGYSKLKKNELLELLTS

XP_023542167.1 uncharacterized protein LOC111802132, partial [Cucurbita pepo subsp. pepo] | e value: 1.2e-94 | % identity: 94.61
Query:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
        +  L EIADGYRSK+IQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNV SLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDE+SNKDED
Subjt:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED

Query:  GTESILEALTESRKQVKGKTLKNA-GVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLE
        GTESILEALTESRKQVKGKTLKNA GVKGLRR GTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNG+HLTVENMKL+ELKAVAKSRGIKGYSKLKKNELLE
Subjt:  GTESILEALTESRKQVKGKTLKNA-GVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLE

Query:  LLTS
        LLTS
Subjt:  LLTS
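The % identity values reported for these hits can be cross-checked against the alignment midlines. The sketch below is a minimal Python illustration, not the database's own method: it assumes that a midline character equal to the query residue marks an identical column, while `+` and a space mark a conservative substitution and a mismatch or gap. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the midline repeats the query residue.

    Assumption: BLAST-style midline, where an identical column shows the
    residue itself, '+' shows a conservative substitution, and ' ' shows a
    mismatch or gap. Gap columns count toward the alignment length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / len(query)


# First ten columns of the KAG6573318.1 alignment shown above.
query_prefix = "IFTLKEIADG"
midline_prefix = "+  L EIADG"
print(percent_identity(query_prefix, midline_prefix))  # 60.0
```

Applied to the full KAG6573318.1 alignment, the three segments give 203 columns, of which 6 are not identical, so 197/203 rounds to the reported 97.04.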

TrEMBL top hits: e value, % identity, alignment
A0A0A0LX66 Rho_N domain-containing protein | e value: 8.4e-59 | % identity: 60.73
Query:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNED-MKKPKSNNQEEIIA
        MEAVV   R LIRFPNL+S  RRRP F  K++AD Y SK+IQ +VS +  DG  G++P RR+S PG+ RK+ SS RKT+T K+E+ +KK ++N+QEE+IA
Subjt:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNED-MKKPKSNNQEEIIA

Query:  LFRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSE----------AAAEFKLVRPPSNFVKRSPIP-----SP
        LFRKIQTSIA+E+ASSIDE S KDE+   SILE L ESRKQ+KGKT K AG K LR  G SE           AA+FKLVRPPS FVKRSPIP     S 
Subjt:  LFRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSE----------AAAEFKLVRPPSNFVKRSPIP-----SP

Query:  AGGNGTHL---TVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
        A      L   + ENMKL ELKA+AKSRGIKGYSKLKKNEL+E+L S
Subjt:  AGGNGTHL---TVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS

A0A6J1CF22 uncharacterized protein LOC111010657 isoform X1 | e value: 4.8e-54 | % identity: 60.26
Query:  RRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSN
        R   +  L EIA    SK IQ++V+SNG  G  G +P RRSS PGRTRKN  +  +      ED+K PKSNNQEEIIALFRKIQTSIA+++A++ DE+S+
Subjt:  RRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSN

Query:  KDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSE--------AAAEFKLVRPPSNFVKRSPIPSPAGGNGTHL-------------------TV
        +DE G ESILE+L ESRKQVKG+T K AGVK LRR G SE         AAEFKLVRPPS FVKRSPIPSP G NG+                     +V
Subjt:  KDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSE--------AAAEFKLVRPPSNFVKRSPIPSPAGGNGTHL-------------------TV

Query:  ENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
        ENMKL ELKAVAKSRGIKGYSKLKKNELLELL S
Subjt:  ENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X2 | e value: 1.3e-64 | % identity: 64.31
Query:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIAL
        MEAVV QSRTL RFPNLVSF  RRPIF LKEIA    SK IQ++V+SNG  G  G +P RRSS PGRTRKN  +  +      ED+K PKSNNQEEIIAL
Subjt:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIAL

Query:  FRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSE--------AAAEFKLVRPPSNFVKRSPIPSPAGGNGTHL
        FRKIQTSIA+++A++ DE+S++DE G ESILE+L ESRKQVKG+T K AGVK LRR G SE         AAEFKLVRPPS FVKRSPIPSP G NG+  
Subjt:  FRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSE--------AAAEFKLVRPPSNFVKRSPIPSPAGGNGTHL

Query:  -------------------TVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
                           +VENMKL ELKAVAKSRGIKGYSKLKKNELLELL S
Subjt:  -------------------TVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS

A0A6J1GSF6 uncharacterized protein LOC111456710 | e value: 2.2e-99 | % identity: 98.03
Query:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
        +  L EIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED
Subjt:  IFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDED

Query:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
        GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL
Subjt:  GTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLEL

Query:  LTS
        LTS
Subjt:  LTS

A0A6J1K6B8 uncharacterized protein LOC111490473 | e value: 3.1e-109 | % identity: 95.18
Query:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIAL
        MEAVV QSRTLIRFPNLVSFTRRRPIFTLKEIADGYRS SIQLAVSSNG DG  GHQPVRRSSAPGRTRKNV SLRKTDTHKNEDMKKPKSNNQEEIIAL
Subjt:  MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIAL

Query:  FRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLK
        FRKIQTSIAEEAASSIDE+SNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRR  TSEAAAEFKLVRPPSNFVKRSPIP+PAGGNG+HLTVENMKLK
Subjt:  FRKIQTSIAEEAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLK

Query:  ELKAVAKSRGIKGYSKLKKNELLELLTS
        ELKAVAKSRGIKGYSKLKKNELLELLTS
Subjt:  ELKAVAKSRGIKGYSKLKKNELLELLTS

SwissProt top hits: e value, % identity, alignment
Q8L4E7 SAP-like protein BP-73 | e value: 3.3e-04 | % identity: 60
Query:  VENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
        +  +K+ EL+ +AKSRGIKGYSK+KKN+L+ELL++
Subjt:  VENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS

Arabidopsis top hits: e value, % identity, alignment
AT1G06190.1 Rho termination factor | e value: 3.4e-04 | % identity: 60
Query:  VENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
        +  +KL EL+ +AKSRG+KG SK+KK EL+ELL S
Subjt:  VENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS

AT4G18740.1 Rho termination factor | e value: 2.6e-20 | % identity: 38.3
Query:  GRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDEDG-----TESILEALTESRKQVKGKTLKNAGVKGLRRTGTS
        GR++K     +K    +  +   P  +NQEEII+L ++IQ+SI++  +  ++E  N DE       T++IL+ L +SRK+ +G T     VK        
Subjt:  GRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDEDG-----TESILEALTESRKQVKGKTLKNAGVKGLRRTGTS

Query:  EAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVEN--------------------MKLKELKAVAKSRGIKGYSKLKKNELLELLTS
            + +L RPPS+FVKR+P+ S A G    L V N                    MKL ELK VAK+RGIKGYSKL+K+ELLEL+ S
Subjt:  EAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVEN--------------------MKLKELKAVAKSRGIKGYSKLKKNELLELLTS

AT4G18740.2 Rho termination factor | e value: 6.4e-19 | % identity: 39.29
Query:  GRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDEDG-----TESILEALTESRKQVKGKTLKNAGVKGLRRTGTS
        GR++K     +K    +  +   P  +NQEEII+L ++IQ+SI++  +  ++E  N DE       T++IL+ L +SRK+ +G T     VK        
Subjt:  GRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDEDG-----TESILEALTESRKQVKGKTLKNAGVKGLRRTGTS

Query:  EAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
            + +L RPPS+FVKR+P+ S A G            +ELK VAK+RGIKGYSKL+K+ELLEL+ S
Subjt:  EAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS

AT4G18740.3 Rho termination factor | e value: 6.4e-19 | % identity: 39.29
Query:  GRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDEDG-----TESILEALTESRKQVKGKTLKNAGVKGLRRTGTS
        GR++K     +K    +  +   P  +NQEEII+L ++IQ+SI++  +  ++E  N DE       T++IL+ L +SRK+ +G T     VK        
Subjt:  GRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAEEAASSIDENSNKDEDG-----TESILEALTESRKQVKGKTLKNAGVKGLRRTGTS

Query:  EAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS
            + +L RPPS+FVKR+P+ S A G            +ELK VAK+RGIKGYSKL+K+ELLEL+ S
Subjt:  EAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKNELLELLTS


Sequences
CDS sequence
ATGGAAGCAGTAGTTCTTCAGTCTCGAACCCTAATCCGTTTCCCCAATTTGGTCTCTTTCACCAGGAGGAGACCCATTTTCACTTTGAAAGAGATTGCAGATGGGTATCG
TAGCAAGAGCATTCAATTGGCTGTTTCAAGCAATGGAAACGATGGAAAAACAGGGCATCAGCCTGTTCGTAGAAGCTCTGCGCCTGGAAGAACAAGGAAGAATGTGTCAT
CCTTGAGAAAAACAGATACCCATAAGAATGAAGACATGAAAAAACCCAAATCAAATAACCAGGAGGAAATAATTGCTCTCTTTAGGAAGATACAGACTTCCATTGCTGAG
GAAGCTGCAAGCTCCATTGATGAAAATTCTAACAAGGATGAAGATGGAACTGAGTCTATTTTGGAGGCTCTTACTGAATCAAGGAAGCAAGTAAAAGGCAAAACTTTAAA
GAATGCAGGAGTTAAAGGGCTGAGAAGAACAGGCACATCTGAAGCTGCTGCAGAATTCAAGTTAGTACGGCCACCGTCTAACTTTGTGAAGAGATCACCAATCCCATCGC
CGGCAGGAGGAAATGGTACGCATCTTACAGTGGAGAATATGAAACTTAAAGAGCTGAAAGCAGTTGCAAAATCTAGAGGAATTAAGGGATACTCCAAGTTGAAGAAAAAT
GAGCTGCTGGAACTGCTGACATCTTAA
mRNA sequence
ACAGCTTCACTACAACGCCATACATCTGATATCCAGCTATCACCACAGCTTCACTACAACGCCATACATCTGATATCCAGCTATCACCACTGTCCATGAGACAAGGTCCT
TGTCGGGTTGAAATTCTGGGAGCTGCAATGAAGCTCCATGACATTTTTAAGATCACCCACTTCACAAAACTCCATGCCGTTTCGAATTAGGTTTGACAAAGATGAAAGAA
CGCCGAAATTTCAAGCTCACTAAGCCAAGATGATAGTGCCCTGCCGGAGATATCCAGAACATCGGAGGGGTTCTGCTATCAGTACCATCTTTTGCCGCAAATCCCAGTAA
GAAACGGCAAAAACGGTGATGAATTTCCGCTATTAACCAAATAGCGGTAGCAATAATGGCGACAAATCAAAAGAAACGACGAACACTGAGATAGCCACGTGGGAGCACGA
GATTCAGAAGCTTGTGGACAGCCCACATTATAACACACCCATACATTACACCTTGTGGATCAGATCCAGTTTGAACATTCATAGAATTCCATTTTTCTGGGAAAAATCAA
ATGGAAGCAGTAGTTCTTCAGTCTCGAACCCTAATCCGTTTCCCCAATTTGGTCTCTTTCACCAGGAGGAGACCCATTTTCACTTTGAAAGAGATTGCAGATGGGTATCG
TAGCAAGAGCATTCAATTGGCTGTTTCAAGCAATGGAAACGATGGAAAAACAGGGCATCAGCCTGTTCGTAGAAGCTCTGCGCCTGGAAGAACAAGGAAGAATGTGTCAT
CCTTGAGAAAAACAGATACCCATAAGAATGAAGACATGAAAAAACCCAAATCAAATAACCAGGAGGAAATAATTGCTCTCTTTAGGAAGATACAGACTTCCATTGCTGAG
GAAGCTGCAAGCTCCATTGATGAAAATTCTAACAAGGATGAAGATGGAACTGAGTCTATTTTGGAGGCTCTTACTGAATCAAGGAAGCAAGTAAAAGGCAAAACTTTAAA
GAATGCAGGAGTTAAAGGGCTGAGAAGAACAGGCACATCTGAAGCTGCTGCAGAATTCAAGTTAGTACGGCCACCGTCTAACTTTGTGAAGAGATCACCAATCCCATCGC
CGGCAGGAGGAAATGGTACGCATCTTACAGTGGAGAATATGAAACTTAAAGAGCTGAAAGCAGTTGCAAAATCTAGAGGAATTAAGGGATACTCCAAGTTGAAGAAAAAT
GAGCTGCTGGAACTGCTGACATCTTAAACCATGGCAGACCATTTTGGGTACAAATTCATGGAGCTGAACAACACCAGTCATCACAGTAAATCATTGGTCGTTTGTGTTCT
TCTTGTTGCTTTTCCTTTTGTGTTCCTCGTTCTATTCGTTTCACGTGTTTTCGTTACAAAATTACAAGTTTGGACCATAACATTCAAAAGCGTATGTCCATACATTGAAT
CTATAACGATGCTCTAGAGTATAAGACATTGTTTGATATTGTGAGATCCTAGATCGGTTGGAGAGGGAAACGAATCATTCTTTATAAAGGTGTGGAAA
Protein sequence
MEAVVLQSRTLIRFPNLVSFTRRRPIFTLKEIADGYRSKSIQLAVSSNGNDGKTGHQPVRRSSAPGRTRKNVSSLRKTDTHKNEDMKKPKSNNQEEIIALFRKIQTSIAE
EAASSIDENSNKDEDGTESILEALTESRKQVKGKTLKNAGVKGLRRTGTSEAAAEFKLVRPPSNFVKRSPIPSPAGGNGTHLTVENMKLKELKAVAKSRGIKGYSKLKKN
ELLELLTS
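The protein sequence above corresponds to the CDS translated codon by codon. As a minimal sketch of that relationship, the Python below builds the standard genetic code table and translates up to the first stop codon; the standard table is assumed here because the record does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is ignored so the wrapped sequence can be pasted as-is.
    Codons containing unexpected characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 27 bases of the CDS listed above.
print(translate("ATGGAAGCAGTAGTTCTTCAG"))        # MEAVVLQ
print(translate("GAGCTGCTGGAACTGCTGACATCTTAA"))  # ELLELLTS
```

Both fragments match the start (MEAVVLQ) and end (ELLELLTS) of the listed protein sequence; pasting the full CDS into `translate` is one way to check the remaining residues.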