CuGenDBv2

Tan0007943 (gene) of Snake gourd v1 genome

Gene ID: Tan0007943
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ATPase assembly factor ATP
Genome location: LG05:1429311..1432783
RNA-Seq Expression: Tan0007943
Synteny: Tan0007943
Gene Ontology terms:
GO:0033615 - mitochondrial proton-transporting ATP synthase complex assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0032592 - integral component of mitochondrial membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR007849 - ATPase assembly factor ATP10
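
The genome location above uses a `SEQ:START..END` string. As a minimal sketch, the snippet below splits such a string and computes the span of the locus. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQ:START..END' string into (seqid, start, end, span)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    span = end - start + 1
    return seqid, start, end, span


print(parse_location("LG05:1429311..1432783"))
```

Under that assumption the locus spans 3473 bp on LG05.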


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004145771.1 uncharacterized protein LOC101222490 isoform X2 [Cucumis sativus] (e-value: 2.7e-135; identity: 91.18%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+TMQLS Y++KFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADE+NRGYFAD++ELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGKTLKLPIK D N IEGNSS S LP+ATLLCLSFRA+SQAMIDSWSA FL+AFSSS NVQLYEVSFIDSWFLCRNPIKK+LLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFL+DK GRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

XP_022140745.1 uncharacterized protein LOC111011334 isoform X1 [Momordica charantia] (e-value: 4.2e-136; identity: 90.81%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+TMQLSGYQEKFLVFPSQHL+ L SNRFLDIYQLGNKTAIEKERARLADEMNRGYFADI+ELKQHGGKIAAANKILIPA+AAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFP+FEVSYSD KTLKLP+KFDAN +E NSSA  LPVATLLCLSFRASSQ MIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIK+VLLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQRQ+VYSFGDHYYFRKELKILNLLTGY+FLLDKFGRIRWQGFGLATQEE+SSLLSC  L+LEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

XP_022958438.1 uncharacterized protein LOC111459659 [Cucurbita moschata] (e-value: 2.6e-138; identity: 92.65%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PH CSIR S+TMQLSGYQEK LVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADE+NRGYFADIAELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGK+LKLPIKFDAN +EGNS AS LP+ATLLCLSFRASSQAMI+SWSAPFLDAFSSSKNVQLYEVSFIDSW LCRNPIKKVLLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQR+IVYSFGDHYYFRKELKILNLL+GY+FLLDKFGRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

XP_022995314.1 uncharacterized protein LOC111490897 [Cucurbita maxima] (e-value: 1.0e-137; identity: 92.28%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+ MQLSGYQEKFLVFPSQHLAQLTSNRFL+IYQLGNKTAIEKERARLADE+NRGYFADIAELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGK+LKLPIKFDAN +EGN+SAS LPVATLLCLSFRASSQAMI+SWS PFLDAFSSSKN+QLYEVSFIDSW LCRNPIKKVLLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+ DSLQR+IVYSFGDHYYFRKELKILNLL+GY+FLLDKFGRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

XP_023532866.1 uncharacterized protein LOC111794906 isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 1.6e-138; identity: 92.65%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+TMQLSGYQEKFLVFPS HLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFAD+AELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGK+LKLPIKFDAN +EGN+SAS LPVATLLCLSFR SSQAMI+SWSAPFLDAFSSSKNVQLYEVSFIDSW LCRNPIKK+LLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA NDSLQR+IVYSFGDHYYFRKELKI+NLL+GY+FLLDKFGRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KGL5 Uncharacterized protein (e-value: 1.3e-135; identity: 91.18%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+TMQLS Y++KFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADE+NRGYFAD++ELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGKTLKLPIK D N IEGNSS S LP+ATLLCLSFRA+SQAMIDSWSA FL+AFSSS NVQLYEVSFIDSWFLCRNPIKK+LLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFL+DK GRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

A0A1S3C8Y6 uncharacterized protein LOC103497992 isoform X1 (e-value: 1.2e-133; identity: 89.34%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+TMQLS Y+EKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADE+NRGYFAD++ELK+HGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGKTLKLPIK D N +EGNSS S LP+ATLLCLSFRA+SQAMIDSWSA FL+AFSSS NVQLYEVSFIDSWFLCR+PIKK+LLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQRQIVYSFGDHYYFRKELKILNLLTGY+FL+DK GRIRWQG GLAT+EEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

A0A6J1CG05 uncharacterized protein LOC111011334 isoform X1 (e-value: 2.0e-136; identity: 90.81%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+TMQLSGYQEKFLVFPSQHL+ L SNRFLDIYQLGNKTAIEKERARLADEMNRGYFADI+ELKQHGGKIAAANKILIPA+AAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFP+FEVSYSD KTLKLP+KFDAN +E NSSA  LPVATLLCLSFRASSQ MIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIK+VLLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQRQ+VYSFGDHYYFRKELKILNLLTGY+FLLDKFGRIRWQGFGLATQEE+SSLLSC  L+LEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

A0A6J1H332 uncharacterized protein LOC111459659 (e-value: 1.3e-138; identity: 92.65%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PH CSIR S+TMQLSGYQEK LVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADE+NRGYFADIAELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGK+LKLPIKFDAN +EGNS AS LP+ATLLCLSFRASSQAMI+SWSAPFLDAFSSSKNVQLYEVSFIDSW LCRNPIKKVLLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+NDSLQR+IVYSFGDHYYFRKELKILNLL+GY+FLLDKFGRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

A0A6J1K5D3 uncharacterized protein LOC111490897 (e-value: 4.9e-138; identity: 92.28%)
Query:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV
        MFGLKRL+PHACSIRAS+ MQLSGYQEKFLVFPSQHLAQLTSNRFL+IYQLGNKTAIEKERARLADE+NRGYFADIAELKQHGGKIAAANKILIPAMAAV
Subjt:  MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAV

Query:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS
        KFPEFEVSYSDGK+LKLPIKFDAN +EGN+SAS LPVATLLCLSFRASSQAMI+SWS PFLDAFSSSKN+QLYEVSFIDSW LCRNPIKKVLLRLMRKSS
Subjt:  KFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSS

Query:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
         NA+ DSLQR+IVYSFGDHYYFRKELKILNLL+GY+FLLDKFGRIRWQGFGLATQEEVSSLLSCA LLLEEK
Subjt:  SNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G08220.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: mitochondrial proton-transporting ATP synthase complex assembly; LOCATED IN: mitochondrial inner membrane; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 7 growth stages; CONTAINS InterPro DOMAIN/s: ATPase assembly factor ATP10, mitochondria (InterPro:IPR007849); Has 168 Blast hits to 168 proteins in 86 species: Archae - 6; Bacteria - 0; Metazoa - 2; Fungi - 107; Plants - 30; Viruses - 0; Other Eukaryotes - 23 (source: NCBI BLink). (e-value: 3.5e-80; identity: 62.24%)
Query:  PSQHLA-QLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANAIEGNSS
        PSQ  A + T+  FLD Y+ GNK AIE ERARL DEMNRGYFAD+ E K+HGGKIAAANK +IPA +A+KFP   V++S+GK+LKLPI  ++N ++  S 
Subjt:  PSQHLA-QLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANAIEGNSS

Query:  ASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSSSNAKNDSLQRQIVYSFGDHYYFRKELKILNL
           +P  +L+CLSFRASSQ MI SWS PFL++F + K++QL+EVSFID W L   PI+K+LLR+++K ++N +N  LQRQ+ Y+FGDHYYFRKE+K+LNL
Subjt:  ASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSSSNAKNDSLQRQIVYSFGDHYYFRKELKILNL

Query:  LTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
        LTGY+ LLDK GRIRWQGFG AT EEVS LLSC  LLLE++
Subjt:  LTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK

AT1G08220.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: mitochondrial proton-transporting ATP synthase complex assembly; LOCATED IN: mitochondrial inner membrane; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 7 growth stages; CONTAINS InterPro DOMAIN/s: ATPase assembly factor ATP10, mitochondria (InterPro:IPR007849); Has 152 Blast hits to 152 proteins in 76 species: Archae - 6; Bacteria - 0; Metazoa - 2; Fungi - 92; Plants - 30; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink). (e-value: 9.6e-70; identity: 62.44%)
Query:  MNRGYFADIAELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSS
        MNRGYFAD+ E K+HGGKIAAANK +IPA +A+KFP   V++S+GK+LKLPI  ++N ++  S    +P  +L+CLSFRASSQ MI SWS PFL++F + 
Subjt:  MNRGYFADIAELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSS

Query:  KNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSSSNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPL
        K++QL+EVSFID W L   PI+K+LLR+++K ++N +N  LQRQ+ Y+FGDHYYFRKE+K+LNLLTGY+ LLDK GRIRWQGFG AT EEVS LLSC  L
Subjt:  KNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSSSNAKNDSLQRQIVYSFGDHYYFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPL

Query:  LLEEK
        LLE++
Subjt:  LLEEK
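
In the alignments above, the unlabelled middle line marks each aligned column: a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. As a rough sketch of how a percent identity relates to that line, the snippet below counts those marks. The helper names `midline_stats` and `percent` are hypothetical, and this is not necessarily how the database computed its reported values.

```python
def midline_stats(midline):
    """Return (identities, positives, aligned_length) for a BLAST-style midline."""
    # A letter on the midline marks an identical residue in query and subject.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


def percent(count, total):
    """Percentage rounded to two decimals, as in the hit tables."""
    return round(100 * count / total, 2)


# First 12 columns of the midline of the first GenBank hit.
identities, positives, length = midline_stats("MFGLKRL+PHAC")
print(identities, positives, length, percent(identities, length))
```

For orientation only: if a hit were aligned over all 272 residues of the query, 248 identical columns would give `percent(248, 272)`, which is 91.18.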


Sequences
CDS sequence
ATGTTTGGATTGAAGCGATTACTTCCCCATGCTTGCTCGATTCGAGCTTCTGTAACAATGCAGCTCTCTGGTTACCAGGAAAAATTCCTTGTTTTTCCTTCGCAGCATTT
AGCTCAGCTGACATCCAATCGCTTCCTCGACATTTATCAGCTTGGAAACAAAACAGCCATTGAGAAAGAGCGCGCTCGGCTTGCAGATGAAATGAACAGAGGATACTTTG
CTGATATTGCAGAGCTTAAGCAACATGGTGGAAAGATTGCAGCAGCTAACAAGATTCTAATTCCGGCTATGGCTGCTGTAAAATTTCCCGAGTTTGAAGTGAGCTATTCT
GATGGTAAAACGTTGAAGCTGCCCATTAAATTTGATGCTAATGCAATTGAAGGCAATAGTTCGGCATCACCCTTGCCTGTGGCCACGTTACTGTGTCTTTCTTTCAGAGC
AAGCTCCCAGGCCATGATTGATTCTTGGAGTGCCCCTTTTCTCGATGCCTTCTCTAGTTCAAAGAATGTCCAGTTATATGAGGTTTCATTTATAGATTCGTGGTTCTTGT
GTCGAAATCCAATTAAGAAAGTGCTTCTTCGGCTAATGAGGAAATCCAGTAGCAATGCAAAGAATGATTCACTTCAAAGGCAGATTGTATACTCGTTTGGCGACCATTAT
TACTTCAGAAAGGAGCTAAAAATACTAAATCTTCTAACTGGGTATGTCTTCCTGCTTGACAAATTTGGTAGAATAAGATGGCAAGGCTTTGGATTGGCAACTCAAGAGGA
GGTCTCATCTCTTCTTTCATGCGCGCCACTTCTTTTGGAAGAGAAATGA
mRNA sequence
AGCGGAGTACAGAGATGACTGTACAGTTTTCCGATTAGGGTTCATCAGAGGATCAACAGAGAGTGGGAAGGGGAAACCGAGAGTGAAGCCGAATTGATCAGAGATATGTT
TGGATTGAAGCGATTACTTCCCCATGCTTGCTCGATTCGAGCTTCTGTAACAATGCAGCTCTCTGGTTACCAGGAAAAATTCCTTGTTTTTCCTTCGCAGCATTTAGCTC
AGCTGACATCCAATCGCTTCCTCGACATTTATCAGCTTGGAAACAAAACAGCCATTGAGAAAGAGCGCGCTCGGCTTGCAGATGAAATGAACAGAGGATACTTTGCTGAT
ATTGCAGAGCTTAAGCAACATGGTGGAAAGATTGCAGCAGCTAACAAGATTCTAATTCCGGCTATGGCTGCTGTAAAATTTCCCGAGTTTGAAGTGAGCTATTCTGATGG
TAAAACGTTGAAGCTGCCCATTAAATTTGATGCTAATGCAATTGAAGGCAATAGTTCGGCATCACCCTTGCCTGTGGCCACGTTACTGTGTCTTTCTTTCAGAGCAAGCT
CCCAGGCCATGATTGATTCTTGGAGTGCCCCTTTTCTCGATGCCTTCTCTAGTTCAAAGAATGTCCAGTTATATGAGGTTTCATTTATAGATTCGTGGTTCTTGTGTCGA
AATCCAATTAAGAAAGTGCTTCTTCGGCTAATGAGGAAATCCAGTAGCAATGCAAAGAATGATTCACTTCAAAGGCAGATTGTATACTCGTTTGGCGACCATTATTACTT
CAGAAAGGAGCTAAAAATACTAAATCTTCTAACTGGGTATGTCTTCCTGCTTGACAAATTTGGTAGAATAAGATGGCAAGGCTTTGGATTGGCAACTCAAGAGGAGGTCT
CATCTCTTCTTTCATGCGCGCCACTTCTTTTGGAAGAGAAATGAGCAGGAAAAATTAGTCAGATGATATTGGAATCGGTATCGATCTTGATTTTCAAATGGTGGGAATTA
ATACAATACGAAGGAAGGATTTTGTGAGAGTTAAATTTCTAATATTATATATAATAAAGTCAAGTTGTCAAATGTGAGATGACGACTATTTTTGTTATATAATGTATACT
CATTTTATTTCCAAGGTTTTGTTTGTAATTTCACATCTTGTCCGTTTATTATTGAGAATGCCAAGGTTAATGCTAC
Protein sequence
MFGLKRLLPHACSIRASVTMQLSGYQEKFLVFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEMNRGYFADIAELKQHGGKIAAANKILIPAMAAVKFPEFEVSYS
DGKTLKLPIKFDANAIEGNSSASPLPVATLLCLSFRASSQAMIDSWSAPFLDAFSSSKNVQLYEVSFIDSWFLCRNPIKKVLLRLMRKSSSNAKNDSLQRQIVYSFGDHY
YFRKELKILNLLTGYVFLLDKFGRIRWQGFGLATQEEVSSLLSCAPLLLEEK
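
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below does this with the standard genetic code, which is assumed to apply to this nuclear gene. The `translate` function and the table layout are illustrative, not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 39 and last 12 nucleotides of the CDS listed above.
print(translate("ATGTTTGGATTGAAGCGATTACTTCCCCATGCTTGCTCG"))
print(translate("GAAGAGAAATGA"))
```

The two fragments translate to `MFGLKRLLPHACS` and `EEK*`, matching the start and end of the protein sequence, with the final `TGA` as the stop codon.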