CuGenDBv2

CaUC02G047100 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G047100
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: ATPase assembly factor ATP
Genome location: Ciama_Chr02:34842592..34846527
RNA-Seq Expression: CaUC02G047100
Synteny: CaUC02G047100
Gene Ontology terms:
GO:0033615 - mitochondrial proton-transporting ATP synthase complex assembly (biological process)
GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0032592 - integral component of mitochondrial membrane (cellular component)
InterPro domains: IPR007849 - ATPase assembly factor ATP10
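
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, it could be parsed like this; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Ciama_Chr02:34842592..34846527")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3936 bp on Ciama_Chr02.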


Homology
GenBank top hits (e value, % identity, alignment)
XP_004145771.1 uncharacterized protein LOC101222490 isoform X2 [Cucumis sativus] (e value: 1.1e-105; % identity: 73.72)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASLTMQLS Y +KFLVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFAD+SEL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIK D NV+EGNSS S LPMATLLCLSFRANSQA   +     L+ +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         W LCRNPIKK+LLRLMRKS GNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

XP_008458658.1 PREDICTED: uncharacterized protein LOC103497992 isoform X1 [Cucumis melo] (e value: 2.3e-103; % identity: 72.7)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASLTMQLS Y EKFLVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFAD+SEL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        K+HGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIK D NVVEGNSS S LP+ATLLCLSFRANSQA   +     L+ +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         W LCR+PIKK+LLRLMRKS GNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY        +IRWQG GLAT+EEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

XP_022958438.1 uncharacterized protein LOC111459659 [Cucurbita moschata] (e value: 2.3e-103; % identity: 72.35)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPH CSIR SLTMQLS Y+EK LVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFADI+EL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGK+LKLPIKFDAN VEGNS AS LP+ATLLCLSFRA+SQA   +     L  +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         WLLCRNPIKKVLLRLMRKS  NAQNDSLQR+IVYSFGDHYYFRKELKILNLL+GY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

XP_023532866.1 uncharacterized protein LOC111794906 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 3.9e-103; % identity: 72.01)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASLTMQLS Y+EKFLVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADEMNRGYFAD++EL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGK+LKLPIKFDAN VEGN+SAS LP+ATLLCLSFR +SQA   +     L  +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         WLLCRNPIKK+LLRLMRKS  NA NDSLQR+IVYSFGDHYYFRKELKI+NLL+GY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

XP_038900439.1 uncharacterized protein LOC120087664 isoform X1 [Benincasa hispida] (e value: 3.0e-103; % identity: 73.04)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVP ACSIRASLTMQLSS  EKFLVFPS HLAQLT NRFLDIY                          Q GNK AIEKERARLADEMNRGYFADISEL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIK D +VVE NSSAS LPMATLLCLSFRANSQA   +     L+ +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         W LCRNPIKK+LLRLMRK  GNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KGL5 Uncharacterized protein (e value: 5.3e-106; % identity: 73.72)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASLTMQLS Y +KFLVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFAD+SEL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIK D NV+EGNSS S LPMATLLCLSFRANSQA   +     L+ +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         W LCRNPIKK+LLRLMRKS GNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

A0A1S3C8Y6 uncharacterized protein LOC103497992 isoform X1 (e value: 1.1e-103; % identity: 72.7)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASLTMQLS Y EKFLVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFAD+SEL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        K+HGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIK D NVVEGNSS S LP+ATLLCLSFRANSQA   +     L+ +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         W LCR+PIKK+LLRLMRKS GNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY        +IRWQG GLAT+EEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

A0A5A7STN2 Uncharacterized protein (e value: 1.1e-103; % identity: 72.7)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASLTMQLS Y EKFLVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFAD+SEL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        K+HGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIK D NVVEGNSS S LP+ATLLCLSFRANSQA   +     L+ +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         W LCR+PIKK+LLRLMRKS GNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY        +IRWQG GLAT+EEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

A0A6J1H332 uncharacterized protein LOC111459659 (e value: 1.1e-103; % identity: 72.35)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPH CSIR SLTMQLS Y+EK LVFPS HLAQLTSNRFLDIY                          QLGNKTAIEKERARLADE+NRGYFADI+EL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGK+LKLPIKFDAN VEGNS AS LP+ATLLCLSFRA+SQA   +     L  +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         WLLCRNPIKKVLLRLMRKS  NAQNDSLQR+IVYSFGDHYYFRKELKILNLL+GY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

A0A6J1K5D3 uncharacterized protein LOC111490897 (e value: 5.5e-103; % identity: 72.35)
Query:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL
        RLVPHACSIRASL MQLS Y+EKFLVFPS HLAQLTSNRFL+IY                          QLGNKTAIEKERARLADE+NRGYFADI+EL
Subjt:  RLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISEL

Query:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH
        KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGK+LKLPIKFDAN VEGN+SAS LP+ATLLCLSFRA+SQA   +     L  +             + 
Subjt:  KQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW------------MLH

Query:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK
         WLLCRNPIKKVLLRLMRKS  NAQ DSLQR+IVYSFGDHYYFRKELKILNLL+GY        +IRWQGFGLATQEEVSSLLSCASLLLEEK
Subjt:  QWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASLLLEEK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G08220.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: mitochondrial proton-transporting ATP synthase complex assembly; LOCATED IN: mitochondrial inner membrane; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 7 growth stages; CONTAINS InterPro DOMAIN/s: ATPase assembly factor ATP10, mitochondria (InterPro:IPR007849); Has 168 Blast hits to 168 proteins in 86 species: Archae - 6; Bacteria - 0; Metazoa - 2; Fungi - 107; Plants - 30; Viruses - 0; Other Eukaryotes - 23 (source: NCBI BLink). (e value: 9.1e-58; % identity: 53.07)
Query:  FFGCLQLGNKTAIEKERARLADEMNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLS
        F    + GNK AIE ERARL DEMNRGYFAD+ E K+HGGKIAAANK +IPA +A+KFP   V++S+GK+LKLPI  ++N V+  + + V+P  +L+CLS
Subjt:  FFGCLQLGNKTAIEKERARLADEMNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLS

Query:  FRANSQAFFIALIGVNLSGW------------MLHQWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------K
        FRA+SQ    +     L  +             + +WLL   PI+K+LLR+++K   N +N  LQRQ+ Y+FGDHYYFRKE+K+LNLLTGY        +
Subjt:  FRANSQAFFIALIGVNLSGW------------MLHQWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------K

Query:  IRWQGFGLATQEEVSSLLSCASLLLEEK
        IRWQGFG AT EEVS LLSC SLLLE++
Subjt:  IRWQGFGLATQEEVSSLLSCASLLLEEK

AT1G08220.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: mitochondrial proton-transporting ATP synthase complex assembly; LOCATED IN: mitochondrial inner membrane; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 7 growth stages; CONTAINS InterPro DOMAIN/s: ATPase assembly factor ATP10, mitochondria (InterPro:IPR007849); Has 152 Blast hits to 152 proteins in 76 species: Archae - 6; Bacteria - 0; Metazoa - 2; Fungi - 92; Plants - 30; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink). (e value: 3.1e-50; % identity: 52.2)
Query:  MNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW---
        MNRGYFAD+ E K+HGGKIAAANK +IPA +A+KFP   V++S+GK+LKLPI  ++N V+  + + V+P  +L+CLSFRA+SQ    +     L  +   
Subjt:  MNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGW---

Query:  ---------MLHQWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASL
                  + +WLL   PI+K+LLR+++K   N +N  LQRQ+ Y+FGDHYYFRKE+K+LNLLTGY        +IRWQGFG AT EEVS LLSC SL
Subjt:  ---------MLHQWLLCRNPIKKVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGY--------KIRWQGFGLATQEEVSSLLSCASL

Query:  LLEEK
        LLE++
Subjt:  LLEEK


Sequences
CDS sequence
ATGGCGGGGATCAGAGATGACCGTACAGTTCTCCGATTAGGGTTCATCGGAGGCTCAACAGAAACTCGATTAGTACCTCATGCTTGCTCGATTCGAGCTTCTTTAACAAT
GCAGCTCTCTAGTTACAGGGAAAAATTCCTTGTTTTTCCTTCCCACCATTTAGCTCAGCTGACTTCCAATCGCTTCCTCGACATTTATCAGATTCTTCTTGATGCAGTTG
TATTCGGTTACAAAGGGGTAATGTTTCGGATTAAGATCCATTTTTTCGGATGTTTGCAGCTTGGAAACAAAACAGCCATTGAGAAAGAGCGCGCTCGGCTTGCAGATGAA
ATGAATAGAGGATACTTTGCTGATATTTCAGAGCTAAAGCAACATGGCGGAAAGATTGCAGCAGCTAACAAGATTCTAATTCCGGCTATGGCTGCTGTAAAATTTCCGGA
GTTTGAAGTGAGTTATTCTGATGGTAAAACGTTGAAGCTACCGATTAAATTTGATGCTAATGTGGTAGAAGGCAATAGTTCGGCATCGGTCTTGCCAATGGCCACATTAC
TGTGTCTTTCTTTCAGAGCTAACTCCCAGGCTTTCTTCATTGCATTGATCGGTGTCAATCTTAGTGGTTGGATGCTACATCAGTGGCTCTTATGTCGAAACCCAATTAAG
AAAGTGCTTCTTCGGCTAATGAGGAAATCCAAAGGCAATGCACAGAATGATTCACTTCAAAGGCAGATTGTATACTCGTTTGGCGACCATTATTACTTCAGAAAGGAGCT
AAAAATATTAAATCTTCTCACTGGGTACAAAATAAGATGGCAAGGCTTTGGATTGGCAACTCAAGAGGAGGTGTCATCTCTTCTTTCATGCGCGTCACTTCTTTTGGAAG
AAAAATGA
mRNA sequence
ATGGCGGGGATCAGAGATGACCGTACAGTTCTCCGATTAGGGTTCATCGGAGGCTCAACAGAAACTCGATTAGTACCTCATGCTTGCTCGATTCGAGCTTCTTTAACAAT
GCAGCTCTCTAGTTACAGGGAAAAATTCCTTGTTTTTCCTTCCCACCATTTAGCTCAGCTGACTTCCAATCGCTTCCTCGACATTTATCAGATTCTTCTTGATGCAGTTG
TATTCGGTTACAAAGGGGTAATGTTTCGGATTAAGATCCATTTTTTCGGATGTTTGCAGCTTGGAAACAAAACAGCCATTGAGAAAGAGCGCGCTCGGCTTGCAGATGAA
ATGAATAGAGGATACTTTGCTGATATTTCAGAGCTAAAGCAACATGGCGGAAAGATTGCAGCAGCTAACAAGATTCTAATTCCGGCTATGGCTGCTGTAAAATTTCCGGA
GTTTGAAGTGAGTTATTCTGATGGTAAAACGTTGAAGCTACCGATTAAATTTGATGCTAATGTGGTAGAAGGCAATAGTTCGGCATCGGTCTTGCCAATGGCCACATTAC
TGTGTCTTTCTTTCAGAGCTAACTCCCAGGCTTTCTTCATTGCATTGATCGGTGTCAATCTTAGTGGTTGGATGCTACATCAGTGGCTCTTATGTCGAAACCCAATTAAG
AAAGTGCTTCTTCGGCTAATGAGGAAATCCAAAGGCAATGCACAGAATGATTCACTTCAAAGGCAGATTGTATACTCGTTTGGCGACCATTATTACTTCAGAAAGGAGCT
AAAAATATTAAATCTTCTCACTGGGTACAAAATAAGATGGCAAGGCTTTGGATTGGCAACTCAAGAGGAGGTGTCATCTCTTCTTTCATGCGCGTCACTTCTTTTGGAAG
AAAAATGAGAAATTATCCTCCTCCATAAAATGATTCACTTCCAGAAGTTTTGTTCGACGACGATGCAAGGAAAATTAGCAAATACTAATATCTATGAATAAAGATATAAT
TTTCCAATGGTGTGAATTAACACCAAGGAAAGATTTGTGCGAGCTTAATTTCCTGAAAGCAACTATTTCACATATATATAATATCAAGTCGACAAACGATGACTGATTTT
GTTTTTGTTTGTATTTTTTCTTCAACAGTTGCGGAGTCGGGTCGAATTATTCCAAGATTTTTGGAATGATAATTGGTGGCTTATTTGAAATCCCCC
Protein sequence
MAGIRDDRTVLRLGFIGGSTETRLVPHACSIRASLTMQLSSYREKFLVFPSHHLAQLTSNRFLDIYQILLDAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADE
MNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASVLPMATLLCLSFRANSQAFFIALIGVNLSGWMLHQWLLCRNPIK
KVLLRLMRKSKGNAQNDSLQRQIVYSFGDHYYFRKELKILNLLTGYKIRWQGFGLATQEEVSSLLSCASLLLEEK
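
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and an in-frame CDS that starts at the first base; `translate` and `CODON_TABLE` are hypothetical names introduced here, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping one terminal stop codon if present.

    Whitespace and line breaks are ignored. Ambiguous bases such as 'N'
    are not handled in this sketch and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGGCGGGGATCAGAGATGACCGT"))
print(translate("TTGGAAGAAAAATGA"))
```

The two fragments translate to the first eight residues (MAGIRDDR) and the last four residues (LEEK) of the protein sequence above; pasting the full CDS into `translate` would allow the same comparison over the whole length.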