CuGenDBv2

Lsi04G010690 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G010690
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr04:12771401..12778084
RNA-Seq Expression: Lsi04G010690
Synteny: Lsi04G010690
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
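
The genome location above has the form `chromosome:start..end`. A minimal Python sketch for splitting it into its parts and computing the span length follows. The function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr04:12771401..12778084")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```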


Homology
GenBank top hits (e value, % identity, alignment)
KAE8652677.1 hypothetical protein Csa_014062 [Cucumis sativus] | e value: 3.7e-31 | identity: 77.55%
Query:  PSVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKN
        P VA EVVNL  SRE+WLALEELYGATSK+    +  ILQNT KGTMRM EYLSMMKQ  ENMQLAGSPIS +DL SYVL GLDVEY+PIVCDIEGKN
Subjt:  PSVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKN

KAF4385593.1 hypothetical protein F8388_010149 [Cannabis sativa] | e value: 4.8e-31 | identity: 45.57%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL
        V A++RG + DGY+ GTK +P +FL +    G     VNP FE W   DQ LLGWL GSMT S+A EV+   +S  +W ALE L+GA S+ +++  R+ +
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL

Query:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK
        Q   KG M MAEYL   +Q ++ + LAG P  +  L+S +L+GLD+EYLPIV  +E +
Subjt:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK

XP_022143579.1 ankyrin repeat-containing protein NPR4-like [Momordica charantia] | e value: 3.2e-43 | identity: 58.23%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL
        VLAVL GQK DGYV  TKT P ++  ++ + G     +NP++EEW+A+DQA  GWL GSMTPS+A +VVNL TS EVW ALE L+G+TSKARINQLR+ L
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL

Query:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK
        QNT KG M+M  YL+ MKQ SE+++LAG P++   L S +L G + EYLPI+C IE K
Subjt:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] | e value: 2.3e-49 | identity: 49.62%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI
        VLAVLRGQK DGYV GT  +PPQFL S   EG S    VNP + EW A+DQALLGWL GSMTPS+A +VV+  +SREVW ALE+LYGATSKARINQLR++
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI

Query:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY
        LQNT K +++M+EYL +MKQASE+++LAG P++ + L+S VL+GL+ EYLPIVC IEGK                      N+V+     G   D S NY
Subjt:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY

Query:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP
        V+S+Q +  N +  Q Q   G    + N++ A  N + +G G       F   RG  S P
Subjt:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] | e value: 2.3e-49 | identity: 49.62%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI
        VLAVLRGQK DGYV GT  +PPQFL S   EG S    VNP + EW A+DQALLGWL GSMTPS+A +VV+  +SREVW ALE+LYGATSKARINQLR++
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI

Query:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY
        LQNT K +++M+EYL +MKQASE+++LAG P++ + L+S VL+GL+ EYLPIVC IEGK                      N+V+     G   D S NY
Subjt:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY

Query:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP
        V+S+Q +  N +  Q Q   G    + N++ A  N + +G G       F   RG  S P
Subjt:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LUB0 Uncharacterized protein | e value: 1.8e-31 | identity: 77.55%
Query:  PSVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKN
        P VA EVVNL  SRE+WLALEELYGATSK+    +  ILQNT KGTMRM EYLSMMKQ  ENMQLAGSPIS +DL SYVL GLDVEY+PIVCDIEGKN
Subjt:  PSVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKN

A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like | e value: 1.6e-43 | identity: 58.23%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL
        VLAVL GQK DGYV  TKT P ++  ++ + G     +NP++EEW+A+DQA  GWL GSMTPS+A +VVNL TS EVW ALE L+G+TSKARINQLR+ L
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL

Query:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK
        QNT KG M+M  YL+ MKQ SE+++LAG P++   L S +L G + EYLPI+C IE K
Subjt:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 | e value: 1.1e-49 | identity: 49.62%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI
        VLAVLRGQK DGYV GT  +PPQFL S   EG S    VNP + EW A+DQALLGWL GSMTPS+A +VV+  +SREVW ALE+LYGATSKARINQLR++
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI

Query:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY
        LQNT K +++M+EYL +MKQASE+++LAG P++ + L+S VL+GL+ EYLPIVC IEGK                      N+V+     G   D S NY
Subjt:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY

Query:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP
        V+S+Q +  N +  Q Q   G    + N++ A  N + +G G       F   RG  S P
Subjt:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 | e value: 1.1e-49 | identity: 49.62%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI
        VLAVLRGQK DGYV GT  +PPQFL S   EG S    VNP + EW A+DQALLGWL GSMTPS+A +VV+  +SREVW ALE+LYGATSKARINQLR++
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKS-SGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSI

Query:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY
        LQNT K +++M+EYL +MKQASE+++LAG P++ + L+S VL+GL+ EYLPIVC IEGK                      N+V+     G   D S NY
Subjt:  LQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK----------------------NVVNVNGDFGDFGDASTNY

Query:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP
        V+S+Q +  N +  Q Q   G    + N++ A  N + +G G       F   RG  S P
Subjt:  VYSRQGN--NVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQNRRGRGSLP

A0A7J6FZV1 Uncharacterized protein | e value: 2.3e-31 | identity: 45.57%
Query:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL
        V A++RG + DGY+ GTK +P +FL +    G     VNP FE W   DQ LLGWL GSMT S+A EV+   +S  +W ALE L+GA S+ +++  R+ +
Subjt:  VLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSIL

Query:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK
        Q   KG M MAEYL   +Q ++ + LAG P  +  L+S +L+GLD+EYLPIV  +E +
Subjt:  QNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGK

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 3.6e-13 | identity: 25.42%
Query:  KQVLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRS
        +QV A+  G +  G++ G+ T PP  +     G  +   VNP +  W   D+ +   +LG+++ SV   V   TT+ ++W  L ++Y   S   + QLR+
Subjt:  KQVLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRS

Query:  ILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKNVVNVNGDFGD--------FGDASTNYVYSRQGNNVKNEQ
         L+   KGT  + +Y+  +    + + L G P+  D+ +  VL  L  EY P++  I  K+      +  +            S+  V     N V +  
Subjt:  ILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKNVVNVNGDFGD--------FGDASTNYVYSRQGNNVKNEQ

Query:  QQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQ
              N+    NN+   RNN++    NN N   +Q
Subjt:  QQRFPGNSFRNQNNHFAPRNNQFQGIGNNGNGGIFQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 4.7e-08 | identity: 28.99%
Query:  NPSFEEWTAIDQALLGWLLGSMTP-SVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLI
        N +   W   D  +   L G++TP       V  +TSR++WL ++  +     AR  +L S L+    G MR+A+Y   MK+ +++++    P++  +L+
Subjt:  NPSFEEWTAIDQALLGWLLGSMTP-SVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLI

Query:  SYVLAGLDVEYLPIVCDIEGKNVVNVNGDFGDFGDAST
         YVL GL+ ++  I+      NV+     F  F DA+T
Subjt:  SYVLAGLDVEYLPIVCDIEGKNVVNVNGDFGDFGDAST

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 7.5e-06 | identity: 24.89%
Query:  EEWTAIDQALLGWLLGSMTPSVALEVVNL-TTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVL
        + W   D  +  W+ G++T S+   ++ +  T+R++WL+LE L+    +AR  Q  + L+ T    + + EY   +K  S+ +    SPIS   L+ ++L
Subjt:  EEWTAIDQALLGWLLGSMTPSVALEVVNL-TTSREVWLALEELYGATSKARINQLRSILQNTIKGTMRMAEYLSMMKQASENMQLAGSPISQDDLISYVL

Query:  AGLDVEYLPIVCDIEGKNVVNVNGDFGDFGDASTNYVY--SRQGNNVKN------------------EQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGN
         GL  +Y  I+      NV+     F  F +A +  +   SR  N  K+                   QQ+R+P     N +N    R+ +    G + +
Subjt:  AGLDVEYLPIVCDIEGKNVVNVNGDFGDFGDASTNYVY--SRQGNNVKN------------------EQQQRFPGNSFRNQNNHFAPRNNQFQGIGNNGN

Query:  GGIFQNRRGRGSLPMPTTPSALSHPTSSP
        G    N   R + P    P+ +  P  SP
Subjt:  GGIFQNRRGRGSLPMPTTPSALSHPTSSP
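
The % identity values in the hit tables can be related to the middle line of each alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A minimal Python sketch follows, using the middle line of the KAE8652677.1 alignment. It assumes identity is counted over the aligned columns shown, which fits the 77.55 reported for that hit but is not stated on this page. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline):
    """Percent of alignment columns whose midline character is a letter."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Middle line of the KAE8652677.1 alignment, copied without its leading indent.
# Internal spaces are significant: each one is an alignment column.
midline = (
    "P VA EVVNL  SRE+WLALEELYGATSK+    +  ILQNT KGTMRM EYLSMMKQ  "
    "ENMQLAGSPIS +DL SYVL GLDVEY+PIVCDIEGKN"
)
identity = percent_identity(midline)
print(len(midline), round(identity, 2))
```

A middle line that ends in mismatches would lose trailing spaces if the text were stripped, so in general the column count is safer taken from the query line.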


Sequences
CDS sequence
ATGGAGAAGCAGGTCCTAGCCGTTCTCCGAGGTCAAAAGGCCGATGGATATGTTTTTGGCACCAAAACTCAGCCACCACAGTTTTTAAAGTCTAGTGGTGAAGGTGGTTC
ATCTGTTTCAAATGTTAACCCATCTTTTGAAGAATGGACAGCAATAGACCAAGCTCTTCTTGGTTGGTTACTCGGTTCAATGACTCCTTCTGTGGCATTAGAGGTCGTCA
ATTTGACAACGTCAAGAGAGGTATGGTTGGCTCTAGAGGAGCTTTATGGTGCAACTAGTAAAGCTCGTATAAATCAATTGAGATCAATCCTTCAAAATACTATAAAAGGG
ACGATGAGGATGGCAGAATATCTATCAATGATGAAACAAGCATCTGAGAATATGCAATTGGCAGGATCACCAATTTCACAAGATGATCTAATCTCATATGTCTTAGCGGG
TCTTGATGTTGAATACCTACCAATTGTGTGCGATATTGAAGGTAAGAATGTTGTGAATGTTAATGGAGATTTTGGAGATTTTGGAGATGCTAGTACAAATTATGTCTATA
GTAGGCAAGGGAATAATGTCAAGAATGAGCAGCAACAACGTTTCCCTGGGAATAGTTTTCGAAATCAGAACAATCATTTTGCTCCTAGAAATAATCAATTCCAAGGAATA
GGAAACAATGGTAATGGTGGTATTTTTCAAAATCGAAGAGGTAGAGGCAGTCTTCCAATGCCAACTACTCCCTCTGCCTTGTCACATCCCACTTCTTCACCTACTGATGT
TCAGCCTAGTCCTTTATCCTTGGGTCCTTCAACTTCATCTTGTCCTTTATCCCAAAGTAATTATGACCCTCAGCCCATTTATAGCATGGCAGGAACACCTTCCCACATTG
CTTGTTCTGACCAAGTTGCAATTTCACATGAAGCTAACGGAACATCACTAGTCAGCATCTCTCCAGCTGGTTCTGCCATTTCTCCTGTTGACCATCCTCGAGAACCAGAA
AATTTGGATCCACAAATGGCTCCATGTGATTTTGTTGATGTACCTACTCTTTCTCCTCCTGAAATTTTGAATACTAATGAAGTGTTAGATCAACCATCTCACACTATACA
AACTCGATCTAAAAGTGGAGTTTTTAAGCCTAATCCCAAATATGCCTACCATGTTTCTGCCGAAGATTGGTCCTCTGTTTTCTTATTCATTCTCCTCGTCTTTGATTCTA
AGGTTTCCCTGGCCAGACATGGCGGCTACTGTTGGGTTGTTTCGGCCTTGAAGGAGCCCAGTCATACTTCCCCGCTGATCTCCACCACTGCTATTACACAACAGCTCTCC
GCCACCGCCATCGCACAACAACTCCTCTGTCACCACCGTCACACAGCTACTCGTCTGTCGCGTCGTTTTCACTAA
mRNA sequence
ATGGAGAAGCAGGTCCTAGCCGTTCTCCGAGGTCAAAAGGCCGATGGATATGTTTTTGGCACCAAAACTCAGCCACCACAGTTTTTAAAGTCTAGTGGTGAAGGTGGTTC
ATCTGTTTCAAATGTTAACCCATCTTTTGAAGAATGGACAGCAATAGACCAAGCTCTTCTTGGTTGGTTACTCGGTTCAATGACTCCTTCTGTGGCATTAGAGGTCGTCA
ATTTGACAACGTCAAGAGAGGTATGGTTGGCTCTAGAGGAGCTTTATGGTGCAACTAGTAAAGCTCGTATAAATCAATTGAGATCAATCCTTCAAAATACTATAAAAGGG
ACGATGAGGATGGCAGAATATCTATCAATGATGAAACAAGCATCTGAGAATATGCAATTGGCAGGATCACCAATTTCACAAGATGATCTAATCTCATATGTCTTAGCGGG
TCTTGATGTTGAATACCTACCAATTGTGTGCGATATTGAAGGTAAGAATGTTGTGAATGTTAATGGAGATTTTGGAGATTTTGGAGATGCTAGTACAAATTATGTCTATA
GTAGGCAAGGGAATAATGTCAAGAATGAGCAGCAACAACGTTTCCCTGGGAATAGTTTTCGAAATCAGAACAATCATTTTGCTCCTAGAAATAATCAATTCCAAGGAATA
GGAAACAATGGTAATGGTGGTATTTTTCAAAATCGAAGAGGTAGAGGCAGTCTTCCAATGCCAACTACTCCCTCTGCCTTGTCACATCCCACTTCTTCACCTACTGATGT
TCAGCCTAGTCCTTTATCCTTGGGTCCTTCAACTTCATCTTGTCCTTTATCCCAAAGTAATTATGACCCTCAGCCCATTTATAGCATGGCAGGAACACCTTCCCACATTG
CTTGTTCTGACCAAGTTGCAATTTCACATGAAGCTAACGGAACATCACTAGTCAGCATCTCTCCAGCTGGTTCTGCCATTTCTCCTGTTGACCATCCTCGAGAACCAGAA
AATTTGGATCCACAAATGGCTCCATGTGATTTTGTTGATGTACCTACTCTTTCTCCTCCTGAAATTTTGAATACTAATGAAGTGTTAGATCAACCATCTCACACTATACA
AACTCGATCTAAAAGTGGAGTTTTTAAGCCTAATCCCAAATATGCCTACCATGTTTCTGCCGAAGATTGGTCCTCTGTTTTCTTATTCATTCTCCTCGTCTTTGATTCTA
AGGTTTCCCTGGCCAGACATGGCGGCTACTGTTGGGTTGTTTCGGCCTTGAAGGAGCCCAGTCATACTTCCCCGCTGATCTCCACCACTGCTATTACACAACAGCTCTCC
GCCACCGCCATCGCACAACAACTCCTCTGTCACCACCGTCACACAGCTACTCGTCTGTCGCGTCGTTTTCACTAA
Protein sequence
MEKQVLAVLRGQKADGYVFGTKTQPPQFLKSSGEGGSSVSNVNPSFEEWTAIDQALLGWLLGSMTPSVALEVVNLTTSREVWLALEELYGATSKARINQLRSILQNTIKG
TMRMAEYLSMMKQASENMQLAGSPISQDDLISYVLAGLDVEYLPIVCDIEGKNVVNVNGDFGDFGDASTNYVYSRQGNNVKNEQQQRFPGNSFRNQNNHFAPRNNQFQGI
GNNGNGGIFQNRRGRGSLPMPTTPSALSHPTSSPTDVQPSPLSLGPSTSSCPLSQSNYDPQPIYSMAGTPSHIACSDQVAISHEANGTSLVSISPAGSAISPVDHPREPE
NLDPQMAPCDFVDVPTLSPPEILNTNEVLDQPSHTIQTRSKSGVFKPNPKYAYHVSAEDWSSVFLFILLVFDSKVSLARHGGYCWVVSALKEPSHTSPLISTTAITQQLS
ATAIAQQLLCHHRHTATRLSRRFH
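
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. A minimal Python sketch follows. To stay short, it uses only the first 90 nucleotides of the CDS and the first 30 residues of the protein. The names `CODON_TABLE` and `translate` are hypothetical, and use of the standard (not an organellar) code is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 90 nt of the CDS and first 30 residues of the protein listed above.
cds_prefix = (
    "ATGGAGAAGCAGGTCCTAGCCGTTCTCCGAGGTCAAAAGGCCGATGGA"
    "TATGTTTTTGGCACCAAAACTCAGCCACCACAGTTTTTAAAG"
)
protein_prefix = "MEKQVLAVLRGQKADGYVFGTKTQPPQFLK"

print(translate(cds_prefix) == protein_prefix)
```

The same function could be applied to the full CDS once its lines are joined; the final `TAA` codon would then appear as a trailing `*`.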