CuGenDBv2

Lag0031406 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031406
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr11:8061244..8062496
RNA-Seq Expression: Lag0031406
Synteny: Lag0031406
Gene Ontology terms: NA
InterPro domains: NA
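
The genome location above follows a `chrom:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, this page does not state the convention), the span covered by the gene can be computed as follows. `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:8061244..8062496")
print(chrom, start, end, span_length(start, end))
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before relying on the result.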


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa] | e value: 4.2e-39 | % identity: 37.77
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKM
        M+P+VA Q+MG+ N +DLW A Q+ FGVQSR EED+LR                               SPV  R+L+SQVLLGLDE YN V+ ++ GK 
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKM

Query:  SITCNSISVNMV-------------NNKDAGN-------QRGQQNYSNNRQNYSGRGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQ
         I+   +   ++               K  GN          Q+   N ++N+S   NKK  G NR    G+  G+   NN P CQ+CGK  H+AL+CY 
Subjt:  SITCNSISVNMV-------------NNKDAGN-------QRGQQNYSNNRQNYSGRGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQ

Query:  RFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVK
        RFNKEFS P  VQ+RN+ ++N  +       F++ Q +  FATP++V+DPNWY DSGA+NHVT + S + NPT+Y            +Y           
Subjt:  RFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVK

Query:  AKHTGKILLRGTLNEGLYKFELV
            G+ LLRGTL +G Y+ E V
Subjt:  AKHTGKILLRGTLNEGLYKFELV

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 5.9e-49 | % identity: 37.03
Query:  VATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT-
        +A Q+MG+ N +DLW A Q+LFGVQSR EED+LR                               SPV  R+ +SQ LLGLDE YNPV+A++ GK  I+ 
Subjt:  VATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT-

Query:  ----------------------CNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKG--GGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCY
                                +I  N+VN     N    + YSN++ + + R N +G  GG N GRGRG+  G     NKP CQVC K  H+AL+CY
Subjt:  ----------------------CNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKG--GGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCY

Query:  QRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-----------------------
         RFNKEF  P  VQ+R   ++N  +  + LT  +  Q  NQFAT ++VI+ NWY DSGA+NH+T ++S L+NP++Y                        
Subjt:  QRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-----------------------

Query:  ----------------------VSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYKFELVK
                              VSVSKLA+DNNVY+EFH  +C +K K TG+ LL  T+ +GLY  + ++
Subjt:  ----------------------VSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYKFELVK

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo] | e value: 3.1e-42 | % identity: 36.45
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------
        M+P+VA Q+MG+ N +DLW A Q+ FGVQSR EED+LR  + T         GLDE YN V+ ++ GK  I+                            
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------

Query:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN
          S ++NM        QR Q N   Y  NRQ++SG RGN                     NN P CQ+CGK  H+AL+CY RFNKEFS P  VQNRN+ +
Subjt:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN

Query:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-------------------------------------------
        +N  +       F++ Q +  FATP++V+DPNWY DSGA+NHVT + S + NPT+Y                                            
Subjt:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-------------------------------------------

Query:  --VSVSKLARDNNVYVEFHDGFCLVKAKHTGK
          +SVSKLA+DN++Y+EFH   C +K K TGK
Subjt:  --VSVSKLARDNNVYVEFHDGFCLVKAKHTGK

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo] | e value: 5.5e-39 | % identity: 37.83
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------
        M+P+VA Q+MG+ N +DLW A Q+ FGVQSR EED+LR  + T         GLDE YN V+ ++ GK  I+                            
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------

Query:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN
          S ++NM        QR Q N   Y  NRQ++SG RGN                     NN P CQ+CGK  H+AL+CY RFNKEFS P  VQNRN+ +
Subjt:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN

Query:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYK
        +N  +       F++ Q +  FATP++V+DPNWY DSGA+NHVT + S + NPT+Y            +Y               G+ LLRGTL +G Y+
Subjt:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYK

Query:  FELV
         E V
Subjt:  FELV

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida] | e value: 9.5e-39 | % identity: 41.11
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKM
        M+PEVA QVMG E  +DLW ++ +LFGVQSRVEEDYLR                               SP+  R+LVSQVLLGLDEEYN +VAM+ G++
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKM

Query:  SIT-----------------------------CNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKV
         ++                              ++ SVNM N +       Q N +N+     G G + GGG  RGRGRGR      NN KP+CQVCGKV
Subjt:  SIT-----------------------------CNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKV

Query:  DHTALMCYQRFNKEFSGPSQVQNRND--GNTNCPNAQTQLTAFIANQGSNQFAT-PESVIDPNWYADSGASNHVTSDFSCLANPTDY
         H A  C+ R++++F  P+  QN+ +   N    N Q   TA     GSN F T  E++ D NWY DSGASNHVTSDF+ L NP +Y
Subjt:  DHTALMCYQRFNKEFSGPSQVQNRND--GNTNCPNAQTQLTAFIANQGSNQFAT-PESVIDPNWYADSGASNHVTSDFSCLANPTDY

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X1 | e value: 1.5e-42 | % identity: 36.45
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------
        M+P+VA Q+MG+ N +DLW A Q+ FGVQSR EED+LR  + T         GLDE YN V+ ++ GK  I+                            
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------

Query:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN
          S ++NM        QR Q N   Y  NRQ++SG RGN                     NN P CQ+CGK  H+AL+CY RFNKEFS P  VQNRN+ +
Subjt:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN

Query:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-------------------------------------------
        +N  +       F++ Q +  FATP++V+DPNWY DSGA+NHVT + S + NPT+Y                                            
Subjt:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-------------------------------------------

Query:  --VSVSKLARDNNVYVEFHDGFCLVKAKHTGK
          +SVSKLA+DN++Y+EFH   C +K K TGK
Subjt:  --VSVSKLARDNNVYVEFHDGFCLVKAKHTGK

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X3 | e value: 2.7e-39 | % identity: 37.83
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------
        M+P+VA Q+MG+ N +DLW A Q+ FGVQSR EED+LR  + T         GLDE YN V+ ++ GK  I+                            
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT----------------------------

Query:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN
          S ++NM        QR Q N   Y  NRQ++SG RGN                     NN P CQ+CGK  H+AL+CY RFNKEFS P  VQNRN+ +
Subjt:  CNSISVNMVNNKDAGNQRGQQN---YSNNRQNYSG-RGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGN

Query:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYK
        +N  +       F++ Q +  FATP++V+DPNWY DSGA+NHVT + S + NPT+Y            +Y               G+ LLRGTL +G Y+
Subjt:  TNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYK

Query:  FELV
         E V
Subjt:  FELV

A0A5A7SIT7 Uncharacterized protein | e value: 2.1e-39 | % identity: 37.77
Query:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKM
        M+P+VA Q+MG+ N +DLW A Q+ FGVQSR EED+LR                               SPV  R+L+SQVLLGLDE YN V+ ++ GK 
Subjt:  MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKM

Query:  SITCNSISVNMV-------------NNKDAGN-------QRGQQNYSNNRQNYSGRGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQ
         I+   +   ++               K  GN          Q+   N ++N+S   NKK  G NR    G+  G+   NN P CQ+CGK  H+AL+CY 
Subjt:  SITCNSISVNMV-------------NNKDAGN-------QRGQQNYSNNRQNYSGRGNKKGGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQ

Query:  RFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVK
        RFNKEFS P  VQ+RN+ ++N  +       F++ Q +  FATP++V+DPNWY DSGA+NHVT + S + NPT+Y            +Y           
Subjt:  RFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYKVSVSKLARDNNVYVEFHDGFCLVK

Query:  AKHTGKILLRGTLNEGLYKFELV
            G+ LLRGTL +G Y+ E V
Subjt:  AKHTGKILLRGTLNEGLYKFELV

A0A5A7UDA8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.0e-30 | % identity: 45.81
Query:  DLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSITCNSISVNMVNNKDAGNQRGQQNYSN------NRQNYSG-RGNKK
        DLW A+Q+ FGVQSR EED+LR     +    +VL  LDE YNPV+ ++ GK  I    + +   +N       GQ+N+SN      NRQ +SG RGN  
Subjt:  DLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSITCNSISVNMVNNKDAGNQRGQQNYSN------NRQNYSG-RGNKK

Query:  GGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASN
            N G+GRG         NKP CQVCGK  H+AL+CY RFNKEFS P  VQ+RN+ ++N  +       F++ Q    F TP +V DPNWY DSGA+N
Subjt:  GGGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASN

Query:  HVT
        HVT
Subjt:  HVT

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.9e-49 | % identity: 37.03
Query:  VATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT-
        +A Q+MG+ N +DLW A Q+LFGVQSR EED+LR                               SPV  R+ +SQ LLGLDE YNPV+A++ GK  I+ 
Subjt:  VATQVMGYENTQDLWAAVQELFGVQSRVEEDYLR-------------------------------SPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSIT-

Query:  ----------------------CNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKG--GGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCY
                                +I  N+VN     N    + YSN++ + + R N +G  GG N GRGRG+  G     NKP CQVC K  H+AL+CY
Subjt:  ----------------------CNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKG--GGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCY

Query:  QRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-----------------------
         RFNKEF  P  VQ+R   ++N  +  + LT  +  Q  NQFAT ++VI+ NWY DSGA+NH+T ++S L+NP++Y                        
Subjt:  QRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLANPTDYK-----------------------

Query:  ----------------------VSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYKFELVK
                              VSVSKLA+DNNVY+EFH  +C +K K TG+ LL  T+ +GLY  + ++
Subjt:  ----------------------VSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYKFELVK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCACCCGAAGTAGCAACTCAGGTCATGGGATATGAGAACACTCAAGACCTTTGGGCAGCTGTGCAAGAGCTGTTTGGAGTTCAGTCGCGTGTAGAGGAGGACTACCT
CCGAAGTCCGGTGTCCACTCGATCCTTGGTATCGCAAGTTCTATTGGGCTTAGATGAAGAATATAACCCTGTGGTTGCAATGGTGCATGGTAAAATGAGCATAACCTGCA
ACAGTATCTCTGTGAACATGGTGAACAACAAAGATGCTGGAAATCAAAGGGGACAACAAAACTACTCAAATAATCGTCAAAACTATTCTGGTAGAGGAAATAAAAAAGGA
GGAGGAGGTAATCGTGGTCGTGGAAGAGGTCGGAGCTACGGATCCTACACCAACAATAACAAACCAATCTGTCAGGTATGTGGGAAGGTAGACCATACTGCTCTCATGTG
CTATCAGCGTTTTAATAAGGAATTTTCTGGTCCCTCCCAAGTTCAAAACAGGAATGATGGAAATACCAACTGTCCAAATGCACAGACACAACTTACTGCCTTTATAGCTA
ATCAAGGTTCAAATCAGTTTGCCACACCCGAGTCTGTCATCGATCCTAATTGGTATGCAGACAGTGGAGCTTCAAATCATGTGACCAGTGACTTCAGTTGCCTAGCCAAT
CCCACAGACTATAAAGTTAGTGTCTCCAAATTAGCTCGTGACAATAATGTTTATGTTGAATTTCACGATGGTTTCTGTCTTGTTAAGGCCAAACATACGGGCAAAATACT
ACTGAGAGGAACACTTAATGAAGGGCTATACAAGTTTGAACTTGTGAAAGCTACCTCATTTGATGCTACACAAACAGCCAACCAAGAAGGTTCTGGAGTCAATAAAGCTG
TCTCCTCTGGTTTTGTTGGTTTGAGTAATGTTAACATGGTTGTGTCCAAAGTTTTTTTGGCATAG
mRNA sequence
ATGTCACCCGAAGTAGCAACTCAGGTCATGGGATATGAGAACACTCAAGACCTTTGGGCAGCTGTGCAAGAGCTGTTTGGAGTTCAGTCGCGTGTAGAGGAGGACTACCT
CCGAAGTCCGGTGTCCACTCGATCCTTGGTATCGCAAGTTCTATTGGGCTTAGATGAAGAATATAACCCTGTGGTTGCAATGGTGCATGGTAAAATGAGCATAACCTGCA
ACAGTATCTCTGTGAACATGGTGAACAACAAAGATGCTGGAAATCAAAGGGGACAACAAAACTACTCAAATAATCGTCAAAACTATTCTGGTAGAGGAAATAAAAAAGGA
GGAGGAGGTAATCGTGGTCGTGGAAGAGGTCGGAGCTACGGATCCTACACCAACAATAACAAACCAATCTGTCAGGTATGTGGGAAGGTAGACCATACTGCTCTCATGTG
CTATCAGCGTTTTAATAAGGAATTTTCTGGTCCCTCCCAAGTTCAAAACAGGAATGATGGAAATACCAACTGTCCAAATGCACAGACACAACTTACTGCCTTTATAGCTA
ATCAAGGTTCAAATCAGTTTGCCACACCCGAGTCTGTCATCGATCCTAATTGGTATGCAGACAGTGGAGCTTCAAATCATGTGACCAGTGACTTCAGTTGCCTAGCCAAT
CCCACAGACTATAAAGTTAGTGTCTCCAAATTAGCTCGTGACAATAATGTTTATGTTGAATTTCACGATGGTTTCTGTCTTGTTAAGGCCAAACATACGGGCAAAATACT
ACTGAGAGGAACACTTAATGAAGGGCTATACAAGTTTGAACTTGTGAAAGCTACCTCATTTGATGCTACACAAACAGCCAACCAAGAAGGTTCTGGAGTCAATAAAGCTG
TCTCCTCTGGTTTTGTTGGTTTGAGTAATGTTAACATGGTTGTGTCCAAAGTTTTTTTGGCATAG
Protein sequence
MSPEVATQVMGYENTQDLWAAVQELFGVQSRVEEDYLRSPVSTRSLVSQVLLGLDEEYNPVVAMVHGKMSITCNSISVNMVNNKDAGNQRGQQNYSNNRQNYSGRGNKKG
GGGNRGRGRGRSYGSYTNNNKPICQVCGKVDHTALMCYQRFNKEFSGPSQVQNRNDGNTNCPNAQTQLTAFIANQGSNQFATPESVIDPNWYADSGASNHVTSDFSCLAN
PTDYKVSVSKLARDNNVYVEFHDGFCLVKAKHTGKILLRGTLNEGLYKFELVKATSFDATQTANQEGSGVNKAVSSGFVGLSNVNMVVSKVFLA
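
The CDS and protein records above can be cross-checked by translating the CDS and comparing the result with the listed protein. The following is a minimal sketch, assuming the standard nuclear genetic code (the page does not state which translation table was used). `translate` is a hypothetical helper, and only the first 30 bases of the CDS are pasted in for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed on this page.
cds_prefix = "ATGTCACCCGAAGTAGCAACTCAGGTCATG"
print(translate(cds_prefix))
```

Passing the full CDS text, with or without its line breaks, and comparing the output with the protein sequence is a quick way to detect truncated or frame-shifted records.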