; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008116 (gene) of Snake gourd v1 genome

Gene IDTan0008116
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionmyocardial zonula adherens protein-like
Genome locationLG02:92772130..92773832
RNA-Seq ExpressionTan0008116
SyntenyTan0008116
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585310.1 hypothetical protein SDJN03_18043, partial [Cucurbita argyrosperma subsp. sororia]1.9e-6165.49Show/hide
Query:  LPQEIDNYIKESIDKTLGLPTSTQTLELKLATS-------EAQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI-
        +PQE+D YIKESID TLGLP STQTL LKLA S       EAQCLSLQSKL QHI STK+A+ EA +SAQGLKK IE+NE L +ENS L D  K M+ I 
Subjt:  LPQEIDNYIKESIDKTLGLPTSTQTLELKLATS-------EAQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI-

Query:  -----ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKAL
             A+  A++AE R R+L DEA KLTEDLNFFKE++QMRKF F+SMEE+LVE  L DL+ G+TG +  +FL AN+ N+SCRRLLHLSQSLRPST +AL
Subjt:  -----ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKAL

Query:  SLAAKIKTLEKDNKQLQILLAKAQGE
        SLAA+IKTLE++N+QLQI L+ AQGE
Subjt:  SLAAKIKTLEKDNKQLQILLAKAQGE

KAG7020228.1 hypothetical protein SDJN02_16911, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-6066.22Show/hide
Query:  IDNYIKESIDKTLGLPTSTQTLELKLATS-------EAQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI-----
        +D YIKESID TLGLP STQTL LKLA S       EAQCLSLQSKL QHI STK+A+ EA +SAQGLKK IE+NE L +ENS L D  K M+ I     
Subjt:  IDNYIKESIDKTLGLPTSTQTLELKLATS-------EAQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI-----

Query:  -ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAA
         A+  A++AE R R+L DEA KLTEDLNFFKE++QMRKF F+SMEE+LVE AL DL+ G+TGS+  +FL AN+ N+SCRRLLHLSQSLRPST +ALSLAA
Subjt:  -ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAA

Query:  KIKTLEKDNKQLQILLAKAQGE
        +IKTLE++N+QLQI L+ AQGE
Subjt:  KIKTLEKDNKQLQILLAKAQGE

XP_022131622.1 uncharacterized protein LOC111004753 [Momordica charantia]9.6e-5057.89Show/hide
Query:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEA-------QCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES-
        ++PQEID+YIKESI  +LGLP S  TLELKL  SEA       QCLSLQSKL   IL+ +S+K EAKLSAQ LKK IEKN +L EENS L   CK +++ 
Subjt:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEA-------QCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES-

Query:  -----IALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIGETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKAL
             +A+D A +   RVRELE E  +LT+DL FF+ ++Q RKF  SSMEE LV+  LA  IGET + AAEFL  N    SCRRLL+LS SL P+T KAL
Subjt:  -----IALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIGETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKAL

Query:  SLAAKIKTLEKDNKQLQILLAKAQGEAK
        SLAAKIK LEK+ ++LQ+LL  AQ E K
Subjt:  SLAAKIKTLEKDNKQLQILLAKAQGEAK

XP_023538594.1 uncharacterized protein LOC111799313 [Cucurbita pepo subsp. pepo]6.2e-4165Show/hide
Query:  LSAQGLKKMIEKNEELIEENSLLVDHCKIMESI------ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETG
        +SAQGLKK IE+NE L +ENS L D  K M+ I      A+ +A++AE R R+L +EA KLTEDLNFFKE+HQMRKF F+SMEE+LVE AL DL+ G+TG
Subjt:  LSAQGLKKMIEKNEELIEENSLLVDHCKIMESI------ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETG

Query:  SSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGE
         +  +FL AN+ NESCRRLLHLSQSLRPST +ALSLAA+IKTLE++N+QLQI L+ AQGE
Subjt:  SSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGE

XP_038885825.1 uncharacterized protein LOC120076121 [Benincasa hispida]9.3e-4561.08Show/hide
Query:  EEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES------IALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIG
        EEAKL+AQGLKK IEKN++L EENS LV+ CK M++      +ALD A++ + RV ELEDE ++L  DL FFK++HQM KF F+SMEE+LVE AL+ LIG
Subjt:  EEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES------IALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIG

Query:  ETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAK--TDFCSYM----KTEDHGN
        E  SSA+EFL ANIEN+SCRRLLHLS+SL P + + L+LAAKIK+LEKDN+QLQ+LL  AQGEAK  +D   Y+     T D GN
Subjt:  ETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAK--TDFCSYM----KTEDHGN

TrEMBL top hitse value%identityAlignment
A0A1Q3CT80 Uncharacterized protein3.0e-3342.05Show/hide
Query:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEA-------QCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIME-
        M LPQEID+YIKESID++LGLP STQTL+LKL  +E        Q   L SKLKQ   +   A+ EA ++A  ++K +E+N++L  E + L+  C   E 
Subjt:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEA-------QCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIME-

Query:  ----------------SIALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMR-----KFWFSSMEELLVEMALADLIG-ETGSSAAEFLSANIENESC
                        + A + A++AE RV++LE+E  KL EDL F+K  ++ R         ++ EE L+E  LA LIG E  +S   FL AN  NESC
Subjt:  ----------------SIALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMR-----KFWFSSMEELLVEMALADLIG-ETGSSAAEFLSANIENESC

Query:  RRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIR
        ++LL +  SLRPST KALSLAA++KTLEKD + L++ L +A+ E K  F      ++   R +R
Subjt:  RRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIR

A0A2N9IYL3 Uncharacterized protein1.8e-3341.67Show/hide
Query:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIME-
        M LPQEID+YIK+SID +LGLP STQTLELKL+ S+       ++CLSLQ KL++        + E+ ++AQ LKK +E+N++L  E + L+  C   E 
Subjt:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIME-

Query:  ----------------SIALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSS----MEELLVEMALADLIGETG-SSAAEFLSANIENESCR
                        + A + A++AE RV ELE+E ++L+E+L F+K E +MR    S+    +E  L+E  L  LI +   +SA  FL AN  +ESC+
Subjt:  ----------------SIALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSS----MEELLVEMALADLIGETG-SSAAEFLSANIENESCR

Query:  RLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIRE
        RLL +   LRPST K LSLAA++KTLEKD   L+I L +A+ E K  F      ++   R +R+
Subjt:  RLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIRE

A0A6J1BRI9 uncharacterized protein LOC1110047534.6e-5057.89Show/hide
Query:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEA-------QCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES-
        ++PQEID+YIKESI  +LGLP S  TLELKL  SEA       QCLSLQSKL   IL+ +S+K EAKLSAQ LKK IEKN +L EENS L   CK +++ 
Subjt:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEA-------QCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES-

Query:  -----IALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIGETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKAL
             +A+D A +   RVRELE E  +LT+DL FF+ ++Q RKF  SSMEE LV+  LA  IGET + AAEFL  N    SCRRLL+LS SL P+T KAL
Subjt:  -----IALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIGETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKAL

Query:  SLAAKIKTLEKDNKQLQILLAKAQGEAK
        SLAAKIK LEK+ ++LQ+LL  AQ E K
Subjt:  SLAAKIKTLEKDNKQLQILLAKAQGEAK

A0A6J1GIQ2 uncharacterized protein LOC1114545508.2e-3963.12Show/hide
Query:  LSAQGLKKMIEKNEELIEENSLLVDHCKIMESI------ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETG
        +SAQGLKK IE+NE L +ENS L D  K M+ I      A+   ++AE R R+L +EA KLTEDLNFFKE+HQMRKF F+SMEE+LVE AL DL+  +TG
Subjt:  LSAQGLKKMIEKNEELIEENSLLVDHCKIMESI------ALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLI-GETG

Query:  SSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGE
        S+  +FL AN+ N+SCRRLLHLSQSLRPST +AL LAA+IKTLE++N+QLQI L+ AQGE
Subjt:  SSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGE

A0A7N2KRL3 Uncharacterized protein3.0e-3343.56Show/hide
Query:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATS-EAQ------CLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIME-
        M LPQEID+YIK+SID +LGLP ST+TLELKL+ S EAQ      CLSLQ KL++        + EA ++AQ LKK +E+N +L  E + L+  C   E 
Subjt:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATS-EAQ------CLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIME-

Query:  ----------------SIALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSS----MEELLVEMALADLIGETG-SSAAEFLSANIENESCR
                        + A + A++AE RV ELE+E +KL+E+L F+K E +MR    S+    +E  L+E  L  LI +   SSA  FL AN  +ESC+
Subjt:  ----------------SIALDIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSS----MEELLVEMALADLIGETG-SSAAEFLSANIENESCR

Query:  RLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIRE
        RLL +   LRPST K LSLAA+ KTLE D + L+I LA+A+ E K  F      ++   R +R+
Subjt:  RLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G14680.1 unknown protein1.8e-2234.98Show/hide
Query:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES
        M L +E+D++IKE+ID +LGLP S   L+ KL T+E        Q LSL S+LK+        + EA ++AQ LKK +E+N++L  E   LV+ CK  E 
Subjt:  MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMES

Query:  IAL-----------------DIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIG--ETGSSAAEFLSANIENES-CRRL
                            + A++AE+RVRELE+E +K+++++          K    S E+ LV+  LA  +   E+ S    FL AN E +  C  L
Subjt:  IAL-----------------DIAQKAETRVRELEDEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIG--ETGSSAAEFLSANIENES-CRRL

Query:  LHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAK
        L     L+P+T K +SL + +K +EK+ + L + LAKA+ E +
Subjt:  LHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAK

AT4G09060.1 unknown protein8.7e-2533.33Show/hide
Query:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI
        +L QE++ YIK++ID +LGLP S ++L+ KL T+E        Q   L S+LK+        + EA ++AQ LKK +E+N++L  E   L+  CK +E  
Subjt:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI

Query:  AL-----------------DIAQKAETRVRELEDEAKKLTEDLNFFKEE--HQMRKFWFSSMEELLVEMALADLIGETGSSAAE-FLSANIENESCRRLL
         L                 + A++AE RVRELEDE  +++E++  FK +      +   + +EE L++  L  LI +  +     FL ANI ++SC+ LL
Subjt:  AL-----------------DIAQKAETRVRELEDEAKKLTEDLNFFKEE--HQMRKFWFSSMEELLVEMALADLIGETGSSAAE-FLSANIENESCRRLL

Query:  HLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIREMD---CSFDPRR
             L+PST K LSL +K K  EK+ + + + LAKA+ E +       K +    + +R+     CS D  R
Subjt:  HLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIREMD---CSFDPRR

AT4G09060.2 unknown protein8.1e-2330.77Show/hide
Query:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI
        +L QE++ YIK++ID +LGLP S ++L+ KL T+E        Q   L S+LK+        + EA ++AQ LKK +E+N++L  E   L+  CK +E  
Subjt:  SLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSE-------AQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESI

Query:  AL-----------------DIAQKAETRVRELEDEAKKLTEDLNFFKEE-------------HQMRKFWF---------------SSMEELLVEMALADL
         L                 + A++AE RVRELEDE  +++E++  FK +             +  +K +                + +EE L++  L  L
Subjt:  AL-----------------DIAQKAETRVRELEDEAKKLTEDLNFFKEE-------------HQMRKFWF---------------SSMEELLVEMALADL

Query:  IGETGSSAAE-FLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIREMD---CSFDPRR
        I +  +     FL ANI ++SC+ LL     L+PST K LSL +K K  EK+ + + + LAKA+ E +       K +    + +R+     CS D  R
Subjt:  IGETGSSAAE-FLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFCSYMKTEDHGNRRIREMD---CSFDPRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTTCCTCAAGAAATCGACAATTACATCAAAGAATCAATCGACAAAACTCTGGGCCTCCCCACCTCCACTCAAACATTGGAACTGAAACTCGCCACCTCTGAAGC
CCAATGCCTCTCTTTGCAGTCTAAATTAAAGCAACACATCCTCTCTACCAAAAGTGCCAAGGAGGAAGCGAAGTTGAGTGCGCAGGGCTTGAAGAAGATGATCGAGAAAA
ACGAAGAATTGATTGAAGAGAATTCGCTCTTGGTGGACCATTGCAAGATAATGGAATCCATTGCGCTCGATATCGCCCAGAAAGCGGAAACTCGTGTGCGGGAGTTGGAA
GATGAGGCTAAGAAATTAACAGAGGATTTGAATTTCTTCAAGGAAGAACACCAAATGCGAAAGTTTTGGTTTTCTTCAATGGAGGAGCTTTTGGTTGAGATGGCTTTAGC
GGATTTGATTGGGGAAACAGGTTCGTCTGCTGCTGAGTTCTTGTCTGCAAATATAGAGAACGAATCCTGTCGAAGATTGCTGCATTTGTCGCAGAGCTTAAGACCTTCAA
CTCTCAAAGCTCTGTCTCTGGCTGCTAAAATAAAGACCCTTGAGAAAGACAATAAACAACTTCAGATCCTCCTCGCTAAGGCTCAAGGGGAGGCCAAAACTGACTTTTGC
AGTTACATGAAAACAGAAGACCATGGGAACCGTCGAATTCGGGAGATGGATTGTAGCTTTGATCCAAGAAGATATAACTTTAAAGTTTGTTCAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTTCCTCAAGAAATCGACAATTACATCAAAGAATCAATCGACAAAACTCTGGGCCTCCCCACCTCCACTCAAACATTGGAACTGAAACTCGCCACCTCTGAAGC
CCAATGCCTCTCTTTGCAGTCTAAATTAAAGCAACACATCCTCTCTACCAAAAGTGCCAAGGAGGAAGCGAAGTTGAGTGCGCAGGGCTTGAAGAAGATGATCGAGAAAA
ACGAAGAATTGATTGAAGAGAATTCGCTCTTGGTGGACCATTGCAAGATAATGGAATCCATTGCGCTCGATATCGCCCAGAAAGCGGAAACTCGTGTGCGGGAGTTGGAA
GATGAGGCTAAGAAATTAACAGAGGATTTGAATTTCTTCAAGGAAGAACACCAAATGCGAAAGTTTTGGTTTTCTTCAATGGAGGAGCTTTTGGTTGAGATGGCTTTAGC
GGATTTGATTGGGGAAACAGGTTCGTCTGCTGCTGAGTTCTTGTCTGCAAATATAGAGAACGAATCCTGTCGAAGATTGCTGCATTTGTCGCAGAGCTTAAGACCTTCAA
CTCTCAAAGCTCTGTCTCTGGCTGCTAAAATAAAGACCCTTGAGAAAGACAATAAACAACTTCAGATCCTCCTCGCTAAGGCTCAAGGGGAGGCCAAAACTGACTTTTGC
AGTTACATGAAAACAGAAGACCATGGGAACCGTCGAATTCGGGAGATGGATTGTAGCTTTGATCCAAGAAGATATAACTTTAAAGTTTGTTCAATTTAA
Protein sequenceShow/hide protein sequence
MSLPQEIDNYIKESIDKTLGLPTSTQTLELKLATSEAQCLSLQSKLKQHILSTKSAKEEAKLSAQGLKKMIEKNEELIEENSLLVDHCKIMESIALDIAQKAETRVRELE
DEAKKLTEDLNFFKEEHQMRKFWFSSMEELLVEMALADLIGETGSSAAEFLSANIENESCRRLLHLSQSLRPSTLKALSLAAKIKTLEKDNKQLQILLAKAQGEAKTDFC
SYMKTEDHGNRRIREMDCSFDPRRYNFKVCSI